5 STAR AI.IO

TOOLS

FOR YOUR BUSINESS

Our new site offers free AI tools and free guides to sites with 5-star artificial intelligence tools that will help you run your business quickly and efficiently and increase your sales.

5-STAR AI PRESENTS OUR FREE   

CHATAI TOOLS

OPEN TO THE PUBLIC

Hi! My name is Star.

What can I help you with?

Feel free to ask!

HELLO 

HELLO

I am Star, and I am your assistant, here to guide you and your business into the AI and IoT world.

So... finally, welcome to our new site:

The 5-Star AI & IO Tools for Your Business.

This is our new website, and it's all about the top and best AI & IoT tools on the net.

We provide you with the best artificial intelligence tools and services that can be used to create and improve your business websites, bots, and channels.

This site includes tools for creating interactive visuals, animations, 3D, and videos,

as well as tools for SEO, marketing, and web development.

It also includes tools for creating and editing text, images, and audio.

The website is intended to provide users with a comprehensive list of AI-based tools to help them develop and improve their businesses.

This website is a collection of artificial intelligence tools and services that can be used to create and improve websites.

It includes tools for creating interactive visuals, animations, and videos, as well as tools for SEO, marketing, and web development.

Hello

Hello

I am Star the AI, and I am your professional assistant on the 5-Star site, here to guide you and your business into the new world of artificial intelligence.

So... welcome to our new site:

5-star-rated artificial intelligence tools for your business.

The site covers all the new 5-star-rated tools on the net.

We provide you with the best artificial intelligence tools and services available today for creating and improving business websites and channels.

This site includes tools for creating interactive visuals, animations, 3D, and videos.

The site also presents AI tools for SEO, marketing, and web development.

The site also includes tools for creating and editing text, images, and audio.

The site is intended to provide users with a comprehensive list of AI-based tools to help them develop and improve their businesses, all free of charge.

This site is a collection of artificial intelligence tools and services that can be used to create and improve websites.


The site is intended to provide users with a comprehensive list of AI-based tools, including explanations, to help you develop and improve your digital business for free, professionally and efficiently.

HELLO & WELCOME TO THE

5 STAR AI.IO

TOOLS

FOR YOUR BUSINESS

LIP READ

Generate Your First Professional

AI TensorFlow Project & Take Your

Business to Another Level.

5 Star Free AI YouTube Shorts Title Generator

Feel free to use it!

How to Code a Machine Learning Lip Reading App with Python Tensorflow and Streamlit

Entire transcript (no timestamps). Code sketches of the main steps described here follow the transcript.

a couple of weeks ago I managed to release the most amazing machine learning model that I've ever had the chance to work on in my entire life it takes in a number of video frames passes it through to a machine learning model which is built in tensorflow and Python and is in effect able to go and perform that's right it's able to take a set of videos and transcribe what a person is saying now whilst I'm still waiting for my juicy defense contract we're going to take this a step further and build it out into a full stack application using streamlit python tensorflow and a whole bunch of other great python libraries now this can be extended to a whole bunch of use cases If eventually you wanted to go and replace the video feeds with a webcam if you wanted to go and deploy it on a edge device there's a whole range of possibilities plus this being an absolutely brilliant example of what is possible with machine learning great for Eurasia map anyway ready to do it let's get to it alrighty so the first thing that we are going to go on ahead and do is get our project open up inside of vs code because we're going to be doing all of the coding inside of vs code now if you haven't gone and checked out the original tutorial I'll include a link somewhere up there so you can go and check that out but all of the code is available on GitHub so if you want to go and pick this up you want to get the existing model checkpoints those are going to be available there so you can definitely go on ahead and grab those now what we're going going to go on ahead and do is jump into vs code so I'm going to open up the existing folder that I've got and just type in code Dot and you can see we've got a ton of stuff floating around up here let me quickly explain this model structure so over here we've got our data folder this was set up in the previous tutorial we've got our lips virtual environment and we've got our pre-trained model checkpoints those pre-trained model checkpoints are available inside of GitHub I've included the checkpoints for 50 epochs and I think 96 epochs 96 epochs Works absolutely brilliantly so you can go on ahead and pick those up we're going to be mainly doing our work inside of the app folder now I've got two existing files now these are going to look really familiar too if you saw the previous tutorial because really all we're doing here is we're instantiating our deep learning model which I went through how to build from scratch in the previous video and we are going to be loading up the weight so right down the bottom you can see that we are loading the existing weights from the 96 Epoch checkpoint so this will give us the ability ability to run this function here load model inside of our full stack application and bring that into our app we interrupt your regular programming to tell you the courses from it is officially a lot if you'd like to get up and running in machine learning deep learning and data science head on over to www.courses from Nick to find the latest and greatest I'm also going to be releasing a free python for data science course in the upcoming weeks so be sure to stay in the know but if you're ready to hit the ground running well I highly recommend you check out the full stack machine learning course this goes through seven different projects 20 hours of content all the way through full stack production ready machine learning projects head on over to www.courses from nick4 bundles forward slash full stack ML and use the discount code YouTube 50 to get 50 off back to our 
regular programming now the core advantage of having these existing checkpoints is we're taking a bit more of a full stack feel this means that we're picking up from the existing machine learning endpoint right it's almost as though you've gone and done the entire machine learning engineering bit you're now passing this off to a software engineer and it's good practice to get into a habit of learning how to take a deep learning model or take a machine learning model and bring it into a full stack environment and really get it out there into your users this is exactly what the focus of this particular video is going to be we're taking this deep learning trained deep learning model and we're going to be integrating it inside of a streamlit app now the other helper file that we've also got is this utils.pi file again this is coming directly from the existing tutorial we've got a couple of imports we are handling our vocab and we're defining a chart to num function and a num to char this effectively takes our characters that our machine learning model is going to spit out convert them to numbers the number to char model or the num2 chart function does the opposite it takes a number of tokens and converts them to cash is because in fact our lipnet model is actually outputting characters now this is one of the cool things about libnet in the fact that it is a character-based model this means that if we were to go and pass through additional videos in the future it would be able to go and learn how to decode those specific words even though it might not have necessarily seen that entire word before there was a question on the previous video about whether or not this is in fact the case yeah it is a character-based model which is what makes it so cool the other functions that we've got inside of here are load video load alignments and load data we're not going to use those specific sub functions we're going to be using load data which is going to return to us our pre-process video and our alignments now again inside of that bigger tutorial I actually went through what that actually looks like and the cool thing is if you actually go and take a look at the thumbnail from that particular video this actual function was used to go and generate those thumbnails which is kind of meta but anyway I thought that was kind of cool all right so we've got two helper functions so we've got utils.pi and we have model util.pi we're going to be using those inside of our full stack streamlit app and I'll make these two files available inside of the updated GitHub repo so you can go into that we're going to be working inside of the app folder all right let's actually do some coding now so I'm going to open up my uh we are opening up a command prompt so I'm going to send it into app and we are now inside of the app folder let's just see that can we zoom in let me zoom in so wow that's very zoomed in as you can see we can't let's zoom out a little so if we open bring that up you can see that we are inside of our main project folder and then we're inside of a subdirectory called app so if I actually go and type in LS you can see I should have let's bring this let's do that again so you can see that we've got two files in there right now model util.play and utils.play cool brilliant okay so let's actually get to building this app because right now we've just been messing around and taking a look at what our project looks like so we're going to create a new file and we're going to call this streamlit app dot pie 
beautiful and then what we're going to do is we are first I'm going to import a couple of libraries so first up we are going to import streamlit as St so this is going to last work with streamlit we're then going to import OS as that's just going to make it a whole bunch easier to work with our different file paths and we're also going to be using it to list out the files inside of our directory we're also going to import image IO and that is going to allow us to take a series of videos and convert it into a gif which just looks really really cool and it allows you to see what our machine learning model is actually going to be able to take in as input before making a prediction all right so we've got streamlit we've got OS and we have image i o we then need to import tensorflow as TF what do you reckon should I start doing some more tutorials on pytorch I've been doing quite a fair bit on tensorflow but I know that people like pytorch as well let me know in the comments below all right cool so we've got import tens flows TF we then need to bring in some stuff from model util and utils.pi so we are going to import uh actually we're going to from in from utils import we need load data and we need num to char beautiful and then from model util we are going to load up the load model function import load model alrighty cool so those are our six key Imports that we're going to need now come here want to know a secret are you looking for your next dream job in data science machine learning deep learning or just data in general or you need to join jobs from net each and every week I send you a curated list of the best jobs in data these are jobs with great perks great people and great roles plus you'll get access to exclusive content like amas interviews and resume reviews so why not join plus it's completely free link is in the description below what are you waiting for all right back to the video so you're probably thinking why are we using streamlit why are we using tensorflow why are we using each one of these libraries well extremely it just makes it ridiculously easy to go and build full stack applications and it's kind of designed for machine learning Engineers data scientists to be able to take their production eyes or their trained machine learning models and bring it into a full stack environment that's why we're using streamlit image i o it really is going to be about that gif so if you don't want to go and create or convert your video to a give to be able to visualize it you don't really need that but I personally think it just makes things look a whole lot nicer to tensorflow actually gives us our deep learning capability if you've got a GPU on your machine it's going to run way faster but that being said you don't necessarily need a GPU although highly recommend it anyway let's get back to it so now that we've got our Imports done so these are all our Imports so import all of the dependencies let's save that first things first let's go and try to kick off our applications we haven't really done much inside of there so far so I'm just going to open up a terminal so on vs code I typically hit control and then tilde which gives me this terminal ability right now I'm running it inside of Powershell but you can run it inside of a command prompt inside of a Mac OS machine you're probably going to be doing this inside of your terminal so what are we going to do we're going to start up our app so if we type in streamlit and then we are going to type in streamlit run let's make this a little 
bit smaller so we can see it streamlit run and we're going to run streamlit app.pi so keep in mind we are inside of the app folder so hence why we're able to do this all right so if we go and run this this should open up our streamlined app inside of a browser but we're not going to see anything but let's go in ahead and take a look that's all looking promising so this is our streamlit app blank screen doesn't matter it's going to work we are going to build this okay so that is the beanings of our app doesn't look like we've got any errors yet so we are in a good State let's bring this over here okay so now that we've got that done we should probably start adding a little bit of some uh structure to it so let's go ahead and do that let's first up add a sidebar so we can type in with st dot sidebar and then we're going to add St dot image and I'm just going to grab an image that I personally like hold up a few moments later so I've got this link to this image it just looks kind of cool it's got a data and AI feel to it so if I go and paste that in that's sort of what it looks like it shows massive throughput going through a GPU or a bunch of hands but anyway you've got to get the idea so we're going to type in with st dot sidebar this is going to give us sidebar capability inside of our streamlit app we are then going to add a title so we can do that using St dot title and we might call this lip buddy oh wow my typing is a shocker today and then we can add St dot info so that's going to give us the ability to add some information about our application so we might say um this application is originally developed from the lip net deep learning my head's covering that now deep learning model cool so if we go and save that let's just make these single quotes got a bit of a bad habit of using both what do you guys do let's get rid of that beautiful okay cool let's uh go and refresh our app now we can hit r let's open up our sidebar boom take a look at that the beginnings of our applications we've now got the image so that corresponds to this line here St dot image we've got the title which you can see is lib buddy that corresponds to that and we've also got the info box which corresponds to that bit down there so that is the beginning structure of our application now up and running so the next thing that we're going to go on ahead and do is start laying out our app and giving it a little bit of structure so we're going to set up two separate columns so one column will be able to display the originating video the other one is going to go through all of our machine learning steps so first up we're going to create a little bit of a gif which decodes what we're Transforming Our original video to post processing we're then going to have the raw predictions and then we're going to have the decoder predictions which take the individual number tokens and convert this into an actual set of words before we do that we actually need to be able to go and get an existing set of videos that we're going to have selected we're going to use St but select box to go ahead and do this so let's go ahead and start setting out our options now if you cast your mind back to the original video inside of the data folder we had all of our videos inside of this S1 folder now there's one thing that I noticed which was a little bit of a pain when it came to streamline in that it wasn't able to play these dot MPG files because I think they're an older codec we're going to solve this using ffmpeg so I'm going to show you how to 
do that as well okay let's go ahead and write a little more code so so far I've done our dependencies we've set up the sidebar set up the sidebar what we're now going to go ahead and do is get a list of drop downs for all of our potential options so here is where OS is going to play actually we're going to do one additional thing we're going to set the layout through the streamlit app as wide so to do that we can type in St dot set page config boom and then we can type in layout equals to wide so this is going to give us the ability to have our page set up to White I'll include a little bit of documentation over there so you can take a look at what that means that is our page config now done now let's jump back over to these options so get so what we're really doing is we're generating a list of options or videos so in the future if we wanted to go and sub this out for a webcam I'd imagine that this is where I'd be plugging in that block of code let me know if we want to extend this app out to that okay so what we're going to do is first up we're going to get all of our potential options so let's go and type in options and we're going to say options is equal to os.path.join and remember our options are going to be our videos so our videos are going to be inside of a folder called so we need to back out of our app folder we're then going to go into our data folder and then we're going to go in a folder called S1 now ys1 well the original lipnet data set had a ton of different speakers I'll include a display of what that looks like there are a ton of speakers S1 is Speaker one but there were a whole bunch of others which were included in the original data set that was built for the lipnet architecture we're just using one speaker okay so that is going to give us the file path to that folder but if we type in OS Dot dot uh Listia that's going to give us all of the different options so if we go and type in print options let's just go and let's open this up and let's go and refresh our app I'll be printing out uh what's Happening Here can only be called set page config can only be called once per app so we need to go and restart our apps let's do that shut this down and then restart good work okay boom that looks promising all things holding equal we should be able to print out full of do we have an error there what's happening St dot pagecon I think we need to bring this up this should actually be further up let's bring this back up here beautiful so this should be above the St dot sidebar that's my bad let's go and refresh that beautiful all right and you can see here that we're printing out all of the different video names right so we've now got a set of video names that we can use as a drop down inside of our app so to do that we can then go and type in um we're going to save this as a variable so we're going to say selected video is equal to St dot select box we're then going to pass through an option or we're going to pass for a title actually we're going to say choose video and then we're going to pass through our options to that so if we go and save that now we'll bring up that app hit refresh boom so we've now got our sidebar we've got this ability to go in ahead and choose our video so later on when we go and select this video first I'm going to render it so you'll actually be able to play it and then we'll go through all of that deep learning process which is going to be great okay where are we at so we've now gone and set up at the ability to go and choose a video so the next thing 
that we want to go on ahead and do is start setting up that layout because we don't have that set up right now so we can first up double check that we've got an option selected so we can say if options and then we're going to create two columns so let's create should we create outside options now let's just do whatever here so we're going to say call one or call One Call two is equal to St dot columns and then we're going to pass through two so I'll include a little bit of documentation up on the screen right about now so you can see what columns does but really this is going to generate two columns and then we're going to say with call one we're going to do something and then with call 2 we're going to do something beautiful so let's just have some text for now so if we take in St dot text and we're going to say this is column one and then let's paste that over here and then we're going to say this column two so let's get rid of these past statements and let's open up our app backup so let's refresh all right so you can see that we've now got column one we've now got column two so we've now gone and set up our layout so just to recap so we've now got our sidebar we've now got the ability to go on ahead and choose our video and we've now got column one over here and if we scroll on over we've got column two so that is the base layout of our app now ready now the next thing that we're going to need to go on ahead and do is actually get a video set up so that we can play it inside a streamlit I sort of alluded to this a little bit previously in that streamlit doesn't like the dot MPG file format which is perfectly okay we can use an extension or a helper Library called ffmpeg which allows us to go and convert this dot MPG file into MP4 so we're going to do that we're then going to read in our video and then we're going to display it inside of our application so that we are able to go and play it inside of our streamlit app we're going to be using the St dot video function so to streamlit dot video display helper function which actually allows us to go and render the video inside of our application now in order to get our video up and running we are going to first up need to grab that video and we need the full file path so right now this is just going to give us the specific file name we need the entire file path so for now let's get rid are we happy with this St dot now we don't want this column um let's actually go and do our magic to get our files so first up what we need to do is we need to get the full file path so we're going to say file path is equal to OS and the reason that we need this file path is because right now if we go and choose one of these options the value selected file over here or selected video is really only going to hold this so bbafn dot MPG it's not going to hold the entire file path to that particular file we need the full file path to be able to go and do this conversion using ffmpeg so let's go on ahead and do that we're just going to effectively append this together so os.path.join we are going to then say we want to go back out of this folder we're then going to go into Data we're then going to go into S1 we are then going to go and pass through our selected video so that should give us the full file path then what we're going to do is we're going to do a little bit of system magic using ffmpeg so ffmpeg it does a ton of uh video processing so if you've ever used youtube.dlp to get some YouTube videos completely legally ladies and gentlemen we got them ffmpeg 
is actually behind that so we are going to be converting this so we are going to say os.system and this is going to allow us to run a command line call so we are then going to run the full command line call which is going to be ffmpeg we are then going to pass through Dash I which will will allow us to pass through your file path and we are going to pass through a string or pass through our variable there so inside the squiggly brackets I've gone and passed through the variable file path so the full line is OS dot system using F we're able to go on ahead and use some string formatting we're then passing through or calling ffmpeg Dash I we're passing through this variable into our string so if we closed it that would we'd get rid of those errors but now we need to say what we want to go and convert this dot MPG file to we are going to convert it into a mp4 file format so to do that we can pass you dash V codec uh and it's going to be lib x264 I think something like that I think that's right and then we're going to Output this file to a file called test video dot MP4 and then we need to pass through Dash yes to say that we want to do that conversion and overwrite the lib executed all right so let's take a look at that full line let me zoom out a little bit so we've now gone and written file path os.path.join we're grabbing we're jumping out of our existing folder going into Data going into S1 and grabbing our selected video so this will give us the full file path to our selected video we're then running os.system and then we're running our FFM Peg conversion to take our DOT MPG file and convert it to MP4 I know it's a little bit of a effort to go and get to this stage but hopefully that should make our life easier when it comes to rendering our video so let's actually go and test this out now so if we go and are we still printing out all the file paths no okay that's much better we don't want to go on ahead and do that again so let's uh are we still running it up let's scroll on down looks like we are let's just shut down our app and restart cool so that looks like it's running the conversion already so that looks promising right so take a look we've now got our test video.mp4 so that looks let me put my headphones on so I can hear this I cannot hear anything maybe you don't get audio out of out of whatever I'm playing it in so let's go into the main folder which will be inside of here pin blue F2 now cool so successfully gone and converted our video and these are the original videos from the lipnet model I'll include a link in the description if you don't want to go and take a look at the original entire data set okay but that looks promising so we've now successfully gone and converted it what we now want to do is render it inside of our application so we're now going to do some rendering inside of the app it's pretty straightforward it's just three lines of code to go on ahead and do this so the first line is let's grab our video so we can type in video equals open and then we're going to be opening up test video.mp4 and we're going to be reading it as a binary so we're going to pass through RB so video equals open passing through test underscore video dot MP4 and RB to read it as binary we're then going to read it so video bytes is equal to video dot read and then we want to render it so we're going to use St dot video and pass through video bytes and this should give us the ability to see a video inside of our app so let's go and rerun our app let's close these now boom boom boom boom 
boom take a look at that that is our video now rendering inside of our application so we've got to play it pin blue F2 now all right let's add a little bit of info but take a look so pimp blue F2 now now the determining characteristic as to whether or not this is working is if we go and choose a different video are we going to be able to see that rendering so if we go and choose a different one let's choose this one so bbal 8p means I think binblue at L8 please bimb blue L8 please there you go all right cool we're looking positive so it looks like it's successfully converting each one of these videos let's add a little bit of information to our app so just above options let's add a title we're going to say St dot title and we're going to say lipnet full stack app boom beautiful and then above our video let's add in an info box so we can type in St dot info this is con uh display the video below displays the converted video in MP4 format save that all right let's go and rerun beautiful all right so we've now got our title and we've now got our info box so if we go and change our video I've got that YouTube speed player hence why it's playing so quick so we can drop that down to been blue at L7 soon beautiful all right cool that is our video now rendering so the video is now successfully rendering but what we actually need to do is we need to take this video and pre-process it before passing it through to our lipnet app itself now luckily we've got that utils file which is going to allow us to load in data simply by passing through a file path this is going to return back the pre-process video which is actually going and isolating the mouth within the original video so you're actually going to see the isolated component of that we're also going to get the associated annotations for that video so if we wanted to we could actually go and compare now we've got the video itself so we can actually see what was being spoken and determine whether or not our lipnet model is actually performing well or not now I'm going to add in a little bit of flare here and we're actually going to take that video and output it as a gif which just looks really cool if you wanted to go and embed this inside a markdown or share the results or share the pre-processing of this particular video before it goes to the lipnap model this is exactly what we are going to be able to see d right now so on to loading some data it is so let's clean this up let's just double check how we're doing so far so we've gone and brought in our dependencies gone and set up our layout set up a sidebar set up a title we've got our options and we've got our column one which is all about rendering the video and then the next thing that we are going to want to go on ahead and do is there's going to be three parts to this to the second column right so let's just add in a couple of info boxes so if I type in St dot info it's one of my favorite streamlit functions it just looks really cool because it helps tell your user what on Earth they're looking at I find it so often that people are building different applications and nobody really has an idea as to how they work adding a little bit of info just adds that extra little bit of flair and tells your user that you care about them I just think it goes that little step further anyway all right estee.info so first up we are going to display um this is the pre-process gif which will actually we can actually this is all the machine learning model sees when making a prediction because that it's true 
right like when when we actually go and run this pre-processing that is all our machine learning model is going to be seeing so their second info box is going to display the predictions this is the output of the machine learning model as tokens so our machine learning model actually returns a set of tokens which goes through a decoder called CTC it's called connectionist temporal classification there's a brilliant blog post that I found about it which actually explains how this works it's also used quite a fair bit for automatic speech recognition which I think is kind of cool so that is what we are going to be returning back and then the last thing that we're going to do is we're going to run St dot info and we are going to decode the raw tokens into um into words and this is where numb to char is going to work in so num2 chart is going to be helping us there okay first things first what we want to go on ahead and do is pre-process our data I've just kicked to my table all right so let's go ahead and do this the first thing we need to do is load data so we're going to get back the video and we're going to get back annotations back and no notations and to that we need a part or run the low data function remember we run imported load data right up here so we're going to be using that now to the full line is video comma annotations equals load underscore data and then to that we are going to be passing through the full file path which we just defined up here so we're going to take that file path we are going to pass it through to here but before we do that we need to convert this into a tensorflow tensor because that is the expected file format that this particular function is expecting to do that relatively easy we can type in TF dot convert underscore 2 underscore tensor and if we close that this will actually return back our video and our annotations now we're not going to stop that oh no what we're now going to do is we're going to take this video and we're going to convert it into a gif and then we're going to render the GIF inside of our streamlit app I know so what we're now going to do is to use image i o so image IO Dot mimsave to that we are going to specify what our output GIF is going to be called so it's going to be called animation dot gif and then to that we'll go in and pass through our video and the number of frames per second that we want our app would give to be so three lines out so we've got our St info which is going to say this is all the machine learning model sees when making a prediction we're then using the load data function and we this should be convert 210 sub we're passing through our file path which we defined up here from that we're going to get video and annotations back or the pre-processed video back and as well as the annotations we don't really need the annotations but our low data function brings them back anyway and then we're using imageio dot mimsafe to actually go and convert this video into a gif so imageio.memsave we pass through the name of the GIF that we want to Output we pass through the video and we pass through how many frames per second we want our GIF to be then what we can do is we can actually go and render this inside of our app we can use St dot image and then we're going to pass through animation.give because it's actually going to be actually before I do this let me just show you this working so if we go and save this let's go is our app still running app looks like it's still running let's go and refresh what should happen if this 
runs successfully is that when we go and run this we're going to get a file called animation.gif inside of our app file so let's go and refresh this just by hitting r so that looks promising right so it doesn't look like we've got any errors if we go back to our app take a look we've got animation.gif and so the cool thing is that this GIF is all the machine learning model sees when it makes a prediction how cool is that no audios passed that is all that's being used to go and do that lip reading which is why I find this absolutely phenomenal we just need one more line to render this inside of our application now so we can go and pass through St dot image and pass through animation.gif so if we go and refresh we should be able to see this take a look that's our gift there it is a little bit small so we can make that a little bit bigger if we go back into St dot image and pass through width equals 400 save that go and refresh boom much better how cool is that so we've now got our video playing we've now got our pre-processed gift playing what we now need to do is actually going ahead and start making some Productions right so predictions what we first are going to need to do is use the load model function that we imported from Models util to load up our sequential model which we originally authored inside of that previous tutorial inside of a Jupiter notebook so we're going to load that up that is also going to load the pre-trained weights which effectively means that we've got a trained deep learning model to be able to go on ahead and use inside of our application what we're then going to do is we're going to take the video that we just loaded using the load data function we're going to pass that through to our model using the model.predict method and then we're going to take the predictions and pass it through the CTC decoder which comes from tensorflow Keras this is going to help take any duplicate predictions and condense them down this is the beautiful thing about the CTC decoder it kind of does all of this tricky coding for us again that blog post that I referenced previously makes your life a ton easier when it comes to understanding connectionist temporal classification highly recommend you check that out anyway let's start making some predictions so what we need to do now is load up our models so that's exactly what we're going to do now keep in mind that we already brought in this load model function from model util now again that file is going to be available on GitHub so you can pick that up you don't need to go on ahead and write it yourself I just wanted to make your life a little bit easier so we're going to load in our model let's go ahead and do this so we're going to say model equals load model right and this is going to return our tensorflow Keras model inside of this variable model which means we're going to have all of the amazing things that Keras and tensorflow gives us we're going to be able to namely use the dot predict method to go and make our predictions so model equals load model we then want to make some predictions so we're going to say y hat is equal to model.predict and then we're going to take this video and we're going to pass it through to the model.predict method but keep in mind when our model dot or when we use model.predict it's expecting a batch of inputs we only have one input or one example that we're going to be passing through to our model so we need to go and wrap it inside of another set of arrays so in order to do this relatively easily we 
can just run TF Dot expand dims Pastor our video and then pass through axis equals zero and then we need to close that so that's going to return our predictions what we then need to do is we need to go and run this through the Keras CTC decoder I'll include some information about the CTC decoder up there it isn't exactly a hugely well documented feature inside of Keras but there is a little bit of information which I managed to find so I'll uh give you a bit of that info right about now over here yeah cool all right cool so now what we're going to go ahead and do is we are going to take this model or take these set of predictions and we are going to run it through that decoder so we can then go and take y hat and we're going to pass it through to this so we're going to set TF or we're going to set a variable called decoder.tf dot Keras dot back end dot CTC decoder let me zoom out a little bit and we're going to pass through the predictions to that we then need to pass through the length of those predictions the length is 75 we established this when we created the original tutorial we're then going to specify what type of algorithm we want to use when it comes to decoding so we are going to use a greedy algorithm which means we're going to take the most probable prediction when it comes to generating our outputs we're going to say greedy is equal to true now when you get this prediction back it's nested inside of a bunch of arrays so we're going to grab the first value and the first value again so this should give us what we actually need so that should effectively be our decoded output if we go and just print this out we should be in a good position so let's quickly recap so we've gone and loaded our model we've gone and made some predictions we are then running it through that CTC decoder so if we run SC or we're then going to Output it as st.txt and we'll pass through decoder save that let's go and refresh this now so if this works we should get a prediction in this little space here so let's refresh oh we have got an error CTC decoder module Keras API Keras backend has no attribute CTC decoder uh CTC is it one word a ctcd code not CTC decoder okay my bad save that let's go and refresh now foreign you can see there that those are our outputs and so this is sort of what I mean so we're going to get be getting back our tokens from our deep learning model this isn't necessarily a set of words as of yet so what we actually need to do is we need to go and convert this into a set of words actually before I do that let me show you what this would look like if I output it before running it through the decoder so if I type in st.txt and just pass through y hat you'll see the before and after so we're going to up with the roll predictions and then the set of predictions that we get after running it through the decoder so let's refresh this let me zoom out and you should see that we get a bunch of duplicates uh so this is just giving us the raw probabilities back um if we ran through uh tf.org Max and we said access equals one let's see what we get back there foreign there you go all right so take a look so you're getting back a bunch of duplicates right so you can see we've got our output 62 14 6 19 19 10 23 19. 
so you can see that we're getting all of these duplicates additionally when we go and run it through the CTC decoder there is a special algorithm which goes and decodes this to get us better sets of predictions which is exactly the reason that we're going ahead and use this so you can see that's the raw output if we were just to go and run it through an ARG Max function this is what we actually get back once we run it through that CTC decoder so we are not going to use this we are going to use the CTC decoder output so if we save that that is our decoder now we can also just grab the raw string by typing dot numpy rather than outputting the entire tensorflow value so if we go and refresh that boom much better so you can say that that is the raw output out of our machine learning model but we're not going to stop there oh no no no what we're now going to do is we're now going to take these set of tokens and run it through the num to char function which we previously imported from the utils library that we went and created and this is going to take that raw set of numbers and convert it into a set of words at the same time what we're going to do is run it through a function called tf.strings dot reduce join which is going to concatenate it together into a single sentence let's go wrap this up home stretch now so what we now need to do is we now need to take this decoder and or this decoder output and we are going to go and decode those raw tokens into words so first things first we're going to go num to char and pass it I'm not numb to chat all right enough Chit Chat Empire sit through that decoder output beautiful so if we just go and output that so if I run St dot text and pass that through so just to quickly recap so we're taking this decoder output we're passing it through to the num to char function which comes from up there so if we go and save our app and let's go and refresh you'll see we got a bunch of words down here A bunch of letters right so right now we're just getting all of these letters and we've got a bunch of blank spaces so that's really difficult to read so what we can actually do is we can condense this down into something which looks a little bit more sensical so let's go and do that so we can go and grab let's just do it on another line um we'll say converted prediction and we are going to say so numb to Cha we're effectively just grabbing this right now so setting that to that and then what we're going to do is we'll run tf.strings dot reduce join pass that through and then we're going to take this converted prediction and pass it through our St dot text method so this is coming from streamlit it's just a streamlit uh text element but we can get rid of that comment down here let's actually add it up here so convert prediction to text beautiful all right so let's go and refresh all things holding equal should see magic down here take a look at that binblue at L7 soon okay so right now this is still inside of a tensorflow tensor we can actually we can leave it like this or we can actually go and clean it up so if we wanted to clean it up we can just do uh over here we can type in Dot numpy let me zoom out so you can see this Dot numpy and then dot d code and then pass through utf-8 boom and then if we go and refresh we're gonna test this out take a look so that is our raw prediction so bin blew at L7 soon so let's go and play this and see how we're actually doing so if we go and select a video let's go and choose one right down here so LG we're going to choose this one 
you can say that this is our model doing our video conversion oh god that's way too fast lay green with G5 again lay grain with G5 again and take a look that is what our model is predicted lay green with G5 again what if we chose another one um let's choose way further down uh what about this praj1s please swear to J1 soon based red at J1 soon how amazing is that so we're able to go on ahead take a video pass it through this entire pipeline which has quite a fair few steps but it's able to make a prediction now keep in mind that this isn't using the audio it's just using the raw video to be able to go and decode what is being said so crazy anyway let's go and grab another one what about this one so red 01 again set red at 01 again he's got a bit of an English accent we'll we'll give him that what about uh this one blue F2 now in blue at f2 now absolutely amazing let's go all the way down what's the last video a thumbs DB that's incorrect we don't need that one uh swv9a set right with v9 again set why all right so in this particular case not perfect set white with SP S9 again not too bad so maybe we could do some additional training there and this sort of goes to show that we're not faking this that this is a genuine set of predictions please wait by Q7 again Place White by Q7 again is that what I said play Sprite by Q7 again how cool is that guys that is the complete application now done now again all of this code is going to be available on GitHub so you can go on ahead and check it out and that is the application now built so we've gone through quite a fair few steps we set up our streamlit framework we then loaded in our video and we built through some of those challenges that we encountered right so streamlit can't play The Dot MPG file formats we converted it using ffmpeg we then set up our layout so that we're able to go and visualize our videos as well as take a look at each one of the steps that our machine learning model is going through we took a look at the pre-processed gift we then took a look at those output tokens and then we took a look at what it happened or what happened once we went and converted those raw tokens into a set of words and that gives us our final application let me know what you thought of this tutorial hopefully you enjoyed this I know that we put up the poll determining whether or not we wanted this as a code that episode or rather as a raw tutorial but I wanted to give this a little bit more Flair even though we won't necessarily do encode that hopefully you enjoyed it I will catch you you in the next one now as I mentioned all of this code is going to be available by GitHub so if you jump on over to knick knock knock and go to the lipnet repository I'm going to make sure that the entire application is uploaded inside of there so you can just run a quick git clone and grab that yourself let me know if you end up putting this on your resume or inside of your portfolio projects I'd love to hear all about it anyway thanks so much for tuning in thanks so much for tuning in guys hopefully you've enjoyed this video if you have be sure to give it a big thumbs up hit subscribe and tick that Bell and all that other good stuff and let me know what you thought of this video do you think we should take it a little bit further do you think we should make some amendments where would you like to see this go anyway in the meantime I am working on a ton of amazing new sets of tutorials namely around Transformers plus the math for ML course is underway and that'll all be 
released on YouTube so you'll get a chance to get up to speed hopefully you're also enjoying the shorts I've been putting in a ton more effort into those to be able to share a little bit or a couple of nuggets of knowledge that I'm gaining as I'm going through this process and on The Learning Journey when it comes to picking up machine learning data science and deep learning hopefully you're enjoying it anyway I will catch you in the next one peace
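To make the walkthrough above easier to follow, the next three sketches reconstruct the helper files and the Streamlit app it describes. First, the character lookups from utils.py. The vocabulary string and module layout are assumptions based on the original tutorial rather than copies of the repository file, so treat this as a sketch.

```python
# utils.py (sketch) -- character <-> number lookups the transcript refers to.
# The vocabulary below is assumed from the original LipNet tutorial; verify it
# against the repository before relying on it.
import tensorflow as tf

vocab = [x for x in "abcdefghijklmnopqrstuvwxyz'?!123456789 "]

# Maps characters to integer tokens, and back again, for the CTC-based model.
char_to_num = tf.keras.layers.StringLookup(vocabulary=vocab, oov_token="")
num_to_char = tf.keras.layers.StringLookup(
    vocabulary=char_to_num.get_vocabulary(), oov_token="", invert=True
)

# utils.py also defines load_data(path), which returns the pre-processed mouth
# frames and the alignment tokens for a clip (not reproduced here).
```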
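Next, a schematic of modelutil.py, which the transcript uses to load the trained model. The layer sizes, input shape, and checkpoint path below are assumptions based on the original LipNet tutorial; check them against the GitHub repository before reusing this. The final Dense layer has one unit per character plus a blank token because the model is trained with a CTC loss.

```python
# modelutil.py (sketch) -- loads the trained LipNet-style model described in
# the transcript. The architecture is a schematic reconstruction (Conv3D
# feature extraction feeding bidirectional LSTMs with a per-frame softmax);
# layer sizes and the checkpoint path are assumptions.
import os
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Conv3D, MaxPool3D, Activation,
                                     TimeDistributed, Flatten, Bidirectional,
                                     LSTM, Dropout, Dense)

def load_model() -> Sequential:
    model = Sequential([
        # 75 frames of 46x140 grayscale mouth crops (assumed input shape).
        Conv3D(128, 3, padding="same", input_shape=(75, 46, 140, 1)),
        Activation("relu"),
        MaxPool3D((1, 2, 2)),
        Conv3D(256, 3, padding="same"),
        Activation("relu"),
        MaxPool3D((1, 2, 2)),
        Conv3D(75, 3, padding="same"),
        Activation("relu"),
        MaxPool3D((1, 2, 2)),
        TimeDistributed(Flatten()),
        Bidirectional(LSTM(128, return_sequences=True)),
        Dropout(0.5),
        Bidirectional(LSTM(128, return_sequences=True)),
        Dropout(0.5),
        Dense(41, activation="softmax"),  # assumed: vocab size + CTC blank
    ])
    # Load the 96-epoch checkpoint mentioned in the video (path assumed).
    model.load_weights(os.path.join("..", "models", "checkpoint"))
    return model
```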
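Finally, a consolidated sketch of streamlit_app.py covering the steps the transcript walks through: page config, sidebar, video selection, ffmpeg conversion, the pre-processing GIF, and the CTC-decoded prediction. Folder paths, the banner image URL, and the GIF frame rate are illustrative assumptions, and ffmpeg must be available on the PATH.

```python
# streamlit_app.py (sketch) -- a condensed reconstruction of the app built in
# the transcript above.
import os
import imageio
import streamlit as st
import tensorflow as tf

from utils import load_data, num_to_char    # helpers sketched above
from modelutil import load_model            # helper sketched above

# Wide layout; set_page_config must be the first Streamlit call in the script.
st.set_page_config(layout="wide")

with st.sidebar:
    st.image("https://example.com/ai_banner.png")  # placeholder banner image
    st.title("LipBuddy")
    st.info("This application is originally developed from the LipNet deep learning model.")

st.title("LipNet Full Stack App")

# Let the user pick one of the GRID speaker-1 clips.
options = os.listdir(os.path.join("..", "data", "s1"))
selected_video = st.selectbox("Choose video", options)

if options:
    col1, col2 = st.columns(2)

    with col1:
        st.info("The video below displays the converted video in mp4 format")
        file_path = os.path.join("..", "data", "s1", selected_video)
        # Streamlit cannot play .mpg, so convert the clip to .mp4 first.
        os.system(f"ffmpeg -i {file_path} -vcodec libx264 test_video.mp4 -y")
        video = open("test_video.mp4", "rb")
        st.video(video.read())

    with col2:
        st.info("This is all the machine learning model sees when making a prediction")
        # Same pre-processing the model was trained on; returns the cropped mouth frames.
        video_frames, annotations = load_data(tf.convert_to_tensor(file_path))
        # Depending on your imageio version you may need to cast frames to uint8 first.
        imageio.mimsave("animation.gif", video_frames, fps=10)
        st.image("animation.gif", width=400)

        st.info("This is the output of the machine learning model as tokens")
        model = load_model()
        # model.predict expects a batch, so add a leading batch dimension.
        yhat = model.predict(tf.expand_dims(video_frames, axis=0))
        # Greedy CTC decoding; 75 is the number of frames per GRID clip.
        decoder = tf.keras.backend.ctc_decode(yhat, [75], greedy=True)[0][0].numpy()
        st.text(decoder)

        st.info("Decode the raw tokens into words")
        converted = tf.strings.reduce_join(num_to_char(decoder)).numpy().decode("utf-8")
        st.text(converted)
```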

Links: CTC Blog Post: https://distill.pub/2017/ctc Oh, and don't forget to connect with me! LinkedIn: https://bit.ly/324Epgo Facebook: https://bit.ly/3mB1sZD GitHub: https://bit.ly/3mDJllD Patreon: https://bit.ly/2OCn3UW Join the Discussion on Discord: https://bit.ly/3dQiZsV



Get notified of the free Python course on the home page at https://www.coursesfromnick.com Sign up for the Full Stack course here and use YOUTUBE50 to get 50% off: https://www.coursesfromnick.com/bundl... Hopefully you enjoyed this video. 💼 Find AWESOME ML Jobs: https://www.jobsfromnick.com Get the Code: https://github.com/nicknochnack/LipNet

TensorFlow 2.0 Complete Course - Python Neural Networks for Beginners Tutorial

What is Tensorflow for Python?

Published on: August 19, 2020 by Sagnik Banerjee

All AI and ML engineers work with one programming language or another, such as Python, Scala, R, or C++, and to work effectively in these languages they either need in-depth knowledge of coding from scratch or libraries that make their lives easier. Python is one such language: it supports object-oriented programming, which allows code reusability, and it offers a wide range of libraries. As the language most preferred by data scientists, it has proven to be a true friend when it comes to solving machine learning and deep learning problems.

TensorFlow, Python's library: One reason for Python's success is that it has many libraries and a huge community that contributes knowledge to make the language simpler every day. One such powerful library, widely used by data scientists, is TensorFlow. It is used to carry out machine learning and deep learning work, and the core concept it builds on is the neural network.

Yes, this mirrors the way our brain works by responding to stimuli, and this is the concept behind TensorFlow. The library was developed by Google for internal use but was made open source because Google found it could help the data science community to a great extent in solving its problems. It was released on November 9, 2015 and is licensed under the Apache License 2.0. Read more: 5 Most common programming languages used in AI (Artificial Intelligence)


TensorFlow offers many interesting features; for further details, you can visit the official website and also follow tutorials from sources such as Udemy, DataCamp, Intellipaat, and Coursera.
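To make the neural-network idea above concrete, here is a minimal, self-contained sketch of defining and training a tiny network with TensorFlow's Keras API. The data is random and purely illustrative.

```python
# A minimal neural-network sketch using TensorFlow's Keras API.
import numpy as np
import tensorflow as tf

x = np.random.rand(100, 4).astype("float32")     # 100 samples, 4 features
y = (x.sum(axis=1) > 2.0).astype("float32")      # toy binary labels

model = tf.keras.Sequential([
    tf.keras.layers.Dense(8, activation="relu", input_shape=(4,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(x, y, epochs=5, verbose=0)

print(model.predict(x[:3]))   # probabilities for the first three samples
```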

Conclusion

If you want to learn the concepts of neural networks and experiment with them in your own programs, this library will make your life easy. If you have the requisite skills, you can also showcase your talent with it, and you can contribute to the development of the API by submitting code that enhances how TensorFlow works.



Deep Learning for Computer Vision with Python and TensorFlow – Complete Course

GitHub.com/TensorFlow

          

Documentation


TensorFlow is an end-to-end open source platform for machine learning. It has a comprehensive, flexible ecosystem of tools, libraries, and community resources that lets researchers push the state-of-the-art in ML and developers easily build and deploy ML-powered applications.

TensorFlow was originally developed by researchers and engineers working within the Machine Intelligence team at Google Brain to conduct research in machine learning and neural networks. However, the framework is versatile enough to be used in other areas as well.

TensorFlow provides stable Python and C++ APIs, as well as a non-guaranteed backward compatible API for other languages.

Keep up-to-date with release announcements and security updates by subscribing to announce@tensorflow.org. See all the mailing lists.

Install

See the TensorFlow install guide for the pip package, for enabling GPU support, for using a Docker container, and for building from source.

To install the current release, which includes support for CUDA-enabled GPU cards (Ubuntu and Windows):

$ pip install tensorflow


Other devices (DirectX and MacOS-metal) are supported using Device plugins.

A smaller CPU-only package is also available:

$ pip install tensorflow-cpu


To update TensorFlow to the latest version, add --upgrade flag to the above commands.

Nightly binaries are available for testing using the tf-nightly and tf-nightly-cpu packages on PyPi.

Try your first TensorFlow program

$ python

>>> import tensorflow as tf

>>> tf.add(1, 2).numpy()

3

>>> hello = tf.constant('Hello, TensorFlow!')

>>> hello.numpy()

b'Hello, TensorFlow!'

For more examples, see the TensorFlow tutorials.
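As a slightly larger follow-on example (a sketch, not part of the official README), here is how automatic differentiation looks with tf.GradientTape, one of TensorFlow's core building blocks.

```python
# Computing a gradient with tf.GradientTape.
import tensorflow as tf

x = tf.Variable(3.0)
with tf.GradientTape() as tape:
    y = x ** 2 + 2 * x          # y = x^2 + 2x
dy_dx = tape.gradient(y, x)     # dy/dx = 2x + 2, which is 8 at x = 3
print(dy_dx.numpy())            # 8.0
```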

Contribution guidelines

If you want to contribute to TensorFlow, be sure to review the contribution guidelines. This project adheres to TensorFlow's code of conduct. By participating, you are expected to uphold this code.

We use GitHub issues for tracking requests and bugs; please see the TensorFlow Forum for general questions and discussion, and direct specific questions to Stack Overflow.

The TensorFlow project strives to abide by generally accepted best practices in open-source software development.

Patching guidelines

Follow these steps to patch a specific version of TensorFlow, for example, to apply fixes to bugs or security vulnerabilities:

Continuous build status

You can find more community-supported platforms and configurations in the TensorFlow SIG Build community builds table.

Official Builds

Build Type                   Artifacts
Linux CPU                    PyPI
Linux GPU                    PyPI
Linux XLA                    TBA
macOS                        PyPI
Windows CPU                  PyPI
Windows GPU                  PyPI
Android                      Download
Raspberry Pi 0 and 1         Py3
Raspberry Pi 2 and 3         Py3
Libtensorflow MacOS CPU      Nightly Binary, Official GCS (status temporarily unavailable)
Libtensorflow Linux CPU      Nightly Binary, Official GCS (status temporarily unavailable)
Libtensorflow Linux GPU      Nightly Binary, Official GCS (status temporarily unavailable)
Libtensorflow Windows CPU    Nightly Binary, Official GCS (status temporarily unavailable)
Libtensorflow Windows GPU    Nightly Binary, Official GCS (status temporarily unavailable)

Resources

Learn more about the TensorFlow community and how to contribute.

Courses

License

Apache License 2.0

tensorflow/tensorflow 

Taking Your Existing Business

With AI TensorFlow

Build a Deep Learning Model that can LIP READ using Python and Tensorflow | Full Tutorial

Entire transcript (no timestamps). Setup code sketches follow the transcript.

the field of machine learning is taking Monumental leaps each and every day there's a new machine learning model which is pushing what we thought was possible we've seen the likes of whisper chat jpt daily 2 and large language models so I wanted to take this opportunity to help you build your very own game changer model we're going to be teaching machine learning how to lip read [Music] what's happening guys my name is Nicholas tonight and in this tutorial we are going to be building our very own machine learning model that is able to read lips now all this code is going to be available as well as the data so you're going to be able to build it up from scratch and get this up and running now Nick why do we need to build a machine learning model that's able to read lips well this is almost like an extension of the sign language model that we've built previously it improves accessibility and gives Society the additional power to be able to use machine learning for good so how are we going to build it well we are going to be using a range of Technologies we're going to be using opencv to be able to read in our videos when we're going to be using tensorflow to be able to build up our deep learning model we're then going to bring it all together and test it out so that we're able to decode what a person might be saying and again this is going to be using a client conversation format so you'll be able to get an understanding of what we are doing at each single point in time ready to do it let's go and have a chat to our client yo Nick what's up Johnny yeah not much oh no this isn't one of your ml startup ideas again is it actually uh all right what is it I was hoping you could get me to use machine learning to do lip reading lip reading yup what are you the FBI why lip reading well it kind of goes hand in hand with the stuff that you did around sign language but flipped around okay fair interesting I'll even give you 10 of the company I'm gonna need you to code it all though all right fine let's do it but not because of the 10 cent because of you guys hopefully you enjoyed this tutorial first thing we need to do is install and import our dependencies let's go get em co-founder let's do this thank you we interrupt your regular programming to tell you the courses from me is officially a lot if you'd like to get up and running in machine learning deep learning and data science head on over to www.courses from Nick to find the latest to end greatest I'm also going to be releasing a free python for data science course in the upcoming weeks so be sure to stay in the know but if you're ready to hit the ground running well I highly recommend you check out the full stack machine learning course this goes through seven different projects 20 hours of content all the way through full stack production ready machine learning projects head on over to www.courses from nick4 bundles forward slash full stack ML and use the discount code YouTube 50 to get 50 off back to our regular programming alrighty so the first thing that we told our client that we would be doing was installing and importing some dependencies so let's go on ahead and do that okay so the first thing that we need to do is install a bunch of dependencies so we've got our first line of code which is exclamation mark pip install opencv Dash python map plot lib image i o and G down now we also need to add tensorflow to this list if you don't have it installed already Okay so we're going to be using opencv to pre-process our data and I'm going to 
I've actually built a little script that's going to download the data for you from my Google Drive. Matplotlib is going to be used to render the results, so we'll actually be able to see the outputs of our pre-processed videos. We're going to use imageio to create a quick little GIF, so you'll be able to see a couple of the frames stacked together; it looks pretty cool. gdown is going to be used to actually download our dataset; it works really seamlessly with Google Drive, and I think that's what I'm going to start doing for datasets going forward. And TensorFlow is going to allow us to build a deep neural network. So if we go and run that cell, it should go ahead and install all of our dependencies. That's looking pretty promising. If we want to take a look at the different versions of the libraries that we're going to be using (we really should be moving this cell up), we can run pip list. So we installed opencv-python, and the version we're using is 4.6.0.66; matplotlib is 3.6.2; imageio is 2.23.0; gdown is 4.6.0; and for tensorflow we're using 2.10.1. Also, this code is going to be available on GitHub, so if you jump over to GitHub, go to nicknochnack and then repositories, this file is under the LipNet repo. It's private right now, but I'm going to make it public. If you take a look, I've included pre-trained model checkpoints, so you don't need to train this yourself; you can kick things off with a pre-trained model. I've also included the Jupyter notebook, so a quick git clone of that particular repository and you're going to be able to get started with all the code that you see here. Okay, those are the packages we needed installed. The next thing that we need to do is import a bunch of dependencies. First up we're importing os, so the first line is import os; this is just going to make it a lot easier to navigate and traverse different file systems, and it works a lot more seamlessly whether you're on a Windows machine or a Linux machine (there are a few nuances that I had to handle, particularly for the data splitting, but I'll explain that a little bit later). Our second line is import cv2; this imports OpenCV, which is needed to pre-process and load up our videos. Then we've got TensorFlow, so import tensorflow as tf; that's going to be our primary deep learning framework, and we're going to use TensorFlow data pipelines as well. Now tf.data can be a little bit tricky, so I always try to evaluate whether something like this is better done in TensorFlow versus PyTorch; who knows, maybe at some stage I'll actually transition. But tf.data is a great data pipeline framework: it allows you to actually go and transform data. It can be a little bit fiddly at times, so sometimes you do need to do things that are slightly nuanced, which is going to be the case regardless of which deep learning framework you're using, but we're going to be using a proper TensorFlow data pipeline.
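Pulling the imports described here and over the next few lines into one cell, the top of the notebook looks roughly like this; the GPU memory-growth guard explained a little further below is included as well, and it's a safe no-op on a CPU-only machine.

```python
import os                             # path handling across Windows / Linux
import cv2                            # OpenCV: reading and pre-processing the videos
import tensorflow as tf               # deep learning framework + tf.data pipelines
import numpy as np                    # general array work
from typing import List               # light type annotations on the loader functions
from matplotlib import pyplot as plt  # rendering pre-processed frames
import imageio                        # turning a stack of frames into a GIF

# Stop TensorFlow from grabbing all GPU memory up front (explained just below).
physical_devices = tf.config.list_physical_devices('GPU')
try:
    tf.config.experimental.set_memory_growth(physical_devices[0], True)
except (IndexError, RuntimeError):
    pass  # no GPU available, or memory growth was already configured
```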
A tf.data pipeline is probably a lot more closely aligned to proper machine learning operations, so we are going to be building up our data pipeline using the tf.data API. The next thing that we're importing is numpy, so import numpy as np; it's always good to have numpy along for the ride if you need to pre-process any arrays. I've then imported typing; this is something I've personally taken on as a bit of a challenge or stretch goal this year, to start using type annotations a little bit better. I'm not great at it, but I am improving, hence why we're going to be using the List type annotation. We've then imported matplotlib, so we've written from matplotlib import pyplot as plt; this is going to allow us to render the pre-processed or post-processed output of our data loading function. And I've also brought in imageio; there's a one-liner that allows you to convert a numpy array to a GIF, which looks really cool and lets you see what you've actually got after pre-processing, which is particularly useful when you're dealing with videos. Okay, so those are our imports: import os, import cv2, import tensorflow as tf, import numpy as np, typing, matplotlib and imageio. Our dependencies are now imported. The next thing that we need to do is prevent exponential memory growth. If you're running this on a GPU, which I highly recommend you do, whether that be Colab, some other cloud service, or a CUDA-enabled GPU on your own machine, I highly recommend you run this line, because it's going to prevent your machine from sucking up all the GPU memory and hitting out-of-memory errors. You've probably seen me use this a bunch in other deep learning videos as well. First up we grab all of our physical devices, so physical_devices = tf.config.list_physical_devices. We need to run our imports first, so let's let that run. Five minutes later... now, if I go and run this, we should be able to see which physical devices we have on the machine, and you can see I've got my one GPU showing up there. What we then say is that we are going to prevent any exponential memory growth, so tf.config.experimental.set_memory_growth, and we assign that to the one GPU that we've got and set it to True. If we do have a GPU we'll be able to successfully set that; if we don't, we just pass. You can then go and run this, and you need to do it pretty much straight away, before you do any modelling or anything, otherwise it's not going to take. Okay, that's our set of dependencies installed and imported. Just to recap: we've installed opencv-python, matplotlib, imageio, gdown and tensorflow, we've imported all of our dependencies, and we've set memory growth to True for TensorFlow, which is particularly applicable if you are training on a GPU. Those are our dependencies now installed and imported; back to our client. So we're going to be working with the GRID dataset for this? Nice. Is this something that we'd eventually be able to use with a custom dataset, say of ourselves? Sure, I've actually got this planned: we just need to capture frames of a person speaking, then use a speech-to-text
model to transcribe what they're saying that data set could then be subbed into the model training pipeline let me know if you want that tutorial ah got it so the grid data set for now yep we need to build two data loading functions one to load the videos and one to load the Align transcriptions got it let's roll alrighty so now that we've gone and installed and imported our dependencies the next thing that we want to go on ahead and do is build our data loading function so there's two key data loading functions that we're going to need to build here the first is to load up our videos and then the second is to actually pre-process our annotations and our annotations in this case are sentences which our particular person in the videos is actually gone and talked about now the data set that we're going to be using is an extract of the original grid data set so this data set was built to be able to go and build lip reading models now I've gone and made your life a little bit easier by just loading this data into my Google Drive so you're just going to be able to download the specific sections or parts that I use to actually go and about how to build this that you're going to be able to go ahead and build this so first things first what we need to do is import G down so that full line is import G down and G down is a library that just makes it super straightforward to going ahead and grab data out of Google Drive once you've got that the next thing that we're going to go on ahead and do is download the data itself we are going to Output it inside of a file called data.zip and then we'll extract it all into its own separate folder so the full line is URL equals and then I've got this specific URL here so you can actually grab that paste it into your browser and you'll be able to download the data set we're just going to use Python to draw it because it makes your life return easier we're going to Output the file to a file called data.zip we're going to use gdown.download to that we pass through the URL which is this over here we also pass through the output file name and we've set quite equal to false we can then extract that as well because we're going to be downloading it as a zip file we don't need it as a zip file we need it unpacked so we can use gdown.extract all and pass through data.zip to be able to go and extract that so if I actually go and run this you'll see it should start downloading our data and there you go we're now downloading data so it's around about 423 Megs this is only one speaker the original grid data set I think has something like 34 different speakers so if you wanted to extend this way further up specifically using the grids data set you definitely could but I'm going to take this a different direction later on and we're actually going to grab data of ourselves and be able to train it on that so let's let that download and then we'll be able to kick things off come here want to know a secret are you looking for your next dream job in data science machine learning deep learning or just data in general will you need to join jobs from Nick each and every week I send you a curated list of the best jobs in data these are jobs with great perks great people and great roles plus you'll get access to exclusive content like amas interviews and resume reviews so why not join plus it's completely free link is in the description below what are you waiting for all right back to the video a few moments later alrighty so that is our data now downloaded you can say that we've gone 
and successfully downloaded it there now if we actually go and open this up you'll actually see that we have got our data.zip file now downloaded and we've also got this new folder called data which is what the extractor function would have done app is something that I got planned for in the future if you want to see this as part of a code that episode inside of a full stack app let me know in the comments but we are most interested in this over here so this data folder so if we open this up and this is what this uh section or code cell is going to create we're going to have a file called alignments and inside of that a file called S1 or a folder called S1 and this represents all of our annotations they're in this dot align file format which is interesting to say the least if you open them up this is what an annotation looks like now these specific videos are really around moving certain things to certain places so it still means silence and then we've got these different commands so binblue at f2 now if we go and take a look at another one um let's scroll on down there's this one lay white by F5 again so you can see that it's not necessarily things that you'd encounter out there in the real world but we're definitely going to be able to train a model to be able to decode this from purely a video no audio so still mean silent so we're actually going to get rid of those when it comes to pre-processing annotations but we want to really extract this so we want lay white by F5 again and then we in this particular case we'd want bin blue at f2 now now if we actually go and take a look at their videos we've actually got videos in here as well so if I jump into our S1 folder so if I go into my root folder so datum and then S1 and then you can see I've got MP4s here so that's not going to play it within um what do we do inside a jupyter notebook so let's actually just open it up so if I go into the folder that I'm currently working in and we go into data and we go into S1 you can see I've got all of these MPEG files right so these are all about videos if I go and play one pin blue F4 please let me Chuck my headphones on is that been blue F4 please in blue at F4 please so you can see that we've got different videos of a particular person saying something now eventually it's Blue by C7 again Place B Blue by B7 again so you've actually got matchy annotations right so if we go and open up a specific annotation go to alignments S1 and go and open up this one so BB af2n this should effectively represent The annotation for the matching video so bin blew at F true now so if we go and find that particular video which should be the first one so BB AF let me zoom in boom boom boom pbaf2n so we should be able to go and play this blue F2 now right so it just said Ben blue at f2 now in blue at f2 now so this is the data that we're going to begin working with now that is our data now downloaded the next thing that we want to go on ahead and do is actually get this into a data loading function so I've gone and written this function called load video this is going to take a data path and then it's going to Output a list of floats which is going to represent our video so what we do is we first create a CV2 instance a video capture instance which takes in our path and then we're going to Loop through each one of these frames and store it inside of an array called frames what we then do is we reduce them or we calculate the mean we calculate the standard deviation and then we standardize or scale our particular image 
features so we subtract the mean and we divide it by a standard deviation I'm also doing something here which is effectively isolating the mouth region now I'm doing this using a static slicing function so I'm basically saying go from position 190 to 236 and position 80 to 220 to isolate the mouse there's there is a slightly more advanced way to do this using a specific face detector to extract the lips which is what the original lipnet paper actually does so if I show you the lymph paper lip net paper so lipnet actually uses I think it uses dlib to be able to go and extract the mouth so if I go d-lib search from within here d-lib yeah so they use dlib to be able to go and isolate their mouth now I've just gone and done it statically for the sake of keeping this relatively straightforward but that is effectively what we're doing there so We're looping through every single video we're storing the frame inside of our own set of arrays we're converting it from RGB to grayscale as well that means that we're going to have less dado to pre-pros pre-process and then we're isolating the mouth using this static slicing function we're then standardizing it so we're calculating the mean calculating the standard deviation that's just good practice to scale your data and then we're casting it to a float 32 and dividing it by the standard deviation so if we go and run that that is our load video function now done now we're going to go on ahead and Define our vocab now a vocab is really just going to be every single character which we might expect to encounter within our annotation so bin blew at f2 now we've also got a couple of numbers in there as well just in case we need them so if I go and run that and we can actually go and take a look at our vocab so you can see it's just a list which contains each and every potential integer there now the cool thing about this is that we can actually go and use the Kara string lookup function to be able to go and look up or convert our characters to numbers and our numbers to characters so over here you can see I've got Char to num and this is originally from the Keras CTC uh I think it's ASR tutorial so it actually uses this specific loss function to do automatic speech recognition so they I've actually thought that this was a really neat way to do it and it keeps everything nice and clean so we've got two functions here Charter num and num to chart the first one takes a character and converts it to a number and the second takes a number and converts it to a character so it just makes your life a ton easier when it comes to actually converting text to string and string to text so if I go and type in child to num I think we can go and pass through um let's say a e d and you can see it's converting it to one two three now if I went and typed in um n I C okay it should be a comma you can see it's converting each one of these characters to an integer over here so this is effectively one hotting not necessarily one hot encoding our data set but it is tokenizing it and returning a specific token value or an index effectively now we're going to be able to pass through this data to our loss function to be able to calculate our overall loss because our model is going to be returning a one hot encoded version of this now likewise we can actually go and decode this so if I go and use num to char and if I pass through this array which is 14 9 3 and 11 we should get the reverse which gives back Nick boom and you can see that there so it's a byte encoded value but you can see 
I've got n i c k so these are going to be our lookup functions that allow us to convert and reconvert our text to encodings okay that is our vocabulary now defined so the full line there is chart underscore two underscore num is equal to TF dot keras.layers.stringlookup we pass through our vocabulary and then we're setting out a vocabulary token so if it encounters a character that it hasn't seen before it's just going to be a blank value then we're doing the opposite we're creating a num to child function which is equal to TF dot keras.layers.stringlookup we are then going and passing through the reverse so we can actually go and get our vocabulary out of this so if I type in chart to num dot get vocabulary boom you can say it's just returning back all of our unique characters beautiful again we're setting out of uh vocabulary token and we're using invert equals to true to say that we want to convert numbers to characters not the other way around okay and then we're printing out our vocabulary uh vocabulary and the size all right and then we're going to use a function to actually load up our alignments so our alignments being these we are going to take in a specific path which eventually is going to map through to these paths or alignments forward slash S1 we're going to open up that path and then we're going to split out each one of these lines if the line contains the value silence we are going to ignore it because we don't necessarily need it we're then going to append these into an array called tokens and we're going to convert them from characters to numbers so we're going to go and split that data out and convert it into a set of characters now there's one last thing that we need to do before we can go on ahead and test this out we need to go and load the alignments and the videos simultaneously so we're going to extract both of those paths and we're going to return the pre-processed videos and the pre-processed alignments together so we need a load data function so this is going to take in the path to our video we are then going to split it out and convert it so that we have a video path and an alignment path what we're then going to do is we're going to use both of our functions so we're going to use our load video function and our load alignments function over here and we're going to return the frames and the alignments out of each one of those functions so if I go and run that what we can then do is get a test data path so if we just go and grab this particular video which is our first one so bba16n and then what we can go on ahead and do is pass that to our load data function which is what we had here I'm going to wrap that specific path inside of our tf.convert to tensor function and this is just going to convert a raw string to a TENS flow tensor so TF dot convert utensa if I pass through test path you can see we're going to get a tensor back now to grab the tensor value we can type in Dot numpy and that's going to grab that and then I believe we can type in decode and it should be UTF -8 and then we're going to be able to grab that specific path inside of our load data function what we're doing is we're actually splitting now if you're running this on Windows you're perfectly fine to run this as is if you're going to run this on a Linux or Mac machine comment this line out and uncomment this so this is going to be I believe to be able to run it on a Linux machine I had to play around with it when I was running it on colab versus running it on my Windows machine so that's the 
only change that you do need to go and make if you're going to run it on a different type of machine so I'm going to comment that out and leave the windows bit open so what we'll effectively be doing is we'll be grabbing this string and then we'll be splitting it so if I type in dot split I'm going to split on the double backward slash so that we are now able to unpack the entire path because what we actually want to do is we want to grab this file name here because we're going to grab the matching alignments to that because the alignment will be called bba1 or l6n dot align and it's going to be in a slightly different folder so this magic that is happening here in these three lines is exactly what is happening so I'm then going and splitting on oh we're actually grabbing the last value which is index negative one so you can see that we've now got the file name there and then we're splitting again on a DOT so we've now got the file name and the file extension we can then grab the file name like so by grabbing the first index and you can see that there so that is grabbing the file name we're then appending it using os.path.join so we'll grab in the video path and the alignment path remember if we go and take a look at our data it's freezing up a bit all right so for inside of our data folder we've got a folder called alignments in S1 S1 contains all of our videos and alignments contains an S1 folder which contains all of our alignments with speaker one and we've only got one speaker because I've cut down the data set so if we actually go and run this load data function this should return our pre-processed videos as well as our pre-processed alignments which would which we should then be able to go and use inside of our deep learning function Okay cool so take a look so we've got a tensor return which is a 75 frames in length which is 46 pixels High by 140 pixels wide by one channel because we've gone and converted that to RGB if we go and take a look what do we have in our next cell so this is returning a mappable function let me just quickly run you through this first up so if we actually get what we're going to be getting back from this low data function is frames and then alignments so if we go and take a look at frames that is our frames data set so you can see that we've got the shape there if we wanted to go and plot out an example so I could run plot.im show and if we just grab one frame we should be able to show it so that's so you can actually see the person's mouth right there pretty cool right so this actually allows you to go and see all of the different frames that we're going to process and as we go through each one of these frames you're going to see the mouth move so if I jump ahead to frame 40 you can see that the lips are moving and this is the impact of subtracting the mean and the standard deviation we're really isolating these regions that you can see highlighted in yellow here now if we go and take a look at our alignments alignments this is the word representation of what is being said so if we actually go and run this through uh it should be numb to char which remember are our pre-processing functions that allow us to convert numbers to characters we should be able to go and pre-process this so if we grab the numpy value out of that boom you can see uh it's let's go and decode utf-8 and we might need to go and loop through 4X in we're going to return x dot d code utf-8 and what do we get in there it has no attribute decode we just used that up there let's just print 
it out. Oh, I think we need to grab the numpy value first... there we go. And then we should be able to call decode('utf-8')... that doesn't want to do it... all right, there we go, much better. So this is the result of our transformation: you can see it says bin blue at l6 now, so that is the result of actually transforming our alignment. I've just gone and looped and printed, which is a long-winded way to do it and not exactly the most efficient, but it is showing us our result, so you can see there that we've got our final end result. There's a way to condense this down as well; I think it's tf.strings.reduce_join... unmatched bracket, what have we done there, what is that enclosing... boom, there you go. So that is the result of our transformation: you can see we've got bin blue at l6 now, which is us undoing all of our transformation. The raw representation of that is just the alignments tensor, so each one of those individual numbers just represents a character in this specific sentence, bin blue at l6 now. Okay, that is our set of alignments done. The last thing that we're going to do is wrap this inside of a mappable function, which is going to allow us to return float32s and int64s, and to use our raw string processing. This is one of the nuances I noticed when dealing with a TensorFlow data pipeline: typically, if you want to use pure Python string processing inside the pipeline, you've got to wrap it inside a tf.py_function. So if we go and run that, the next thing we'll be able to do is create our data pipeline. Let's quickly recap: we have successfully downloaded our data using gdown, created a pre-processing function to load our video, defined our vocabulary, defined a character-to-number function, a number-to-character function, a load_alignments function and a load_data function, and then tested it all out using our test path. You can see that we are now returning a bunch of frames showing our person's mouth, which should effectively show the mouth moving when we stack all of the frames together. We've also converted our alignments from the raw text into an encoded sequence that we'll be able to pass through to our machine learning model, and we've created the mappable function which we're going to need for our data pipeline in a second. Alrighty, let's jump back over to our client. So that's our data loaded, right? Right-ish; we need to build a data pipeline. This will be used to train the deep learning model: TensorFlow will draw random samples from our dataset in order to complete one training step. Oh okay, anything else? Yeah, we also need to look at the data to make sure our transformations have worked successfully. Nice, off to the pipeline we go then. So we're now on to creating our data pipeline, so let's go ahead and do this. The first thing I do here is import matplotlib, which we already had imported; that's me importing stuff multiple times, ignore that. Most importantly, we are going to create our data pipeline, and this is probably one of the most important bits of this entire build, because creating the neural network is great.
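Before assembling the pipeline, here's a minimal sketch of the mappable wrapper just described, together with the tf.data pipeline that gets built up over the next few steps. It assumes the load_data helper defined earlier (returning frames and alignments), the ./data/s1/*.mpg layout, and the train/test split that only gets added later in the walkthrough.

```python
# Wrap the plain-Python loader so it can run inside a tf.data map.
# load_data is the helper defined earlier (returns frames + alignments).
def mappable_function(path: str) -> List[List]:
    result = tf.py_function(load_data, [path], (tf.float32, tf.int64))
    return result

# The pipeline assembled over the next few steps, roughly:
data = tf.data.Dataset.list_files('./data/s1/*.mpg')
data = data.shuffle(500, reshuffle_each_iteration=False)
data = data.map(mappable_function)
data = data.padded_batch(2, padded_shapes=([75, None, None, None], [40]))
data = data.prefetch(tf.data.AUTOTUNE)

# Train/test partitions (added back into the pipeline later in the video).
train = data.take(450)
test = data.skip(450)
```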
actually having a data pipeline is just as important so first things first we are going to create our data set and to do that we are running tf.data.dataset.list files so this is going to go inside of our data folder inside of our S1 folder and it's going to look for anything that ends in MPG which is the file format that our videos are currently stored in it's quite old but it still definitely does work we're then shuffling it using data.shuffle and we're specifying the cache size to 500 so this will grab the first 500 Shuffle those up and then return a value out of that we're then mapping it so we're going to take the raw file format so if I actually comment this out let me show you what it looks like so at its base file format if I run data dot as numpy iterator dot next this is just going to return a file path which is then going to be passed through to our load data function which is going to do the splitting and then we're going to run two sub functions which is load video and load alignments so that is exactly what we need to do by running the map function so even if I run Shuffle you're still only getting back files right or file paths this isn't returning data yet which is where the map function comes in so data equals data.map and we're running this mappable function the mappable function is just wrapping our load data function inside of a tf.pi function and this is going to allow us to work with our specific file path formats which now if I go and run this and if I go and run this particular cell we're actually going to get our data back so this is returning our frames and our alignments so if I go and take a look at frames boom that's our set of frames we can run plot dot IM show grab one set of frames boom uh we need to close this not boom boom all right so you can see that we're now getting our data back inside of our data pipeline if I go and take a look at our alignments boom you can say that we're now getting our alignments back okay so then what we want to do is we want to pad this out because right now we're going to have variable lengths for each one of these sets of alignments if I go and run this again you can see that one's a different length that one's a different length that one's a different length that one's a different length and this is because there's going to be a different number of characters inside of each one of these sets of alignments that we've got over here so what we can do is we can convert these to a padded batch so we've then gone and overridden our data Pipeline with data equals data dot padded batch we're batching into group sizes of two so each one of these is going to have two videos and two sets of alignments we're then patting out our shapes so we're not really going to pad out our videos we are going to ensure that we have 75 frames we're not going to pad out the actual image itself we're just going to ensure that we have 75 frames for each one of these videos and we're going to ensure that we have 40 tokens for each one of our alignments if there's less than 40 it's going to be padded out to zero and then we're prefetching to ensure that we optimize our data pipeline so that we're loading or pre-loading as our machine learning model is still training so if we go and run this full pipeline that is brilliant we can then go and run this particular line here which is going to now load two videos and two sets of alignments so you can see that we've now got two sets of alignments and you can see that we've got trailing zeros at the end because 
it is padding out our alignment likewise if we go and take a look at a frame you can say that we should have two frames now so if I type in land frames boom we've got two sets of videos inside of each batch okay that is looking brilliant now what we can actually go ahead and do is run through this so I'm going to run data dot as numpy iterator so this allows you to iterate through exactly the same as what we're doing up there you can see that by running dot next we're going to get a so Val 0 is going to be returning our frames then this is my favorite function right so imageio dot mimsafe actually converts a numpy array to a gif so if I go and run this line here so imageio dot mimsave it's actually going to take our data set which is what we've gone into find over here and it's going to grab the second value which should actually it's going to grab the second instance of our video so if I set I could set this to zero or one because we've got two sets of videos inside of each batch and it's going to convert it into a gif so if I actually go and run this inside of your folder now you should have a file called animation.gif and you can see that this is what our lipnet model is going to learn to decode so purely based on the gift that you're seeing it's going to learn to try to decode what the person is saying and convert this to text this is the amazing thing about this model because we'll actually be able to take nothing but this types of data and convert it into a sentence really and this is going to get even better once we go and convert this onto our data set which is probably going to come inside of another tutorial but I wanted to get this base one out okay so that is what image io.mimsave does we can then go and plot out our image which you've sort of seen already so plot.im show what we're doing here is we're grabbing our sample data set which we've just gone and created over here we're grabbing our video so index 0 is going to reference the so let me explain this indexing or this subscripting so the first zero is representing that we want our videos so that's what our first zero is referencing the second video is saying give me the first video out of the batch and then the third zero is telling me give me the so the third zero is giving me a return the first frame my head's blocking this frame in the video right so if we wanted to go and grab the last frame I could pass through 74 because remember we're going to have 74 or 75 frames per video that is the last frame in our video we could go and grab right in the middle which is going to be 35 which you can see is the mouth moving I could even go and grab the second video by changing this index to one here so you can see that that is a completely different video now and then we've also got our alignments which we went and took a look at so oh wow I didn't need to go and do all of that decoding I knew I had a way more efficient way to write it so tf.strings.reduce join we're then going and looping through every single word inside of our alignment and you can see that this is the end result so bin white by n two now which is going to be The annotation for our first video over here so if we actually go and take a look this is doing our this right now is grabbing our second video so if we went and created the GIF for the first video so that should have gone and done it if we've gone reopen our animation so this animation that you can see here is the representation that we're going to be passing through to our neural network which actually 
represents in white by N2 now so this is almost like moving chess pieces it's not chess pieces but that's sort of the feeling that you get right or that's sort of the set of commands that we're actually getting out or that a person is actually communicating back through and this animation so we're going to effectively produce the Deep learning model which takes this input and is able to Output this bin wide by N2 now pretty cool right anyway that is our data pipeline now built so kind of straightforward there so we've gone and created a tensorflow data set we've gone and tested it out using the dot as numpy iterator method and using the dot next method to grab the next instance out of our data pipeline we've also used imageio dot mimsave to convert a numpy array into a gif so that you can see what this actually looks like and we have also gone and taken a look at the pre-processed images as well as the pre-processed annotations alrighty that is our data Pipeline and now ready for training so remember we've now got a data pipeline over here we're not going to be splitting this out into training and validation although you definitely he could and now we're just going to be running on this particular data set best practice is you split this out into a training and validation partition but if you guys do this as part of the tutorial let me know and I'll add it to the GitHub repository okay that is our data pipeline now ready alrighty on to modeling I'm destined for the catwalk man bruh seriously though check out my palette face I'm the male embodiment of Bella Hadid yeah well we've got to build this model now we're going to use 3D convolutions to pass the videos and eventually condense it down to a classification dense layer which predicts characters so single Letters At A Time yep we'll use a special loss function called CTC AKA connectionist temporal classification to handle this output interesting why use that loss function well it works great when you have word transcriptions that aren't specifically aligned to frames given the structure of this model it's likely to repeat the same letter or word multiple times if we use a standard cross-entropy loss function this would look like our model's way up CTC is built for this and reduces the duplicates using a special token but our data set was aligned yeah you bang on but when it comes to eventually subbing out the data with data that we create it's going to be way more cost effective to Simply use non-align data our model is going to be ready for it ah got it after the catwalk then the next thing that we need to do as we told our client is actually design our deep neural network although we're not going to be on the catwalk we are going to be working in tensorflow so first things first we are going to go ahead and import our dependencies there's quite a fair few here so we've actually got Os Oh man I need to go and clean up some of those inputs so first up we're going to be importing the sequential model class so from tens loader Carousel models import sequential we're then going to be importing a bunch of layers so from tensorflow.keras.layers import con 3D so this is a 3D convolution the con 3D tensorflow absolutely brilliant when working with videos or we're going to be performing a 3D convolution or spatial convolution over volumes to use quite a fair bit for video processing or video classification in this that particular example that I was sort of looking at previously uh we are then going to be using an lstm so this is going to give us 
our current neural network eventually I want to convert this over to a Transformer neural network so that we've sort of moving over to state of the art we're using a dense layer Dropout layer a bi-directional layer to be able to go and convert or pass through our temporal component when we're using our lstm we are also using I think we need to clean this up but we've got maxpool 3D activation reshape spatial Dropout batch normalization time distributed and flattened I think I don't actually use all of those there might be leftovers from me prototyping this but we'll take a look we've then got our Optimizer so from tensorflow.cars.optimizes import atom and then we've also got our callback so from tensorflow.carastore callbacks import our model checkpoint so this is going to allow us to save down our model every X number of epochs I think we're doing it every single Epoch and our learning rate scheduler so ideally we don't we want to sort of start out fast and then slow down as we get to our optimization point or the minimum value of loss that we could potentially get to okay then we've got our neural network so this is a couple sets of convolutions we're then flattening it out using a Time distributed layer we've got a two sets of lstms and then we're using a dense layer to drop this out let me walk you through this so first up we're instantiating our model by running model dot sequential or model equal sequential we're then passing through a convolution with an relu activation with a Max pooling layer I could actually condense this down by just passing through activation over here that's perfectly okay this was again prototyping in its process so then I've gone and read a model dot add conf3d so we're going to have 128 com 3D convolution kernels these are going to be three by three by three in size our input shape is going to be 75 by 46 by 140 and that is the representation that we've got from our data so remember data dot as numpy iterator got next and if I grab the first value or the first value we should be able to go let's grab zero dot shape you can say it is 75 by 46 by 140x4 we're passing through that exact same shape into our neural network we're specifying padding equals same so that we preserve the shape of our inputs then we're using our relu activation to give us some non-linearity to our neural network and we are condensing this down using a 3D Max pooling layer so this is effectively going to take the max values inside of each one of our frames and it's going to condense it down between a two by two square so it's going to take the max value to be able to halve the shape of our inputs then we're doing pretty much the same three times so except the only difference is that we're then going to have 256 3D comms layers or three 3D con units and then 75 3D con units and then we've got this time distributed layer over here so this is effectively going to allow us to have 75 inputs into our lstm neural network so that we eventually will output 75 units which represents our text-based characters we've then got two lstm layers of 128 units we've got a specific form of Kernel initialization I actually found a great repo which shows the pure lipnet model and they were using orthogonal kernel initialization we also are going to be returning sequences so that our lstm layer does not just return a single unit returns all 75. 
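Putting the layers described here, plus the dropout layers and the dense softmax head covered just below, into one place, the Sequential model looks roughly like this. It's a sketch rather than the repo code verbatim: the 75x46x140x1 input shape and the 41-way output (the 40-character vocabulary plus the CTC blank token) come from the shapes discussed earlier.

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import (Conv3D, MaxPool3D, Activation,
                                     TimeDistributed, Flatten,
                                     Bidirectional, LSTM, Dropout, Dense)

model = Sequential([
    # 3D convolutions over (frames, height, width, channels)
    Conv3D(128, 3, input_shape=(75, 46, 140, 1), padding='same'),
    Activation('relu'),
    MaxPool3D((1, 2, 2)),   # halve the spatial dims, keep all 75 frames

    Conv3D(256, 3, padding='same'),
    Activation('relu'),
    MaxPool3D((1, 2, 2)),

    Conv3D(75, 3, padding='same'),
    Activation('relu'),
    MaxPool3D((1, 2, 2)),

    # Preserve the 75 time steps, flatten the spatial dims for the RNN.
    TimeDistributed(Flatten()),

    Bidirectional(LSTM(128, kernel_initializer='Orthogonal', return_sequences=True)),
    Dropout(0.5),
    Bidirectional(LSTM(128, kernel_initializer='Orthogonal', return_sequences=True)),
    Dropout(0.5),

    # One softmax distribution over the vocabulary (+1 blank token) per frame.
    Dense(41, activation='softmax'),
])

model.summary()
```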
we're also specifying bi-directional so we're passing our state from left to right and right to left because it is likely to impact how we actually go and translate this out and it's I believe it's best practice and what was originally done in the paper actually let me show it to you in the paper so if we actually scroll up so they've got there you go so they're using a group as opposed to an lstm so they've got a spatial convolutional neural network they've got a bi-directional Groove we've got a bi-directional lstm and again they're using CTC loss as well then we've got Dropout after each one of our lstm layers so we've got a little bit of regularization and we're dropping out 50 of the units so we've got to drop out after our lstm layer and we've got to drop up out of our other lstm layout and then we've got a dense layer which is going to take in I believe it's 46 units so just taking our vocabulary size plus one to be able to handle our special token the vocabulary size so you can see it's 40 so it'll be 41 outputs so this means that we're going to have our output should be let's take a look so it's going to be 75 by 45. this represents we're going to get one output per frame that we pass through and 45 is going oh 40 actually it's 41 not 45 and 41 is going to represent a one hot encoded representation of our final output now we are using a soft Max Activation so we're going to be able to use an ARG max value to be able to go and return the most likely value we're also going to be doing a little bit of sampling and I think we're using a greedy algorithm later on so that we get the most likely character returned back okay that is our deep neural network so let's go and create that have we not we haven't gone and imported this so let's import this let's go and create our neural network so that is instantiating right now beautiful and then what we can go ahead and do is run the summary so this shows you a little in a little bit better detail what we're actually building up let me zoom out so we've got our convolutional layers we've got our activation and another Max pooling layer again com 3D activation Max pooling con 3D activation Max pooling we've got our time distributed layer right so if you take a look at the last output that we're getting from our conv layers it's going to have the shape of 75 effectively think of this as 75 time steps by 5 by 17 by 75 so that is the last set of output now what we want to do is sort of preserve that temporal component so we keep it at 75 and then we're flattening this down so we've got 6375 here so this is just let me add another cell so this is just these sets of values flattened so it's 5 by 17 by what is it 75 boom 6375 which is what you've got there 6375 then these values are then passed through to our lstm layers which have got 256 units actually they've got 128 units but it's doubled because we're bi-directional so we've effectively got two sets of lstm layers there and then we're then passing that through to a dense layer we've also got our Dropout over here and our drop out over here and then we're passing it through to our dense layer which is going to Output as I said 75 frames by 41 outputs which are one hot encoder representations of our characters total parameters are 8.4 million so it's a it's live but it's definitely no chat GPT large in in that particular respect so that is our deep neural network now created so again you can step through this you can tweak it if you want if you come up with a better architecture by all means do 
let me know I actually saw on papers with code um code they are grid Corpus that somebody might have already built it using attention I haven't taken a look at this but you can see they've got a CTC attention model which would be really really interesting to take a look at but uh if you wanted to go and dig into that by all means do take a look there's also this model here which is on GitHub this was the official model so it is a brilliant example of how to go in ahead and build this up it is a little bit more hardcore to step through and it the architecture is a little bit different to mine but if you actually go into this particular GitHub repository brilliant implementation of lipnet mine just works through it doesn't use as much data and is a little bit more straightforward to walk through okay that is the model summary I'll include that link in the description below by the way all right cool so let's go and test this out so this is gonna suck at the moment but I like to always do this when I'm prototyping a neural network pass through some inputs just to see what we're outputting so right now if I go and grab our model and use the dot predict method we can pass through our validation or our original sample data which we stored as Val and if we go and pass through Val zero we should get some predictions back might take a little bit of time because we're first initializing it so it'll be loading into GPU memory now let's give it a sec perfect we now have a prediction so if we actually go and take a look at the result of that prediction you can see it's just returning random gibberish right now so that is actually what our model is predicting so three exclamation mark exclamation mark KKK bunch of exclamation marks a bunch of these bunch of exclamation marks a bunch of K's so nothing crazy there so we are just using exactly the same as what we did previously so tf.strings dot reduce join we're then passing through or we're using a greedy algorithm and just grabbing the maximum prediction return back if I show you the raw prediction what we're getting back so if I just grab one example so what we're getting back is a set of let's take a look at the shape so we're getting 75 outputs each one of these represented as an array with 41 values which is just a one hot encoder representation of our vocabulary so if I went and run TF dot ARG Max and said axis equals one this is returning back what our model is actually predicted so a bunch of characters so right now we're returning uh the second prediction there this is what it actually looks like right so a whole bunch of characters return back if we went and ran this through our num to character pipeline so four x in that and then if we were going uh what is it num to char plus through X you can say that we've got all of our characters there and if we run TF dot strings dot reduce join come on buddy boom boom boom right you can say that those are our predictions this is exactly the same as this almost identical right um slight difference in that I'm using ARG Max over here rather than over here but that is effectively showing us what our model is currently predicting this sucks right now we're going to make it way better okay we can also take a look at our model input shape and we can also take a look at our model output shape which we've already sort of given or extracted when we when nrenmodel.summary but that is our deep neural network now defined so if I scroll on back up what we've got to Define is we've gone and imported all of our core 
dependencies for our deep neural network we've then gone and defined our neural network over here which is giving us that model which has a 8.5 million parameters and we're able to go and pass through our frames to get a set of predictions back out which right now doesn't look so great but once but keep in mind we haven't actually gone and trained this so once we train it we're going to get much better predictions alrighty that is our deep neural network now defined let's jump back on over to our client we're on the home stretch just needed to find our loss function and a callback so we can see how the model is progressing nice well Chop Chop then get coding let's do it so we're now pretty close to training our model the first thing that we need to do is to find a learning rate scheduler so this is just basically going to give us a learning rate of whatever we're passing through if we are below 30 epochs if not we're going to drop it down using an exponential function alrighty cool that's now defined the next thing that we're doing is we're defining our CTC loss this particular block of code I'm going to give original credit to this automatic speech recognition model which I believe is defined somewhere a little further down was it from this one I believe it was where's their CTC lost CTC plus yeah it's over here so this basically allows us to use a similar method so they're passing in uh Audio Waves we're going to be passing through videos to be able to go in ahead and do this so what we're doing is we're taking through our batch length we're calculating our input length and our label length and then we're passing it through to tf.keras.backend.ctc batch cost so there isn't a ton of documentation on this which is funnily enough a lot of times you guys ask me Nick should I learn tensorflow or pytorch sometimes where I feel tensorflow falls short is in some of the documentation within some of this nuanced stuff so this is one example where I'd be like I wish I'd gone and learn pytorch but it definitely works very very well regardless of that fact Okay so we've got CTC loss defined there this is going to take in our y True Value our y pred value our input length which is consequently the length of our y prediction value which should be 75 and our label length which should be 40. so this is going to take in our y true predictions which is going to be our alignments this is going to be our one hot encoded predictions this is going to be the value 75 because it's going to be the same shape of the output of our machine learning model and then our label length over here is going to be 40. 
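A sketch of the scheduler and CTC loss just described. The exponential decay factor is an assumption (the video only says the learning rate is dropped exponentially after epoch 30), and the loss follows the Keras automatic speech recognition example that the video credits.

```python
def scheduler(epoch: int, lr: float) -> float:
    # Hold the learning rate for the first 30 epochs, then decay it.
    if epoch < 30:
        return lr
    return lr * tf.math.exp(-0.1)  # decay factor is an assumption

def CTCLoss(y_true, y_pred):
    # Batch size, model output length (75 frames) and label length (40 tokens).
    batch_len = tf.cast(tf.shape(y_true)[0], dtype="int64")
    input_length = tf.cast(tf.shape(y_pred)[1], dtype="int64")
    label_length = tf.cast(tf.shape(y_true)[1], dtype="int64")

    input_length = input_length * tf.ones(shape=(batch_len, 1), dtype="int64")
    label_length = label_length * tf.ones(shape=(batch_len, 1), dtype="int64")

    return tf.keras.backend.ctc_batch_cost(y_true, y_pred, input_length, label_length)
```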
so that is our loss function defined then this is a lot of code but really what we're doing is we're going to be outputting a set of predictions so we're going to Output the original prediction or the original annotation and then the prediction itself in order to do that we're using a special function called tf.keras.backend.ctcd code which is specifically designed to decode the outputs of a CTC trained model which we'll also use to make a prediction down here so this is an example of a callback so I've written class produce example and then to that we are passing through the Keras callbacks function we are going to be subclassing that in order to be able to go and call this callback on every Epoch end so if we go and run that now this should allow us to compile our model so we're then grabbing our model we're running dot compile which is a typical standard python org which is a typical standard Keras graph call in order to compile our models so basically what we're saying is that we're going to be setting our Optimizer to an atom Optimizer with an initial learning rate of 0.0001 we're specifying our loss that's being defined as our CTC loss function which is what is defined over here so if we go and compile this no errors we're looking pretty good okay the next thing the next three things that we're doing uh we're just defining our callbacks so we've got one checkpoint callback which is going to save our model checkpoints so this is originally or we originally imported it right up here so we imported model checkpoint and learning rate scheduler we're now going to Define instances of these so model checkpoint is just defining where we're going to be saving our model so we should probably just create a folder for our model so I'm going to create a new folder call it model models and when our model trains we're going to save our example checkpoints to this particular folder so it's going to be saved inside of models it's going to be called checkpoint we're also going to monitor our loss and we're only going to save our weights which means we'll have to redefine our machine learning model in order to load up these weights we're then creating a scheduler this is effectively going to allow us to drop our learning rate each epoch so let's run our checkpoint callback and a schedule call back and then we're also defining our example callback which is going to make some predictions after each Epoch just to see how well our model is actually training so if we go and run that now all that's left to do is actually going ahead and fit our model so I'm going to bump up this number of epochs to 100 because the final model that I'm going to give you the weights for was as of epoch 96. 
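Roughly how the compile step and the three callbacks described above fit together. The ProduceExample internals (a batch of two samples, greedy ctc_decode over 75-frame outputs) are assumptions based on the shapes discussed earlier, it relies on the num_to_char lookup and test partition defined in previous cells, and the fit call at the end is spelled out just below.

```python
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import ModelCheckpoint, LearningRateScheduler

class ProduceExample(tf.keras.callbacks.Callback):
    """Print a decoded prediction next to the ground truth after every epoch."""
    def __init__(self, dataset) -> None:
        self.dataset = dataset.as_numpy_iterator()

    def on_epoch_end(self, epoch, logs=None) -> None:
        data = self.dataset.next()
        yhat = self.model.predict(data[0])
        # Greedy CTC decode of both videos in the batch (75 frames each).
        decoded = tf.keras.backend.ctc_decode(yhat, [75, 75], greedy=True)[0][0].numpy()
        for x in range(len(yhat)):
            print('Original:  ', tf.strings.reduce_join(num_to_char(data[1][x])).numpy().decode('utf-8'))
            print('Prediction:', tf.strings.reduce_join(num_to_char(decoded[x])).numpy().decode('utf-8'))
            print('~' * 80)

model.compile(optimizer=Adam(learning_rate=0.0001), loss=CTCLoss)

checkpoint_callback = ModelCheckpoint(os.path.join('models', 'checkpoint'),
                                      monitor='loss', save_weights_only=True)
schedule_callback = LearningRateScheduler(scheduler)
example_callback = ProduceExample(test)  # test partition from the pipeline cell

# The fit call described just below.
model.fit(train, validation_data=test, epochs=100,
          callbacks=[checkpoint_callback, schedule_callback, example_callback])
```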
so the last line is model.fit to that we pass through our data if you wanted to have some validation data you could actually just go and pass through validation data specify validation data here I'm not going to use validation data that is not necessarily best practice but you definitely could if you wanted to so the instances of actually keeping this relatively there's probably going to be in our tutorial regardless but you definitely could go and pass through validation data there if you wanted to so model.fit we're passing through our data we're specifying epochs as a hundred and then we're going and passing through all of our callbacks so we'll specify callbacks that we're passing through our checkpoint callback which is this it's basically just saying that we're going to save our model every Epoch a schedule callback which is going to drop our learning rate after we get to Epoch 30 and our example callback which is going to Output our predictions after each Epoch so you can actually see how well or how terrible our machine learning model is actually making predictions so we're actually not going to run it for the 400 I'm actually going to run it for a couple you'll see that it's actually training and then we're going to be able to load up a couple of checkpoints so let's kick this off all things holding equal we should be able to see our model training also this is being trained on a RTX 2070 super so the speed that you're seeing there is effectively that so you can see it's taking around about four minutes four and a half or five minutes per Epoch so let's give this a coupler ebooks and you'll or let's at least give this to a box and then you'll be able to see what the predictions look like a few minutes later alrighty that is true epochs now done so you can see that this is Epoch one and this is Epoch II now I wasn't happy about the fact that I didn't have a training and a testing data set so I went back to the data Pipeline and I added those steps let me show you what I did there so if we scroll on back up what I did is I added three lines of code to be able to go and create this so first up I said that we don't want to reshuffle after each iteration so I added that to the data.shuffle line and then I added these two lines here so first up we're creating a training Partition by taking the first 450 samples and then our testing partition is going to be anything after that so we're running data.skip to grab everything and assigning that to our testing partition then inside of the model.fit method I've just gone and pass through our train data and I've set validation underscore data to test I couldn't live with myself if I didn't actually go and split these out so I went and did it to show you how to do it so that you effectively have best practice because that's what we're all about here getting that little bit better each and every time all right cool let's actually take a look at what's happened so this is Epoch one here so what you're seeing is first up this is the loss for our training data set down here you've also got the loss actually this is the it's the same roughly the same thing this is the validation loss over here so our training loss is 69.0659 our validation loss is 64.34 so not too bad and not too far off if we scroll on down to Epoch 2 our training loss is a 65.58 and our validation loss is 61.24 our learning rate is still at a one a 0.0001 and you can actually see some predictions here now when I was first developing this I was thinking hold on is this just 
What you'll actually see is that the closer you get to around 50, 60, 70 epochs, the better this begins to perform. In this line you can see the original transcription, 'place blue in B7 soon', and then what it predicted, which is kind of crap. Down here, again, kind of crap, even though this was the original annotation. But give it a chance: once you get to around 50 epochs the performance increases significantly and it actually starts performing very, very well. Which brings me to my next bit: I've made the model checkpoints available so you'll be able to leverage them yourself. That said, you'll now also have some checkpoints stored inside the models folder thanks to the checkpoint callback we've already created, so we'll be able to use those too. For now let's jump back over to our client and then we'll be wrapping this up. It's the final countdown; some might say we're in the end game now. Yep, time to use this model to make some predictions. Let's roll. Alright, we're in the final stages of this, so we are now going to make some predictions with a model that is not so crap. First things first, we're going to download the checkpoints. The checkpoints I've made available on Google Drive are from after 96 epochs, so again we're going to use gdown to download them, and it's going to download a file called checkpoints.zip into our root repository. If I run this it should start downloading; it's around about 90-odd megabytes in size. You can see it's downloading and it's going to extract into our models folder. Boom, that's looking promising: it's now saved into our models folder and it's overridden our existing checkpoints, which kind of sucked anyway, so that's perfectly okay. Then we load the checkpoints we just downloaded into our model: using model.load_weights we load up the checkpoint inside the models folder. Once that's loaded we can grab a section of our data, so let's grab test.as_numpy_iterator() and then grab a sample. This is a little bit slow, which is something I noticed with the TensorFlow data pipelines; I think it's because we're using the skip and take methods, and the skip method does take a little bit of time, but it's perfectly okay, just give it a sec and it'll give you a sample back, and then we'll be able to make a prediction. A few minutes later... alright, we've now got some data back (I just spent a couple of minutes scrolling through Twitter while I waited for that). This is looking pretty promising. If we now go and grab a sample out of that... oh, we've already got one on standby, so these two cells are redundant; we've already got this data. Alright, so this is our sample. We can then pass our sample to the model.predict method. Boom, that has made our prediction, and if we go and decode it now, take a look: these are our predictions, and you can see it's actually written 'lay red with L7 again' and 'lay green with G4 please'. Alright, drum roll please, let's see what the actual text was. Take a look: 'lay red with L7 again', 'lay green with G4 please'.
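Something like the following covers the checkpoint download and that first batch of predictions. The Google Drive file id is a placeholder, so grab the real link from the video description or the GitHub repo, and num_to_char is assumed from earlier in the tutorial.

import gdown
import tensorflow as tf

url = 'https://drive.google.com/uc?id=<CHECKPOINT_FILE_ID>'   # placeholder id
gdown.download(url, 'checkpoints.zip', quiet=False)
gdown.extractall('checkpoints.zip', 'models')                  # unzip into the models folder

model.load_weights('models/checkpoint')

sample = next(test.as_numpy_iterator())                        # (frames, labels) batch
yhat = model.predict(sample[0])
decoded = tf.keras.backend.ctc_decode(yhat, input_length=[75] * yhat.shape[0], greedy=True)[0][0].numpy()

print('REAL TEXT:  ', [tf.strings.reduce_join(num_to_char(s)).numpy().decode() for s in sample[1]])
print('PREDICTIONS:', [tf.strings.reduce_join(num_to_char(s)).numpy().decode() for s in decoded])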
It actually performed pretty well. This is the result of passing those frames through the model, and it's decoding relatively well. Now, if we wanted to do this with another video, we'd be able to load it up using the Streamlit app, and that is what I'll be doing in the Code That episode if you guys want it: comment below and give this video a like so it gets a little bit of airtime. Let's go and make another prediction. If we grab another sample and run this again, those are our predictions and this is the real text; we should really swap these around so the real text comes first and then the predictions, to make it make a bit more sense. So the original text was 'set green by J1 soon', and you can see we're grabbing the annotation out of our sample and over here we're grabbing the decoded set. Alright, let's run another sample. The original text was 'lay red at E3 soon', and the second sample was 'place green by R3 again'. Now if we run it through our decoder, let's see what our model predicts: take a look, 'lay red at T3 soon'. It said T3 soon and our actual text was E3 soon, so not too bad. What about over here? 'Place green by R3 soon' versus 'place green by R3 again'; it's actually performing relatively well. My personal thinking, though, is that this doesn't fully tie it together; I still think we need the app. So again, if you reckon we should build another tutorial for the app, let me know in the comments below and we'll build it up. But this is actually making valid predictions, so if we wanted to we could load up a new video and try to test it out. So let's use load_data and grab a video; this is completely unplanned, so let's see if this works. If I go into s1, grab the path to this particular file, pass it through as a relative path into our data folder, and wrap it in tf.convert_to_tensor... alright, let's see how that goes. So that is our dataset, and this should be the bbaf3s video, so let's go and find that clip. If we go into the data folder (let me put my headphones on so I can hear this)... bbaf3s in s1, so this is the video: 'bin blue at F3 soon'. So this is our data: we're going to have our sample and we're going to have our text. Let's add another section, 'Test on a video', and this is going to be outside of our existing data pipeline. We've now got a sample, and if we just call it sample we should be able to reuse the prediction code without much change. Grab element zero and... okay, what's happened there? That's probably because the model expects a batch, so if we run tf.expand_dims and pass axis equals zero, boom, that should make a prediction. Cool.
Then I'm going to copy this cell, paste it there, and paste it below again. Okay, let's minimize this. What have we got? We've got our sample, which is us custom-loading the specific video we just played, the one that sounds like 'bin blue at F3 soon', so we should expect this to print out 'bin blue at F3 soon'. If we now run this, it should make a set of predictions. If we take a look at yhat, that's our set of predictions, and .shape gives us 75 by 41. Sample one is going to get us our real text, so 'for sentence in sample'... what have we done there? Cannot iterate over a scalar tensor. If we just go and wrap sample[1] it should be fine; it doesn't like it because it isn't wrapped, so if I wrap it inside another set of arrays, okay, cool, that is our real text: 'bin blue at F3 soon'. Let's go and validate that: 'bin blue at F3 soon', beautiful. Then if we go and make our predictions, we need to run the sample through the model first, and I think we need to wrap this with tf.expand_dims... nope, no bueno, what's happening? It expects a vector of size four but the input is a vector of size three; this is only going to be 75 over here; what's yhat returning? A shape of 1 by 75 by 41, that should be okay. Okay, that worked: 'bin blue at J3 soon'. So on our own loaded video, run through this pipeline, what was actually said was 'bin blue at F3 soon' and it predicted 'bin blue at J3 soon'; not too far off. Let's try another one. What about this one, p-r-a-c... let's copy this path, so again we pass the relative path, which shows you can do this on a separate video without having to use the data pipeline. If we run this now, take a look: the video said 'place red at C6 now' and our model predicted 'place red at C6 now'. Oh my gosh, guys, how cool is this? That is bang on: it's able to load up a video, decode it, and use lip reading to actually transcribe it. Let's try another one, 'set blue in A2 please'. Let's copy this name and paste it in there, and let's just quickly play it again; keep in mind our model doesn't get any audio, it's just using that little GIF that I showed you to decode this. If I play it: 'set blue in A2 please', that's what we're expecting. So this is our annotation, 'set blue in A2 please', and the prediction comes back 'set blue in A2 please'. How absolutely amazing is that? Let's go find another one from up here, what about lbid4p? Again, you could try out a whole bunch of these videos. Let's play that video: 'lay blue in D4 please'. So this is the actual text, 'lay blue in D4 please'; we run it through our model (we've already made our predictions here, this cell should really be down there), and it predicts 'lay blue in D4 please'. Guys, how absolutely amazing is that? It's actually making valid predictions.
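Pulling that experiment together, a single-video test outside the data pipeline looks roughly like this. It assumes the load_data() helper and num_to_char lookup from earlier in the tutorial, and the file path is only an example.

import tensorflow as tf

# Load one clip directly: load_data is assumed to return (frames, alignment tokens).
sample = load_data(tf.convert_to_tensor('./data/s1/bbaf3s.mpg'))   # example path

print('REAL TEXT: ', tf.strings.reduce_join(num_to_char(sample[1])).numpy().decode())

# The model expects a batch dimension, so wrap the single clip before predicting.
yhat = model.predict(tf.expand_dims(sample[0], axis=0))
decoded = tf.keras.backend.ctc_decode(yhat, input_length=[75], greedy=True)[0][0].numpy()

print('PREDICTION:', tf.strings.reduce_join(num_to_char(decoded[0])).numpy().decode())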
Oh my gosh, I get ecstatic every time I see this. Alright, let's do another one: bras9a, 'bin red at S9 again'. Let's play that again: 'bin red at S9 again'. I really want to build this app now. Okay, keep in mind it's not using any audio. I run it through our model: 'bin red at S9 again'. How absolutely amazing is that? So that is the LipNet model now built. Hopefully you've enjoyed this tutorial; we've been through an absolute ton of stuff. Just to recap: we started out by installing and importing our dependencies, then built our data loading function, which we eventually tweaked a little so that we'd have a training and testing partition. We created our data pipeline, then built and designed our neural network, which roughly mimics what was originally in the paper with a few tweaks. We then trained it using a custom loss function, a custom callback and a learning rate scheduler, and last but not least we made some predictions, and I tested it out on a standalone video. So you've got the script to test this out on whatever video you want; ideally it's going to perform well on videos similar to what we trained on, but we could definitely go and fine-tune it. Let me know if you'd like that video. For now, thanks so much for tuning in, I'll catch you in the next one. Thanks so much for tuning in, guys, hopefully you've enjoyed this video. If you have, be sure to give it a big thumbs up, hit subscribe and hit that bell; it means the absolute world to me. But it doesn't stop here: we're going to be taking this to the next step if you want it. Should we learn how to replace the videos with videos of ourselves, to be able to lip read videos in general? Maybe we should convert this into a Code That episode and build a standalone app to use this out in the real world. Let me know in the comments below. Thanks so much for tuning in, guys.

3 Feb 2023 · CodeThat!

Building a machine learning model that's able to perform lip reading!
Get notified of the free Python course on the home page at https://www.coursesfromnick.com
Sign up for the Full Stack course here and use YOUTUBE50 to get 50% off: https://www.coursesfromnick.com/bundl...
Hopefully you enjoyed this video.
💼 Find AWESOME ML Jobs: https://www.jobsfromnick.com

Can an AI Learn Lip Reading?



ENTIRE TRANSCRIPT NO TIME  

Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér. When watching science fiction movies, we often encounter crazy devices and technologies that don’t really exist, or sometimes, ones that are not even possible to make. For instance, reconstructing sound from vibrations would be an excellent example of that, and could make a great novel with the secret service trying to catch dangerous criminals. Except that it has already been done in real life research. I think you can imagine how surprised I was when I saw this paper in 2014 that showcased a result where a camera looks at this bag of chips, and from these tiny-tiny vibrations, it could reconstruct the sounds in the room. Let’s listen. Yes, this indeed sounds like science fiction. But 2014 was a long-long time ago, and since then, we have a selection of powerful learning algorithms, and the question is, what’s the next idea that sounded completely impossible a few years ago, which is now possible? Well, what about looking at silent footage from a speaker and trying to guess what they were saying? Checkmark, that sounds absolutely impossible to me, yet, this new technique is able to produce the entirety of this speech after looking at the video footage of the lip movements. Let’s listen. Wow. So the first question is, of course, what was used as the training data? It used a dataset with lecture videos and chess commentary from 5 speakers, and make no mistake, it takes a ton of data from these speakers, about 20 hours from each, but it uses video that was shot in a natural setting, which is something that we have in abundance on Youtube and other places on the internet. Note that the neural network works on the same speakers it was trained on and was able to learn their gestures and lip movements remarkably well. However, this is not the first work attempting to do this, so let’s see how it compares to the competition. The new one is very close to the true spoken sentence. Let’s look at another one. Note that there are gestures, a reasonable amount of head movement and other factors at play and the algorithm does amazingly well. Potential applications of this could be video conferencing in zones where we have to be silent, giving a voice to people with the inability to speak due to aphonia or other conditions, or, potentially fixing a piece of video footage where parts of the speech signal are corrupted. In these cases, the gaps could be filled in with such a technique. Look! Now, let’s have a look under the hood. If we visualize the activations within this neural network, we see that it found out that it mainly looks at the mouth of the speaker. That is, of course, not surprising. However, what is surprising is that the other regions, for instance, around the forehead and eyebrows are also important to the attention mechanism. Perhaps this could mean that it also looks at the gestures of the speaker, and uses that information for the speech synthesis. I find this aspect of the work very intriguing and would love to see some additional analysis on that. There is so much more in the paper, for instance, I mentioned giving a voice to people with aphonia, which should not be possible because we are training these neural networks for a specific speaker, but with an additional speaker embedding step, it is possible to pair up any speaker with any voice. This is another amazing work that makes me feel like we are living in a science fiction world. 
I can only imagine what we will be able to do with this technique two more papers down the line. If you have any ideas, feel free to speculate in the comments section below. What a time to be alive! Thanks for watching and for your generous support, and I'll see you next time!

LIP READ

An end-to-end machine learning platform

Find solutions to accelerate machine learning tasks at every stage of your workflow.

TensorFlow in 100 Seconds

Hearing aids could read lips through masks

Conceptual illustration of the proposed lip-reading framework. The framework employs Wi-Fi and radar technologies as enablers of RF-sensing-based lip reading. A dataset comprising the vowels A, E, I, O, U and empty (static/closed lips) is collected using both technologies, with a face mask. The collected data is used to train ML and DL models.

Credit: Nature Communications (2022). DOI: 10.1038/s41467-022-32231-1.


A new system capable of reading lips with remarkable accuracy even when speakers are wearing face masks could help create a new generation of hearing aids.

An international team of engineers and computing scientists developed the technology, which pairs radio-frequency sensing with artificial intelligence for the first time to identify lip movements. The system, when integrated with conventional hearing aid technology, could help tackle the "cocktail party effect," a common shortcoming of traditional hearing aids.

Currently, hearing aids assist hearing-impaired people by amplifying all ambient sounds around them, which can be helpful in many aspects of everyday life. However, in noisy situations such as cocktail parties, hearing aids' broad spectrum of amplification can make it difficult for users to focus on specific sounds, like conversation with a particular person.

One potential solution to the cocktail party effect is to make "smart" hearing aids, which combine conventional audio amplification with a second device to collect additional data for improved performance.

While other researchers have had success in using cameras to aid with lip reading, collecting video footage of people without their explicit consent raises concerns for individual privacy. Cameras are also unable to read lips through masks, an everyday challenge for people who wear face coverings for cultural or religious purposes and a broader issue in the age of COVID-19.

In a new paper published in Nature Communications, the University of Glasgow-led team outline how they set out to harness cutting-edge sensing technology to read lips. Their system preserves privacy by collecting only radio-frequency data, with no accompanying video footage.

To develop the system, the researchers asked male and female volunteers to repeat the five vowel sounds (A, E, I, O, and U) first while unmasked and then while wearing a surgical mask.

As the volunteers repeated the vowel sounds, their faces were scanned using radio-frequency signals from both a dedicated radar sensor and a Wi-Fi transmitter. Their faces were also scanned while their lips remained still.

Then, the 3,600 samples of data collected during the scans were used to "teach" machine learning and deep learning algorithms how to recognize the characteristic lip and mouth movements associated with each vowel sound.

Because the radio-frequency signals can easily pass through the volunteers' masks, the algorithms could also learn to read masked users' vowel formation.

The system proved to be capable of correctly reading the volunteers' lips most of the time. Wi-Fi data was correctly interpreted by the learning algorithms up to 95% of the time for unmasked lips, and 80% for masked. Meanwhile, the radar data was interpreted correctly up to 91% of the time without a mask, and 83% of the time with a mask.
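For readers curious what "teaching" the algorithms looks like in practice, the sketch below shows the general shape of such a vowel classifier. The feature size, network and training settings are assumptions for illustration, not the architecture used in the paper.

import numpy as np
import tensorflow as tf

NUM_CLASSES = 6                                   # A, E, I, O, U and "empty" (static/closed lips)
X = np.random.rand(3600, 128).astype('float32')   # placeholder RF feature vectors
y = np.random.randint(0, NUM_CLASSES, size=3600)  # placeholder vowel labels

model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation='relu', input_shape=(128,)),
    tf.keras.layers.Dense(NUM_CLASSES, activation='softmax'),
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

# Hold out part of the samples to estimate accuracy, analogous to the reported figures.
model.fit(X, y, validation_split=0.2, epochs=10)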

Dr. Qammer Abbasi, of the University of Glasgow's James Watt School of Engineering, is the paper's lead author. He said, "Around 5% of the world's population—about 430 million people—have some kind of hearing impairment.

"Hearing aids have provided transformative benefits for many hearing-impaired people. A new generation of technology which collects a wide spectrum of data to augment and enhance the amplification of sound could be another major step in improving hearing-impaired people's quality of life.

"With this research, we have shown that radio-frequency signals can be used to accurately read vowel sounds on people's lips, even when their mouths are covered. While the results of lip-reading with radar signals are slightly more accurate, the Wi-Fi signals also demonstrated impressive accuracy.

"Given the ubiquity and affordability of Wi-Fi technologies, the results are highly encouraging which suggests that this technique has value both as a standalone technology and as a component in future multimodal hearing aids."

Source: University of Glasgow

11.09.2022


ALL 5 STAR AI.IO PAGE STUDY


How AI & IoT Are Creating An Impact On Industries Today


Hello and welcome to our new site that shares with you the most powerful web platforms and tools available on the web today


Discover the ultimate collection of 5-star AI.io tools for growing your business in 2022/3. Improve your efficiency and productivity for free, or upgrade to Pro for additional benefits.

Unleash the power of artificial intelligence with our curated selection of platforms and tools. Take your business to new heights in 2022/3 with these game-changing solutions.

Elevate your business with the best AI.io tools available on the web. Get the competitive edge you need to succeed in 2022/3, whether you choose the free options or unlock advanced features with a Pro account.

Looking for cutting-edge web platforms? Look no further! Our curated list of AI.io tools guarantees a 5-star experience, empowering your business to thrive and succeed in 2022/3.

Embrace the future of business growth with our AI-powered web platforms. Rated 5 stars and equipped with advanced features, these tools will drive your success in 2022/3. Explore the possibilities today!

A Guide for AI-Enhancing YOUR Existing Business Application

A guide to improving your existing business application of artificial intelligence


What is Artificial Intelligence and how does it work? What are the 3 types of AI?

The 3 types of AI are:

General AI: AI that can perform all of the intellectual tasks a human can. Currently, no form of AI can think abstractly or develop creative ideas in the same ways as humans.

Narrow AI: Narrow AI commonly includes visual recognition and natural language processing (NLP) technologies. It is a powerful tool for completing routine jobs based on common knowledge, such as playing music on demand via a voice-enabled device.

Broad AI: Broad AI typically relies on exclusive data sets associated with the business in question. It is generally considered the most useful AI category for a business. Business leaders will integrate a broad AI solution with a specific business process where enterprise-specific knowledge is required.

How can artificial intelligence be used in business? AI is providing new ways for humans to engage with machines, transitioning personnel from pure digital experiences to human-like natural interactions. This is called cognitive engagement. AI is augmenting and improving how humans absorb and process information, often in real time. This is called cognitive insights and knowledge management. Beyond process automation, AI is facilitating knowledge-intensive business decisions, mimicking complex human intelligence. This is called cognitive automation.

What are the different artificial intelligence technologies in business? Machine learning, deep learning, robotics, computer vision, cognitive computing, artificial general intelligence, natural language processing, and knowledge reasoning are some of the most common business applications of AI.

What is the difference between artificial intelligence, machine learning, and deep learning? Artificial intelligence (AI) applies advanced analysis and logic-based techniques, including machine learning, to interpret events, support and automate decisions, and take actions. Machine learning is an application of AI that provides systems the ability to automatically learn and improve from experience without being explicitly programmed. Deep learning is a subset of machine learning that uses networks capable of learning, unsupervised, from data that is unstructured or unlabeled.

What are the current and future capabilities of artificial intelligence? Current capabilities of AI include personal assistants (Siri, Alexa, Google Home), smart cars (Tesla), behavioral adaptation to improve the emotional intelligence of customer support representatives, machine learning and predictive algorithms that improve the customer's experience, transactional AI like that of Amazon, personalized content recommendations (Netflix), voice control, and learning thermostats. Future capabilities of AI will probably include fully autonomous cars, precision farming, future air traffic control, classrooms with ambient informatics, urban systems, smart cities, and so on.

To know more about the scope of artificial intelligence in your business, please connect with our expert.


Glossary of Terms


Application Programming Interface(API):

An API, or application programming interface, is a set of rules and protocols that allows different software programs to communicate and exchange information with each other. It acts as a kind of intermediary, enabling different programs to interact and work together, even if they are not built using the same programming languages or technologies. APIs provide a way for different software programs to talk to each other and share data, helping to create a more interconnected and seamless user experience.
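As a tiny illustration, calling a REST API from Python might look like this; the URL and parameters are placeholders, not a real service.

import requests

# Ask a (hypothetical) service for data; the API defines the URL and the parameters it accepts.
response = requests.get('https://api.example.com/v1/tools', params={'category': 'ai'})
response.raise_for_status()          # fail loudly if the request was rejected
print(response.json())               # structured data (JSON) returned by the API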

Artificial Intelligence(AI):

The intelligence displayed by machines in performing tasks that typically require human intelligence, such as learning, problem-solving, decision-making, and language understanding. AI is achieved by developing algorithms and systems that can process, analyze, and understand large amounts of data and make decisions based on that data.

Compute Unified Device Architecture(CUDA):

CUDA is a way that computers can work on really hard and big problems by breaking them down into smaller pieces and solving them all at the same time. It helps the computer work faster and better by using special parts inside it called GPUs. It's like when you have lots of friends help you do a puzzle - it goes much faster than if you try to do it all by yourself.

The term "CUDA" is a trademark of NVIDIA Corporation, which developed and popularized the technology.

Data Processing:

The process of preparing raw data for use in a machine learning model, including tasks such as cleaning, transforming, and normalizing the data.
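A small example of what that preparation can look like in Python; the column names and rules here are made up for illustration.

import pandas as pd

df = pd.DataFrame({'age': [25, None, 40, 31],
                   'income': [40000, 52000, None, 61000]})

df = df.dropna()                                                           # cleaning: drop incomplete rows
df['income'] = (df['income'] - df['income'].mean()) / df['income'].std()  # normalizing a column
print(df)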

Deep Learning(DL):

A subfield of machine learning that uses deep neural networks with many layers to learn complex patterns from data.

Feature Engineering:

The process of selecting and creating new features from the raw data that can be used to improve the performance of a machine learning model.

Freemium:

You might see the term "Freemium" used often on this site. It simply means that the specific tool that you're looking at has both free and paid options. Typically there is very minimal, but unlimited, usage of the tool at a free tier with more access and features introduced in paid tiers.

Generative Art:

Generative art is a form of art that is created using a computer program or algorithm to generate visual or audio output. It often involves the use of randomness or mathematical rules to create unique, unpredictable, and sometimes chaotic results.

Generative Pre-trained Transformer(GPT):

GPT stands for Generative Pre-trained Transformer. It is a type of large language model developed by OpenAI.

GitHub:

GitHub is a platform for hosting and collaborating on software projects.


Google Colab:

Google Colab is an online platform that allows users to share and run Python scripts in the cloud.

Graphics Processing Unit(GPU):

A GPU, or graphics processing unit, is a special type of computer chip that is designed to handle the complex calculations needed to display images and video on a computer or other device. It's like the brain of your computer's graphics system, and it's really good at doing lots of math really fast. GPUs are used in many different types of devices, including computers, phones, and gaming consoles. They are especially useful for tasks that require a lot of processing power, like playing video games, rendering 3D graphics, or running machine learning algorithms.

Large Language Model(LLM):

A type of machine learning model that is trained on a very large amount of text data and is able to generate natural-sounding text.

Machine Learning(ML):

A method of teaching computers to learn from data, without being explicitly programmed.

Natural Language Processing(NLP):

A subfield of AI that focuses on teaching machines to understand, process, and generate human language.

Neural Networks:

A type of machine learning algorithm modeled on the structure and function of the brain.

Neural Radiance Fields(NeRF):

Neural Radiance Fields are a type of deep learning model that can be used for a variety of tasks, including image generation, object detection, and segmentation. NeRFs are inspired by the idea of using a neural network to model the radiance of an image, which is a measure of the amount of light that is emitted or reflected by an object.

OpenAI:

OpenAI is a research institute focused on developing and promoting artificial intelligence technologies that are safe, transparent, and beneficial to society.

Overfitting:

A common problem in machine learning, in which the model performs well on the training data but poorly on new, unseen data. It occurs when the model is too complex and has learned too many details from the training data, so it doesn't generalize well.
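One common way to notice and limit overfitting is to watch the validation loss and stop training when it stops improving. Here is an illustrative Keras sketch with placeholder data; the model is deliberately oversized for the toy problem.

import numpy as np
import tensorflow as tf

X = np.random.rand(1000, 10)
y = np.random.randint(0, 2, size=1000)

model = tf.keras.Sequential([
    tf.keras.layers.Dense(256, activation='relu', input_shape=(10,)),  # large enough to memorize
    tf.keras.layers.Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

# Stop once validation loss stops improving, and keep the best weights seen so far.
early_stop = tf.keras.callbacks.EarlyStopping(monitor='val_loss', patience=3,
                                              restore_best_weights=True)
model.fit(X, y, validation_split=0.2, epochs=50, callbacks=[early_stop])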

Prompt:

A prompt is a piece of text that is used to prime a large language model and guide its generation.

Python:

Python is a popular, high-level programming language known for its simplicity, readability, and flexibility (many AI tools use it).

Reinforcement Learning:

A type of machine learning in which the model learns by trial and error, receiving rewards or punishments for its actions and adjusting its behavior accordingly.

Spatial Computing:

Spatial computing is the use of technology to add digital information and experiences to the physical world. This can include things like augmented reality, where digital information is added to what you see in the real world, or virtual reality, where you can fully immerse yourself in a digital environment. It has many different uses, such as in education, entertainment, and design, and can change how we interact with the world and with each other.

Stable Diffusion:

Stable Diffusion generates complex artistic images based on text prompts. It’s an open source image synthesis AI model available to everyone. Stable Diffusion can be installed locally using code found on GitHub or there are several online user interfaces that also leverage Stable Diffusion models.

Supervised Learning:

A type of machine learning in which the training data is labeled and the model is trained to make predictions based on the relationships between the input data and the corresponding labels.

Unsupervised Learning:

A type of machine learning in which the training data is not labeled, and the model is trained to find patterns and relationships in the data on its own.
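The toy example below contrasts the two previous definitions: the same data is used once with labels (supervised) and once without labels (unsupervised). scikit-learn is used here purely for illustration, and the data is synthetic.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.cluster import KMeans

X = np.random.rand(100, 2)
y = (X[:, 0] + X[:, 1] > 1).astype(int)        # labels exist, so this is supervised

clf = LogisticRegression().fit(X, y)           # learns the input-to-label mapping
print(clf.predict(X[:5]))

km = KMeans(n_clusters=2, n_init=10).fit(X)    # no labels: the model finds structure on its own
print(km.labels_[:5])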

Webhook:

A webhook is a way for one computer program to send a message or data to another program over the internet in real-time. It works by sending the message or data to a specific URL, which belongs to the other program. Webhooks are often used to automate processes and make it easier for different programs to communicate and work together. They are a useful tool for developers who want to build custom applications or create integrations between different software systems.
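A minimal webhook receiver, sketched with Flask; the route name and payload shape are assumptions. Another program POSTs JSON to this URL in real time and the code below reacts to it.

from flask import Flask, request

app = Flask(__name__)

@app.route('/webhook', methods=['POST'])
def handle_webhook():
    event = request.get_json(force=True)   # payload sent by the other program
    print('Received event:', event)
    return '', 204                         # acknowledge receipt

if __name__ == '__main__':
    app.run(port=5000)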



