This is the next piece. Consider that today there are 800 million videos on YouTube. The average length of each of those videos is about 12 minutes. The average human speaks at 150 words per minute. If we take those 1700 words, multiply that by 800 million videos, we may get 1.4 trillion words of […]
Yes today readers of my paid mailing list the memo got a bit of a surprise. Besides getting an invitation to mid-journey where they can go play around with text to image generation similar to Dolly, maybe better than Dolly according to some but a bit dark for me. They also got 100 page PDF […]
Hey everyone, my name is Devya. I’m representing the CARES team at Google. I’ll be talking about diffusion models, what can be done with them and how to enable them using CARES. So what are diffusion models and why should you care? Historically, you could generate fake shoe pictures. These aren’t really real shoes, not […]
Dear Fellow Scholars, this is 2 Minute Tapers with Dr. Karo Zsolnai-Fehir, when creating a video game, an animated movie, or to create any believable virtual world, among many other things we need geometry. Tons and tons of geometry, both to store and to render images of our characters and the environment. And having lots […]
Hello folks, I was in New Orleans last week and I had the pleasure of interviewing Laura Ruiz, the primary author on this paper, large language models are not zero-shot communicators. Now this is exploring the ability of language models to perform in clicker check, which I guess from a machine learning audience point of […]
Deep mind out of London recently released a visual language model. 80 billion parameters across the board, 70 billion of those from Chinchilla, the large language model, plus an additional 10 billion parameters from images. They’re calling this model Flamingo. I quite like that name. We’ve gone gofa, Chinchilla, Flamingo. This model is not publicly […]
Seriously, is that enough to pull this off? Jiju saw the latest hot open source model released by OpenAI transcribing this guy. It’s called Whisper and it’s really really good. Well, in reality what Jiju saw didn’t happen in real time. That was just me and this video. But I did put together some code […]
Dear Fellow Scholars, this is “Two Minute Papers” with Dr. Károly Zsolnai-Fehér. Earlier, we talked about AI-based techniques that can learn to clone your voice, and then we can perform text-to-speech. So, it would learn this… I think they have to change that. Further details are expected later. And then, the AI would generate this. […]
Our first story is the most interesting one. Brain reading is more and more becoming a thing. There is a paper called Seeing Beyond the Brain, conditional diffusion models with Sparse Masked Modeling for Vision decoding.
GPT-3 was announced nearly two years ago in May 2020. It came out a year after the original GPT paper was published. OpenAI CEO Sam Altman stated a few months ago that GPT4 is on the way.
OpenAI’s new Whisper AI is able to listen to what we say, and transcribe it. Your voice goes in, and this text comes out. Like this. This is incredible and it is going to change everything! As you see, when running through these few sentences, it works with flying colors.
A lot of the text video models have recently come out, but not only that, a lot of other stuff has happened too, such as multiplayer, stable, diffusion, and OpenAI is looking for even more money from Microsoft. Stay tuned, this is ML News. Hello everyone, as you can see, I’m not in my usual […]
Hi everybody, thanks for joining us. Hopefully we’ll have Mira on screen in a moment. Let me apologize first of all if you were expecting to see Will Heven from MIT Tech Review here moderating this. Will is ill, so I’m filling in for him. But hopefully we’ll have Mira here on screen in a […]