Inside Multimodal LLaMA 3.2: Understanding Meta’s Vision-Language Model Architecture. In September 2024, Meta released Llama 3.2 with multimodal (MLLaMA), their latest advancement in multimodal AI that integrates vision and… (Nov 28, 2024)
The Quest for Q*: Leveraging Q-Learning to Enhance LLMs. LLM learning to solve grade level math problems (May 3, 2024)
OpenAI Sora’s Technical Review. Exploring Sora: A Deep Dive into OpenAI’s Latest Innovation (Feb 19, 2024)
Looking for Funds for Your Startup. I started the startup in May. I hope to give advice on how we got $5K in AWS and $3K in Google credits, along with a lot of feedback, as an early-stage startup. (Oct 30, 2020)