jamalir
jamalir
gEEkstr33t
151 posts
spreading the news in something you can barely see
Don't wanna be here? Send us removal request.
jamalir 5 months ago
Text
Paper page - Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources
0 notes
jamalir 5 months ago
Text
rasbt/llama-3.2-from-scratch 路 Hugging Face
0 notes
jamalir 5 months ago
Text
0 notes
jamalir 6 months ago
Text
https://x.com/llamafactory_ai/status/1893879214727991504?t=0rz_iG3YO_ppFRatiDqh0A&s=09
0 notes
jamalir 6 months ago
Text
Meta AI Releases the Video Joint Embedding Predictive Architecture (V-JEPA) Model: A Crucial Step in聽Advancing Machine Intelligence - MarkTechPost
0 notes
jamalir 6 months ago
Text
Paper page - Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
0 notes
jamalir 6 months ago
Text
SmolVLM2: Bringing Video Understanding to Every Device
0 notes
jamalir 6 months ago
Text
Magma: A Foundation Model for Multimodal AI Agents
0 notes
jamalir 7 months ago
Text
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face
1 note View note
jamalir 7 months ago
Text
TGI Multi-LoRA: Deploy Once, Serve 30 Models
0 notes
jamalir 7 months ago
Text
Paper page - DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
0 notes
jamalir 7 months ago
Text
Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL! | Qwen
0 notes
jamalir 7 months ago
Text
Open-R1: a fully open reproduction of DeepSeek-R1
4 notes View notes
jamalir 7 months ago
Text
GitHub - DAMO-NLP-SG/VideoLLaMA3: Frontier Multimodal Foundation Models for Image and Video Understanding
0 notes
jamalir 8 months ago
Text
Paper page - MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
1 note View note
jamalir 8 months ago
Text
Paper page - 1.58-bit FLUX
0 notes
jamalir 8 months ago
Text
Apollo: An Exploration of Video Understanding in Large Multimodal Models
0 notes