#
        
        
          video-large-language-models
         
       
      
        
        
          
       
     
   
 
  
    
      
        
  
    Here are
    11 public repositories
    matching this topic...
   
    
  
  
  
  
  
  
  
 
  
      
        Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
       
      
    
      
          
            Updated
            Mar 10, 2025 
           
          
            
   
  Jupyter Notebook 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        ✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
       
      
    
      
          
            Updated
            Oct 28, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        Awesome papers & datasets specifically focused on long-term videos. 
       
      
    
   
 
  
  
  
  
  
  
 
  
      
        [ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
       
      
    
      
          
            Updated
            Aug 22, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
       
      
    
      
          
            Updated
            Dec 10, 2024 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"
       
      
    
      
          
            Updated
            Apr 28, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        [NeurIPS'25] HoliTom: Holistic Token Merging for Fast Video Large Language Models
       
      
    
      
          
            Updated
            Oct 10, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models
       
      
    
      
          
            Updated
            Oct 23, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"
       
      
    
      
          
            Updated
            Oct 13, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        [ICCV 2025] Streaming VideoLLMs for Real-time Procedural Video Understanding
       
      
    
      
          
            Updated
            Oct 26, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
  
  
  
  
  
  
 
  
      
        This repo integrates DyCoke's token compression method with VLMs such as Gemma3 and InternVL3
       
      
    
      
          
            Updated
            Oct 8, 2025 
           
          
            
   
  Python 
 
           
       
     
   
 
       
      
          
            
              Improve this page
             
            
              Add a description, image, and links to the
              video-large-language-models 
              topic page so that developers can more easily learn about it.
            
            
              
                Curate this topic
                
     
 
               
            
           
          
            
              Add this topic to your repo
             
            
              To associate your repository with the
              video-large-language-models 
              topic, visit your repo's landing page and select "manage topics."
            
            
              
                Learn more
                
     
 
               
            
           
       
     
   
 
       
   
          
     
  
    
     
 
    
      
     
 
     
    You can’t perform that action at this time.