TimeRefine [[Paper](https://arxiv.org/abs/2412.09601)]
Official PyTorch implementation of the paper "TimeRefine: Temporal Grounding with Time Refining Video LLM".
We follow the same data preparation pipeline as VTimeLLM; please refer to the VTimeLLM training instructions for downloading the pretrained models and datasets. The stage 2 and stage 3 training files and the best checkpoint can be downloaded here.
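After downloading, it can help to verify that the files landed where the training scripts expect them. The sketch below is only an assumption for illustration: the directory `./data` and the file names `stage2.json` and `stage3.json` are hypothetical placeholders, not the actual names used by this repository; adjust them to match the VTimeLLM data layout.

```python
# Hypothetical sanity check for downloaded training data.
# DATA_ROOT and the file names below are assumptions, not the repo's real layout.
from pathlib import Path

DATA_ROOT = Path("./data")           # assumed download location
EXPECTED_FILES = [
    "stage2.json",                   # assumed name for the stage 2 training file
    "stage3.json",                   # assumed name for the stage 3 training file
]

def check_downloads(root: Path = DATA_ROOT) -> None:
    """Print which expected training files are present under `root`."""
    for name in EXPECTED_FILES:
        path = root / name
        status = "found" if path.is_file() else "MISSING"
        print(f"{status:>7}: {path}")

if __name__ == "__main__":
    check_downloads()
```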
For installation, please check out install_env.md.
For training, please check out train_scripts.md.
For evaluation, please check out eval_scripts.md.
We sincerely appreciate the incredible projects that contributed to the development of TimeRefine:
- LLaVA: Large Language and Vision Assistant
- FastChat: An Open Platform for Training, Serving, and Evaluating Large Language Model based Chatbots
- Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
- LLaMA: Open and Efficient Foundation Language Models
- Vid2seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
- InternVid: A Large-scale Video-Text dataset
- VTimeLLM: Empower LLM to Grasp Video Moments
- VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
If you use TimeRefine in your research or applications, please cite it with the following BibTeX:
```bibtex
@misc{wang2024timerefinetemporalgroundingtime,
      title={TimeRefine: Temporal Grounding with Time Refining Video LLM},
      author={Xizi Wang and Feng Cheng and Ziyang Wang and Huiyu Wang and Md Mohaiminul Islam and Lorenzo Torresani and Mohit Bansal and Gedas Bertasius and David Crandall},
      year={2024},
      eprint={2412.09601},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2412.09601},
}
```

Looking forward to your feedback, contributions, and stars! 🌟
