sk5268/Video_Summary

Video_Summary

Summarize a video using the 2-million-token long context window of Gemini 1.5.


Requirements

  1. The google-generativeai Python package: pip install google-generativeai
  2. A Gemini API key

Why Gemini?

Historically, large language models (LLMs) were significantly limited by the amount of text (or tokens) that could be passed to the model at one time. The Gemini 1.5 long context window, with near-perfect retrieval (>99%), unlocks many new use cases and developer paradigms.
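To get a feel for what a long context window means for video, here is a rough token-budget sketch. The per-second rates are assumptions based on figures quoted for Gemini 1.5 (about 263 tokens per second of video and 32 per second of audio), and the function names are hypothetical. For exact numbers on a real file, count tokens through the API.

```python
import math

# Assumed approximate tokenization rates for Gemini 1.5; not exact values.
VIDEO_TOKENS_PER_SECOND = 263
AUDIO_TOKENS_PER_SECOND = 32
CONTEXT_WINDOW_TOKENS = 2_000_000  # the 2M window this README refers to


def estimate_video_tokens(duration_seconds: float, include_audio: bool = True) -> int:
    """Roughly estimate how many tokens a video of the given length uses."""
    rate = VIDEO_TOKENS_PER_SECOND
    if include_audio:
        rate += AUDIO_TOKENS_PER_SECOND
    return math.ceil(duration_seconds * rate)


def fits_in_context(
    duration_seconds: float,
    prompt_tokens: int = 1_000,
    window_tokens: int = CONTEXT_WINDOW_TOKENS,
) -> bool:
    """Check whether the video plus a prompt budget fits in the window."""
    return estimate_video_tokens(duration_seconds) + prompt_tokens <= window_tokens


if __name__ == "__main__":
    for minutes in (10, 60, 180):
        seconds = minutes * 60
        print(minutes, "min:", estimate_video_tokens(seconds), "tokens,",
              "fits" if fits_in_context(seconds) else "does not fit")
```

Under these assumed rates, a one-hour video takes up roughly half of the 2M window, which leaves room for the prompt and follow-up questions.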

With its 2 million token context window, the model can hold the entire video in context at once. This enables tasks such as generating comprehensive summaries, answering in-depth questions about specific scenes, identifying patterns across the video, performing detailed video-to-text analysis, and holding multi-turn conversations about the video's content without losing context.
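The summarization workflow described above can be sketched with the google-generativeai package as follows. This is a minimal sketch, not necessarily how this repository implements it: the GEMINI_API_KEY environment variable, the function names, the prompt wording, and the gemini-1.5-pro model choice are all assumptions.

```python
import os
import time


def build_prompt(max_words: int = 200) -> str:
    """Build the summarization instruction (hypothetical wording)."""
    return (
        f"Summarize this video in at most {max_words} words. "
        "Cover the main topics in the order they appear."
    )


def summarize_video(
    video_path: str,
    model_name: str = "gemini-1.5-pro",  # assumed model name
    poll_seconds: float = 5.0,
) -> str:
    """Upload a video with the File API and ask the model for a summary."""
    api_key = os.environ.get("GEMINI_API_KEY")  # assumed variable name
    if not api_key:
        raise RuntimeError("Set the GEMINI_API_KEY environment variable first.")
    if not os.path.isfile(video_path):
        raise FileNotFoundError(video_path)

    # Imported here so the checks above run even without the SDK installed.
    import google.generativeai as genai

    genai.configure(api_key=api_key)

    # Uploaded videos are processed server-side before they can be used.
    video_file = genai.upload_file(path=video_path)
    while video_file.state.name == "PROCESSING":
        time.sleep(poll_seconds)
        video_file = genai.get_file(video_file.name)
    if video_file.state.name == "FAILED":
        raise RuntimeError("The service could not process the video.")

    model = genai.GenerativeModel(model_name=model_name)
    response = model.generate_content(
        [video_file, build_prompt()],
        request_options={"timeout": 600},  # long videos can take a while
    )
    return response.text


if __name__ == "__main__":
    print(summarize_video("video.mp4"))  # hypothetical file name
```

Because the whole video sits in the context window, follow-up questions can reuse the same uploaded file with a different prompt instead of re-chunking the video.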
