Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate Mixpeek Video Embedding Service #65

Open
6 tasks
esteininger opened this issue Aug 28, 2024 · 0 comments
Open
6 tasks

Integrate Mixpeek Video Embedding Service #65

esteininger opened this issue Aug 28, 2024 · 0 comments

Comments

@esteininger
Copy link

esteininger commented Aug 28, 2024

Description

We should integrate the Mixpeek video embedding service into the multi-modal starter kit to enhance the video processing capabilities. This will allows us to generate embeddings for video chunks, which can be used for more advanced video analysis and search functionality.

Proposed Implementation

  1. Add Mixpeek as a dependency to the project.

  2. Create a new utility function in src/utils/ to handle video embedding:

    • Initialize the Mixpeek client with an API key (to be stored in .env).
    • Process video chunks using mixpeek.tools.video.process().
    • Generate embeddings for each chunk using mixpeek.embed.video().
    • Store the embeddings along with their corresponding time ranges.
  3. Update the video processing pipeline to include the embedding step.

  4. Modify the Tigris storage implementation to store the embeddings alongside the video data.

  5. Update the .env.example file to include the Mixpeek API key.

  6. Add documentation in the README about the new video embedding feature.

Tasks

  • Add Mixpeek dependency
  • Create video embedding utility function
  • Integrate embedding into video processing pipeline
  • Update Tigris storage to handle embeddings
  • Update environment variable setup
  • Update documentation

Additional Considerations

  • We need to decide on the optimal chunk interval and resolution for our use case.
  • Consider adding a configuration option to enable/disable video embedding.
  • Evaluate the impact on processing time and storage requirements.
  • Explore potential use cases for the embeddings (e.g., semantic search, scene detection).

Resources

Please comment on this issue if you have any questions or suggestions regarding the integration of Mixpeek video embedding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant