forked from SEACrowd/seacrowd-datahub
-
Notifications
You must be signed in to change notification settings - Fork 0
/
video.py
27 lines (25 loc) · 810 Bytes
/
video.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
"""
General Video-to-Text Schema, including:
- Video captioning
- Video to text retrieval
Video datasets can be very large. For datasets with remote videos
('video_path' = video URL), ensure that the URL is publicly accessible
and the video is downloadable. Extra caution is needed, as the URL
might contain harmful and/or malicious files.
"""
import datasets
features = datasets.Features(
{
"id": datasets.Value("string"),
"video_path": datasets.Value("string"),
"text": datasets.Value("string"),
"metadata": {
"resolution": {
"width": datasets.Value("int64"),
"height": datasets.Value("int64"),
},
"duration": datasets.Value("float32"),
"fps": datasets.Value("float32"),
},
}
)