Transcribing ActivityNet into descriptions

Installation

I used Python 3.11.4, but other versions probably work fine too.

Download the repo, set up a virtual environment with python -m venv .venv, activate it, then run

pip install -r requirements.txt

NOTE: These requirements are bloated and should be trimmed aggressively.

Usage

This repo transcribes video streams from the ActivityNet dataset into .description files.

The videos are downloaded in random batches using FiftyOne. Watch out: running this script for too long ~~can~~ will get you banned from YouTube.

The camera-server must be running on the same machine for this client to work, because the client queries the server worker at http://localhost:40000.

You can start transcribing ActivityNet into descriptions with

cd [ROOT OF THIS REPO]
./activitynet.sh 2>&1 | tee activitynet.sh.log
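
Because the client depends on the server worker at http://localhost:40000, it can help to confirm that the port accepts connections before starting a long run. The following is a minimal sketch, not part of this repo: the function name server_reachable is hypothetical, and it only checks that a TCP connection can be opened, not that the camera-server answers requests correctly.

```python
import socket


def server_reachable(host: str = "localhost", port: int = 40000,
                     timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    Hypothetical helper: it only tests that something is listening,
    not that the listener is actually the camera-server.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and name resolution errors.
        return False


if __name__ == "__main__":
    if server_reachable():
        print("Something is listening on localhost:40000")
    else:
        print("Nothing reachable on localhost:40000; start the camera-server first")
```

If the check fails, start the camera-server before launching activitynet.sh.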

Statistics

Here are some shell commands together with their outputs.

$ cd ./dataset-zoo

# Number of .description files (videos watched)
$ find . -type f -name "*.description" | wc -l
6603

# Number of descriptions (text lines)
$ find . -type f -name "*.description" -exec cat {} \; | wc -l
326815

# Number of words including timestamps
$ find . -type f -name "*.description" -exec cat {} \; | wc -w
25747879

# Number of words excluding timestamps (in the descriptions only)
# (assuming one timestamp per line)
# 25747879 - 326815 = 25421064 ≈ 25M words ≈ 34M tokens

# Cat a random description file
$ find . -type f -name "*.description" | shuf -n 1 | xargs cat

# Cat a random description file without timestamps
$ find . -type f -name "*.description" | shuf -n 1 | xargs cat | cut -d" " -f2-
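
The same counts can be gathered in one pass from Python. This is a rough sketch under assumptions, not code from this repo: the function name description_stats is hypothetical, and it assumes every line starts with a single whitespace-delimited timestamp, which is what the cut -d" " -f2- command above implies. Counts may differ slightly from wc -l if a file lacks a trailing newline.

```python
from pathlib import Path


def description_stats(root: str) -> dict:
    """Count files, lines and words in all *.description files under root.

    Assumption: the first whitespace-delimited token on each line is
    the timestamp, so it is excluded from 'words_without_timestamps'.
    """
    files = [p for p in Path(root).rglob("*.description") if p.is_file()]
    n_lines = 0
    n_words = 0
    n_words_no_ts = 0
    for path in files:
        text = path.read_text(encoding="utf-8", errors="replace")
        for line in text.splitlines():
            tokens = line.split()
            n_lines += 1
            n_words += len(tokens)
            # Drop the leading timestamp token, if the line has one.
            n_words_no_ts += max(len(tokens) - 1, 0)
    return {
        "files": len(files),
        "lines": n_lines,
        "words": n_words,
        "words_without_timestamps": n_words_no_ts,
    }


if __name__ == "__main__":
    # Run from the directory that contains the .description files.
    for name, value in description_stats(".").items():
        print(f"{name}: {value}")
```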

mvsoom/camera-activitynet-client
