InstaVariety Dataset

Raw Data

In addition to releasing our pre-processed tfrecords (see below) that contain the exact InstaVariety data we introduced in Learning 3D Human Dynamics from Video, we also release the full dataset that we downloaded from Instagram. In particular, to generate InstaVariety, we scraped 92 movement-oriented Instagram tags (full tag list) for 1000 posts each. Instagram posts can be either videos or photos, so after filtering out the photos, we end up with many fewer videos than 1000 per tag. Across the 92 tags, we collected 28,272 videos in total. We provide a file InstaVariety.json that contains the links to each of these videos, along with relevant metadata for each video, including:

{
 'edge_media_preview_like': {'count': int}, 
 'urls': [str], //(note: we use as filename) 
 'edge_media_to_caption': {'edges': [{'node': {'text': str}}]}, //(note: caption str also contains tags) 
 'dimensions': {'width': int, 'height': int}, 
 'tags': [str], //(note: list of tag strings, without '#')
 'edge_media_to_comment': {'count': int}, 
 'video_view_count': int, 
 'comments_disabled': boolean, //(note: not always available)
 'download_tag': str, //(note: corresponds to tag we used to identify and download this video) 
 'edge_liked_by': {'count': int}, 
 'shortcode': str, //(note: the shortcode is used to construct the download link) 
 'taken_at_timestamp': int, //(note: reported in unix time) 
 'video_link': str, //(note: of the form 'https://www.instagram.com/p/{}'.format(shortcode))
 'is_video': boolean, //(note: should always be True) 
 'id': str
}

We also provide a convenience script download_insta_variety.py for downloading the videos. The script relies on youtube-dl, which can be downloaded from here. The script can be run as:

foo@bar:~$ python download_insta_variety.py --savedir /path/to/your/save/directory

To use the same test split of videos that we used in Learning 3D Human Dynamics from Video, we provide the list of test split videos in test_set_video_list.txt.

Pre-processed tfrecords

We provide our pre-processed tfrecords, which can be accessed here. Details of the tfrecord format can be found in doc/datasets.md.

Visualizing the tfrecords

Once you download the tfrecords, you can visualize them by:

foo@bar:~$ python -m src.datasets.visualize_train_tfrecords --data_rootdir /path/to/your/tfrecord/rootdir --dataset insta_variety

Where /path/to/your/tfrecord/rootdir contains the insta_variety directory with tfrecords in a train subdirectory.

Citation

If you use this data for your research, please cite:

@InProceedings{humanMotionKZFM19,
  title={Learning 3D Human Dynamics from Video},
  author = {Angjoo Kanazawa and Jason Y. Zhang and Panna Felsen and Jitendra Malik},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2019}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

insta_variety.md

insta_variety.md

InstaVariety Dataset

Raw Data

Pre-processed tfrecords

Visualizing the tfrecords

Citation

Files

insta_variety.md

Latest commit

History

insta_variety.md

File metadata and controls

InstaVariety Dataset

Raw Data

Pre-processed tfrecords

Visualizing the tfrecords

Citation