# AlphaVideo
**Repository Path**: dagongji10/AlphaVideo
## Basic Information
- **Project Name**: AlphaVideo
- **Description**: 动作识别
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-06-21
- **Last Updated**: 2020-12-19
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
## Introduction
AlphaVideo is an open-sourced video understanding toolbox based on [PyTorch](https://pytorch.org/) covering multi-object tracking and action detection.
In AlphaVideo, we released the first one-stage multi-object tracking (MOT) system **TubeTK** that can achieve 66.9 MOTA on [MOT-16](https://motchallenge.net/results/MOT16) dataset and 63 MOTA on [MOT-17](https://motchallenge.net/results/MOT17) dataset.
For action detection, we released an efficient model **AlphAction**, which is the first open-source project that achieves 30+ mAP (32.4 mAP) with single model on [AVA](https://research.google.com/ava/) dataset.
## Quick Start
### pip
Run this command:
```shell
pip install alphavideo
```
### from source
Clone repository from github:
```bash
git clone https://github.com/Alpha-Video/AlphaVideo.git alphaVideo
cd alphaVideo
```
Setup and install AlphaVideo:
```bash
pip install .
```
## Features & Capabilities
* #### Multi-Object Tracking
For this task, we provide the [TubeTK](https://github.com/BoPang1996/TubeTK) model which is the official implementation of paper
"TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model (CVPR2020, **oral**)."
Detailed training and testing script on [MOT-Challenge](https://motchallenge.net/) datasets can be found [here](https://github.com/BoPang1996/TubeTK).
* Accurate end-to-end multi-object tracking.
* Do not need any ready-made image-level object deteaction models.
* Pre-trained model for pedestrian tracking.
* Input: Frame list; video.
* Output: Videos decorated by colored bounding-box; Btube lists.
* For details usages, see our [docs](https://github.com/Alpha-Video/AlphaVideo/wiki).
* #### Action recognition
For this task, we provide the [AlphAction](https://github.com/MVIG-SJTU/AlphAction) model as an implementation of paper ["Asynchronous Interaction Aggregation for Action Detection"](https://arxiv.org/abs/2004.07485).
* Accurate and efficient action detection.
* Pre-trained model for 80 atomic action categories defined in [AVA](https://research.google.com/ava/).
* Input: Video; camera.
* Output: Videos decorated by human boxes, attached with corresponding action predictions.
* For details usages, see our [docs](https://github.com/Alpha-Video/AlphaVideo/wiki).
## Paper and Citations
```
@inproceedings{pang2020tubeTK,
title={TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model},
author={Pang, Bo and Li, Yizhuo and Zhang, Yifan and Li, Muchen and Lu, Cewu}
booktitle={CVPR},
year={2020}
}
@article{tang2020asynchronous,
title={Asynchronous Interaction Aggregation for Action Detection},
author={Tang, Jiajun and Xia, Jin and Mu, Xinzhi and Pang, Bo and Lu, Cewu},
journal={arXiv preprint arXiv:2004.07485},
year={2020}
}
```
## Maintainers
This project is open-sourced and maintained by Machine Vision and Intelligence Group ([MVIG](http://mvig.sjtu.edu.cn)) in Shanghai Jiao Tong University.