
21-S2-2-C-Cinema

1. Project Introduction

Cinefly is a media-tech company aiming to develop the most advanced patented storytelling and file platform. The project team mainly focuses on developing machine learning algorithms to extract data, such as the user's name, daily activities, interests, and spending habits, from videos provided by Cinefly. Another function the team may develop is to classify the extracted data into three categories: demographic, psychographic, and behavioural information, and then store this tagged information in a database.



2. Project Team

| Name | E-mail | Role | Areas of Expertise |
| --- | --- | --- | --- |
| Jiaye Li | [email protected] | Software Programmer | Data Science, NLP, Writing Skills |
| Jiawei Fan | [email protected] | Software Developer, Project Manager | Machine Learning, Computer Vision, NLP |
| Yuliang Ma | [email protected] | Software Programmer, Spokesperson | Programming, Writing Skills, Machine Learning |
| Yuchen Wang | [email protected] | Software Developer, Software Tester | Machine Learning, CV, Debugging |
| Tao Qu | [email protected] | Software Developer, Project Manager | Machine Learning, NLP, Programming |
| Yixian Qiu | [email protected] | Software Programmer, Spokesperson | Machine Learning, Data Science, NLP |
| Xiaoxiang Kong | [email protected] | Software Tester, Database Administrator | Machine Learning, Programming, Debugging |



3. Project Documents



4. User Manual

a. Overview

This project developed an algorithm for Cinefly that uses machine learning to extract key information from videos and build user profiles.

The project adopted an agile development approach and was divided into two development stages. This user manual explains the output of the second phase of the project.

b. Code Structure

In the second stage of development, the team produced a total of five demos. Demo 1 and demo 2 are early iterations of demo 3. Demo 4 explores a technical route that ultimately failed. Demo 5 is a successful alternative to demo 4 and can inform follow-up work on this project.

The files of the project code package include:
  Demo 1: DemoV1.py
  Demo 2: DemoV2_part1.py, DemoV2_part2.py
  Demo 3: DemoV3_part1.py, DemoV3_part2.py
  Demo 4: DemoV4.py
  Demo 5: DemoV5.py

Demo 3's CV detection module (from the first stage of this project; a usage sketch follows these file lists):
  age_deploy.prototxt
  age_gender_detect.py
  age_net.caffemodel
  gender_deploy.prototxt
  gender_net.caffemodel
  opencv_face_detector.pbtxt
  opencv_face_detector_uint8.pb
  MIT License for age & gender part

Other files:
  mask_rcnn_coco.h5 (pre-trained model used in Demo 4)
  README.md
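
For orientation, here is a rough sketch of how the bundled face/age/gender model files listed above are commonly wired together with OpenCV's DNN module. This is an outline only; the actual logic lives in age_gender_detect.py.

```python
# Sketch: loading the bundled face/age/gender models with OpenCV's DNN module.
# Treat this as an outline; age_gender_detect.py may differ in detail.
import cv2

# Face detector (used to locate and crop faces before classification; cropping not shown here).
face_net = cv2.dnn.readNet("opencv_face_detector_uint8.pb", "opencv_face_detector.pbtxt")
age_net = cv2.dnn.readNet("age_net.caffemodel", "age_deploy.prototxt")
gender_net = cv2.dnn.readNet("gender_net.caffemodel", "gender_deploy.prototxt")

AGE_BUCKETS = ['(0-2)', '(4-6)', '(8-12)', '(15-20)', '(25-32)', '(38-43)', '(48-53)', '(60-100)']
GENDERS = ['Male', 'Female']
MEAN = (78.4263377603, 87.7689143744, 114.895847746)  # mean values used by these Caffe models

def detect_age_gender(face_bgr):
    """Classify the age bucket and gender of a cropped face image (BGR)."""
    blob = cv2.dnn.blobFromImage(face_bgr, 1.0, (227, 227), MEAN, swapRB=False)
    gender_net.setInput(blob)
    gender = GENDERS[gender_net.forward()[0].argmax()]
    age_net.setInput(blob)
    age = AGE_BUCKETS[age_net.forward()[0].argmax()]
    return age, gender
```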

Note:
  The Google Cloud service key used by each demo is not included in the final delivery. You need to download a service key from Google Cloud, rename it to "GCKey.json", and place it in the project directory.
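
A minimal sketch of how the key is usually supplied to the Google client libraries (the demos may load it differently):

```python
# Point the Google Cloud client libraries at the service key placed in the project directory.
import os
os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = "GCKey.json"
```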

c. Functions and Usage Description

Demo 1 extracts the voice from a single input video and converts it into text. It has been encapsulated as a Python function and is used in demo 3, so you do not need to run demo 1 separately.
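
As an illustration of the kind of step demo 1 wraps, here is a hedged sketch that extracts the audio track with ffmpeg and transcribes it with Google Speech-to-Text. The actual function in DemoV1.py may use different tools and parameters.

```python
# Sketch of a "video voice -> text" step (assumptions: ffmpeg on PATH, google-cloud-speech installed).
import subprocess
from google.cloud import speech

def video_to_text(video_path, language_code="en-US"):
    """Extract the audio track from a video and transcribe it with Google Speech-to-Text."""
    wav_path = "temp_audio.wav"
    # Convert to 16 kHz mono WAV so the recognizer accepts it.
    subprocess.run(["ffmpeg", "-y", "-i", video_path, "-ac", "1", "-ar", "16000", wav_path],
                   check=True)
    client = speech.SpeechClient()  # reads GCKey.json via GOOGLE_APPLICATION_CREDENTIALS
    with open(wav_path, "rb") as f:
        audio = speech.RecognitionAudio(content=f.read())
    config = speech.RecognitionConfig(
        encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=16000,
        language_code=language_code,
    )
    response = client.recognize(config=config, audio=audio)
    return " ".join(result.alternatives[0].transcript for result in response.results)
```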

Demo 2 is an early iteration of demo 3. Run DemoV2_part2.py to launch it. It parses the voice information of all videos in two given folders and saves the results to PersonInfo.csv. You can adjust the locations of the two input directories in DemoV2_part2.py.

PersonInfo.csv contains the file name, user name, age (from CV detection), gender (from CV detection), and voice text for each video.
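
The sketch below shows the PersonInfo.csv layout implied by this description, written with csv.DictWriter. The column names and the example row are illustrative; DemoV2_part2.py may name or order the columns differently.

```python
# Sketch: the PersonInfo.csv layout described above (column names are illustrative).
import csv

FIELDS = ["file_name", "user_name", "age", "gender", "voice_text"]

def write_person_info(rows, path="PersonInfo.csv"):
    """rows: list of dicts with the FIELDS keys, one per processed video."""
    with open(path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)

# Example row (values are made up for illustration only):
write_person_info([{
    "file_name": "shot_1_user42.mp4",
    "user_name": "Alice",
    "age": "(25-32)",
    "gender": "Female",
    "voice_text": "Hi, my name is Alice and I love hiking ...",
}])
```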

After obtaining PersonInfo.csv, you need to manually log in to AWS and use the custom-model entity detection service in Amazon Comprehend to extract the tags you are interested in. For specific steps, please refer to the Amazon Comprehend documentation.
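
For reference, the same custom entity detection job can also be started programmatically with boto3 instead of the console. The region, bucket, role ARN, and recognizer ARN below are placeholders you must replace with your own resources.

```python
# Sketch: starting an Amazon Comprehend custom entity detection job with boto3
# (alternative to the console steps described above; all ARNs and URIs are placeholders).
import boto3

comprehend = boto3.client("comprehend", region_name="us-east-1")
job = comprehend.start_entities_detection_job(
    JobName="cinefly-person-info",
    LanguageCode="en",
    EntityRecognizerArn="arn:aws:comprehend:us-east-1:123456789012:entity-recognizer/your-model",
    DataAccessRoleArn="arn:aws:iam::123456789012:role/your-comprehend-role",
    InputDataConfig={"S3Uri": "s3://your-bucket/PersonInfo.csv", "InputFormat": "ONE_DOC_PER_LINE"},
    OutputDataConfig={"S3Uri": "s3://your-bucket/comprehend-output/"},
)
print(job["JobId"])
```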

Demo 3 is the main deliverable of this project. It can process videos from different users stored in different directories, extract information from different videos of the same user, and integrate that information. Before running this demo, please make sure you save the original videos in the correct layout: first create a folder named "video_cut" in the project directory, and inside it create folders for the eight shots in the storyboard following the "shot_#" naming format, such as "shot_1". Finally, store each shot's videos in the corresponding directory (a small sketch of this layout follows).
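
A minimal sketch that creates the expected folder layout:

```python
# Sketch: creating the input layout Demo 3 expects (eight shot folders under video_cut).
import os

for i in range(1, 9):
    os.makedirs(os.path.join("video_cut", f"shot_{i}"), exist_ok=True)
# Then copy each user's clip for shot N into video_cut/shot_N/ before running DemoV3_part1.py.
```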

After saving the original videos correctly, run DemoV3_part1.py first to get All_Txt.csv. This file contains a user's voice information across all shots. Then follow similar steps to demo 2: log in to Amazon Comprehend, extract the entities you are interested in, and obtain Amazon's result file, "output".

Next, modify the two input directories in DemoV2_part2.py to point to "shot_1" and "shot_7" inside "video_cut", then run it to obtain PersonInfo.csv. At this point, three files should exist in your directory at the same time: PersonInfo.csv, output, and All_Txt.csv. Now run DemoV3_part2.py. This script parses the output into a readable form and combines the three files to generate Final.csv, which contains all the information you are interested in.
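
The following is a hedged outline of how the three intermediate files might be combined; the real merge keys and column handling in DemoV3_part2.py may differ, and the concatenation here assumes all three files keep the same user/shot order.

```python
# Sketch: combining PersonInfo.csv, the Comprehend "output" file, and All_Txt.csv into Final.csv.
import json
import pandas as pd

person_info = pd.read_csv("PersonInfo.csv")
all_txt = pd.read_csv("All_Txt.csv")

# Amazon Comprehend returns one JSON object per line; flatten the detected entities.
entities = []
with open("output", "r") as f:
    for line_no, line in enumerate(f):
        if not line.strip():
            continue
        doc = json.loads(line)
        entities.append({"line": line_no,
                         "entities": "; ".join(e["Text"] for e in doc.get("Entities", []))})
entity_df = pd.DataFrame(entities)

# Merge on the row index (assumption: all three files share the same ordering).
final = pd.concat([person_info, all_txt, entity_df], axis=1)
final.to_csv("Final.csv", index=False)
```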

Demo 4 and demo 5 are both attempts to further improve the accuracy of demo 3. Demos 1-3 all use NLP technology, while demos 4 and 5 try to use CV technology to identify and classify all objects in the video. The Mask R-CNN approach used in demo 4 proved to require extremely high computing power, so this technical route was abandoned.

Demo 5 uses the Google Vision API. It requires the same video storage layout as demo 3. The final output file, uid_shots.csv, contains all the objects detected in the videos. In demo 5, the parameters you can adjust are the probability threshold in runDemoV5() and the number of frames sampled and uploaded per video in processSingleVideo(). (Due to cost and necessity considerations, we cannot upload every frame of a video.)
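
To illustrate the idea, the sketch below samples a few evenly spaced frames with OpenCV and labels objects with the Google Vision API's object localization. The function and parameter names here are illustrative and are not the actual signatures in DemoV5.py.

```python
# Sketch: sampling frames from a video and detecting objects with the Google Vision API.
import cv2
from google.cloud import vision

def detect_objects_in_video(video_path, frames_to_upload=5, score_threshold=0.6):
    """Return the names of objects detected in a handful of sampled frames."""
    client = vision.ImageAnnotatorClient()  # uses GCKey.json via GOOGLE_APPLICATION_CREDENTIALS
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    detected = set()
    # Sample evenly spaced frames instead of uploading every frame (cost consideration).
    for idx in range(0, total, max(1, total // frames_to_upload)):
        cap.set(cv2.CAP_PROP_POS_FRAMES, idx)
        ok, frame = cap.read()
        if not ok:
            continue
        _, buf = cv2.imencode(".jpg", frame)
        image = vision.Image(content=buf.tobytes())
        response = client.object_localization(image=image)
        for obj in response.localized_object_annotations:
            if obj.score >= score_threshold:  # probability threshold, analogous to runDemoV5()
                detected.add(obj.name)
    cap.release()
    return sorted(detected)
```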

If you need technical support, please contact our team! Thank you very much!
