Market-Trend-Prediction

This is a project of building knowledge graph course. The project leverages historical stock price, and integrates social media listening from customers to predict market Trend On Dow Jones Industrial Average (DJIA).

Data period: Aug 1, 2016 to Oct 31, 2017. Data range for DJIA: Aug 1, 2016 to Nov 30, 2017. Data Source: Business Insider (record no.: 2,017), Reddit finance(4,383), facebook (11,528), yahoo finance(10,478), Twitter(24,271) Structure Data: Facebook, Twritter.

Predict Results:

Please refer this link to the Jupyter code for prediction results with social media information as part of input features.
Please refer this link to the Jupyter code for prediction results without social media information as input features.

T+1 Prediction
              precision    recall  f1-score   support

Decrease       0.00      0.00      0.00         7
Increase       0.70      1.00      0.82        16

avg / total    0.48      0.70      0.57        23

T+30 Prediction
              precision    recall  f1-score   support

Increase       1.00      1.00      1.00        23

avg / total    1.00      1.00      1.00        23

Program lists:

Programs	Description	Link
JSONLines	Once your crawler download a lot of pages, how can you aggregate all of those files into single one? Json Lines is your answer. The program will package each of your file into single JSON object into the file which will contain multiple JSON objects.	Source Code
JSONLines content classifier program	The input is a json line document and a new json line file containing only the pages relevant to dow 30 companies based on glossary of Dow30 companies and their products. New tags containing company and product names will be added to html body to make it convenient for inferlink. NLTK tools was used for positive, negative message or post recognition. rltk tools was adopted to perform string similarity for web content and glossary of company names and productions.	Source Code
Data Integration program	This program based on csv file of Dow30 companies and add with facebook, twitter social media likes, dislikes into csv for machine learning input.	Source Code
Facebook Crawler for Dow30	This is a crawler program to crawl facebook post via facebook graph api. A special facebook id and Dow 30 companies dictionary are integrated into this version. A CSV with like, dislike will provide by this program for machine learning.	Source Code
Yahoo Crawler for Dow30	This is a crawler program to crawl yahoo finance stock price via yahoo api. A special ticker id and Dow 30 companies dictionary are integrated into this version. A CSV with full company name will provide by this program for machine learning purpose. Input a list of stock ticker and time period for those price data.	Source Code
Twitter Crawler for Dow30 Companies	This program scrapes the offcial Titter accounts of Dow components	Source Code

Team members:

Cheng-Lin Li & YuCheng Guo

Date: Project kick off date

Oct., 2017@University of Southern California

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Market-Trend-Prediction

Predict Results:

Program lists:

Team members:

Date: Project kick off date

Files

README.md

Latest commit

History

README.md

File metadata and controls

Market-Trend-Prediction

Predict Results:

Program lists:

Team members:

Date: Project kick off date