-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.html
205 lines (161 loc) · 8.66 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="description" content="Pro">
<meta name="author" content="Pradeep">
<title>My Profile</title>
<!-- Bootstrap Core CSS -->
<link href="css/bootstrap.min.css" rel="stylesheet">
<!-- Custom CSS -->
<link href="css/one-page-wonder.css" rel="stylesheet">
<!-- HTML5 Shim and Respond.js IE8 support of HTML5 elements and media queries -->
<!-- WARNING: Respond.js doesn't work if you view the page via file:// -->
<!--[if lt IE 9]>
<script src="https://oss.maxcdn.com/libs/html5shiv/3.7.0/html5shiv.js"></script>
<script src="https://oss.maxcdn.com/libs/respond.js/1.4.2/respond.min.js"></script>
<![endif]-->
</head>
<body>
<!-- Navigation -->
<nav class="navbar navbar-inverse " role="navigation">
<div class="container">
<!-- Brand and toggle get grouped for better mobile display -->
<div class="navbar-header">
<button type="button" class="navbar-toggle" data-toggle="collapse" data-target="#bs-example-navbar-collapse-1">
<span class="sr-only">Toggle navigation</span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
<span class="icon-bar"></span>
</button>
<a class="navbar-brand" href="#">Home</a>
</div>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse" id="bs-example-navbar-collapse-1">
<ul class="nav navbar-nav">
<li>
<a href="#about">About</a>
</li>
<li>
<a href="#services">Project1</a>
</li>
<li>
<a href="#contact">Project2</a>
</li>
</ul>
</div>
<!-- /.navbar-collapse -->
</div>
<!-- /.container -->
</nav>
<!-- Full Width Image Header -->
<header class="header-image">
<div class="headline">
<div class="container">
<h1>Python Developer</h1>
<h2># Eat Pray and Code</h2>
</div>
</div>
</header>
<!-- Page Content -->
<div class="container">
<hr class="featurette-divider">
<!-- First Featurette -->
<div class="featurette" id="about">
<img class="featurette-image img-circle img-responsive pull-right"
src="mypic.png">
<h2 class="featurette-heading">Hello,
<span class="text-muted">a bit about me:</span>
</h2>
<p class="lead">
</p>
<p>
<h4>
I love coding.
I am a python developer and would like to be a part of data science.
I am also studying about r language.
I will update my portfolio as the time goes by however for the time being I am trying to upgrade my technical
skills.
</h4>
</p>
</div>
<hr class="featurette-divider">
<!-- Second Featurette -->
<div class="featurette" id="services">
<img class="featurette-image img-circle img-responsive pull-left" src="https://4.bp.blogspot.com/-8xqOqAdqLaQ/V1AgHje8beI/AAAAAAAABAM/jo8WG5BAHJY3KNB47-pXl0KpX-1ADuyTgCKgB/s1600/web-data-crawling1.jpg">
<h2 class="featurette-heading">Projects
<span class="text-muted">Project | 01 Web Scraping </span>
</h2>
<p class="lead"> <h5>Kindly note : That web scraping was performed on IMDB website only for educational purpose.
Problem Statement
IMDB provides a list of celebrities born on the current date. Below is the link: http://m.imdb.com/feature/bornondate
Get the list of these celebrities from this webpage using web scraping (the ones that are displayed i.e top 10). You have to extract the below information:
1. Name of the celebrity
2. Celebrity Image
3. Profession
4. Best Work
Once you have this list, run a sentiment analysis on twitter for each celebrity and finally the output should be in the below format
1. Name of the celebrity:
2. Celebrity Image:
3. Profession:
4. Best Work:
5. Overall Sentiment on Twitter: Positive, Negative or Neutral
Hint: Use IMDB scrapping sample example as reference for scraping the mentioned web page. For sentiment analysis use the Twitter sentiment code as reference.
Python version 2.7
Tools and Packages Used
• Version: Python 2.7[VERY IMPORTANT]
Tweepy: An easy-to-use Python library for accessing the Twitter API.
Text Blob:
Text Blob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more.
• Beautiful Soup [Symbol] Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree using Python parsers like lxml and html5lib. It automatically converts incoming documents to Unicode and outgoing documents to UTF-8. Here's the documentation.
• Selenium [Symbol] The web driver kit emulates a web-browser (I chose chrome driver) and executes the JS scripts to load the dynamic content.
Challenges Faced during the project
Well as being a complete newbie in programming it took more time to understand the programming concept but the time spent for this project was worth every second.
I understand it took little longer time for me to complete the project .Initially when I started working on IMDB project I used just beautiful soup to scrape the content from IMDB website however I was unable to scrape the required content from the website as there was dynamic script used for images and other contents of the website.
Did thought of using regular expression to scrape the information from IMDB but still because of JavaScript I couldn’t scrape the required details from the website.
Did took help of “Edureka” team to know what needs to be done in order to scrape the JavaScript content from IMDB website . As per the suggestion from Edureka team downloaded the selenium driver and also went ahead and installed the chrome driver (resources used to understand the functionality of selenium driver was YouTube and also Selenium web driver website: http://www.seleniumhq.org/docs/01_introducing_selenium.jsp
Other references:
Reference: http://fruchter.co/post/53164489086/python-headless-web-browser-scraping-on-amazon
Used text blob library to evaluate the sentiment analysis of tweets:
Reference: https://textblob.readthedocs.io/en/dev/</h5></p>
</div>
<hr class="featurette-divider">
<!-- Third Featurette -->
<div class="featurette" id="contact">
<img class="featurette-image img-circle img-responsive pull-right" src="https://s3.amazonaws.com/nicholasyager.com/assets/2015-02-13/wordcloud_freq_tweet-001.png">
<h2 class="featurette-heading">Project | 02 Twitter Sentiment Analysis
Mining Twitter Data with Python
<span class="text-muted">Worked on Django mainframe</span>
</h2>
<p class="lead">
<h4>
I dont call myself Django webframe Professional however I wish to master the django framework someday soon
I think Django is one of the awesome web framework to work with...</h4>
<h5>Twitter Sentiment Analysis Mining Twitter Data with Python:
Python packages used Tweepy and Textblob
Need Twitter API so that you can fetch the tweets for any subject.
Also created Reddit bot using reddit API
Source Code is uploaded in GIT HUB.</h5>
</p>
</div>
<hr class="featurette-divider">
<!-- Footer -->
<footer>
<div class="row">
<div class="col-lg-12">
<p><strong> Email:[email protected]</strong></p>
<p><strong><a href="https://twitter.com/galaxyofmonks">twitter</a>
</strong></p>
</div>
</div>
</footer>
</div>
<!-- /.container -->
<!-- jQuery -->
<script src="js/jquery.js"></script>
<!-- Bootstrap Core JavaScript -->
<script src="js/bootstrap.min.js"></script>
</body>
</html>