-
Notifications
You must be signed in to change notification settings - Fork 24
/
index.html
103 lines (97 loc) · 5.51 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
---
layout: default
title: Scrapinghub Learning Center
---
<section class="on-demand-courses">
<h2>Free Video Courses To Learn Web Scraping</h2>
<div class="wrap container-fluid">
<div class="courses">
<!-- Scrapy -->
<div class="row course-summary">
<div class="col-md-3 course-icon">
<h2>Learn Scrapy</h2>
<p>(9 videos)</p>
{% include icons/icon-scrapy.svg %}
<a href="{{ site.baseurl }}/scrapy" class="primary-cta">Start now</a>
</div>
<div class="col-md-9">
<h3>With the evergrowing amount of data spread around the web, the need for gathering and structuring that data is also increasing day by day. This is exactly where web scraping comes into play.</h3>
<p>In this quick video course, you'll learn everything you need to get started with web scraping using Python and Scrapy. Among other things, you'll learn how to:</p>
<ul>
<li>Extract data from the web using CSS selectors</li>
<li>Follow pagination buttons with a spider</li>
<li>Handle websites that use infinite scrolling</li>
<li>Authenticate your spider in a website</li>
<li>Deploy and run your spiders in the cloud</li>
</ul>
</div>
</div>
</div>
</div>
</section>
<section class="training">
<div class="wrap container-fluid">
<div class="col-md-offset-1 col-md-10 col-md-offset-1">
<h2>Personalized Training Program</h2>
<div class="row">
<div class="col-md-1">
<i class="fa fa-info-circle" aria-hidden="true"></i>
</div>
<div class="col-md-11">
<h4>About</h4>
<p>Our personalized training program is all you need to get you and your team up and running with <a href="https://scrapy.org">Scrapy</a> and a modern web scraping technology stack. You'll start from the basics and gradually learn the most common challenges you'll face in the day-to-day job of a web scraping expert.</p>
<p>The training program is priced on a per-seat basis and delivered over the course of 2 weeks through a combination of all-hands sessions and one-one coachings</p>
</div>
</div>
<div class="row">
<div class="col-md-1">
<i class="fa fa-check-square-o" aria-hidden="true"></i>
</div>
<div class="col-md-11">
<h4>Requirements</h4>
<ul>
<li>Basic Python knowledge</li>
<li>A basic understanding of how the web works</li>
<li>Basic HTML and CSS selectors syntax</li>
</ul>
<p>Check out a <a href="http://github.com/scrapinghub/scrapy-training">preview of the course material.</a></p>
</div>
</div>
<div class="row">
<div class="col-md-1">
<i class="fa fa-map" aria-hidden="true"></i>
</div>
<div class="col-md-11">
<h4>Organization</h4>
<p>This program consists of 6 one-hour units spread around two weeks:</p>
<ul>
<li>Extracting data with Scrapy (1h)</li>
<li>Navigating websites with Scrapy (1h)</li>
<li>Running Spiders in the Cloud (1h)</li>
<li>Handling HTML Forms (1h)</li>
<li>Scraping JavaScript based pages (1h)</li>
<li>Extending Scrapy (1h)</li>
</ul>
<p>Right after a unit, each trainee gets 30 minutes of individual coaching with the instructor.</p>
</div>
</div>
<div class="training-topics">
<h3>What you'll learn</h3>
<ul>
<li><i class="fa fa-check" aria-hidden="true"></i> Extract data from web pages using CSS selectors</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Build crawlers that follow through the links in a website</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Simulate the user behavior in your spiders</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Scrape JavaScript-based websites</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Identify the best strategy to deal with challenges</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Extract data hidden behind login walls</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Extend Scrapy via middlewares and pipelines</li>
<li><i class="fa fa-check" aria-hidden="true"></i> A toolset to scrape effectively</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Best practices in web scraping and web crawling</li>
<li><i class="fa fa-check" aria-hidden="true"></i> Deploy and run your crawlers in a cloud platform</li>
</ul>
</div>
</div>
</div>
<a href="/request/" class="primary-cta">Schedule a personalized training</a>
</section>
{% include footer.html %}