Skip to content

Commit 0fe0b19

Browse files
committed
NLP course 4
1 parent 8452369 commit 0fe0b19

File tree

190 files changed

+193003
-4
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

190 files changed

+193003
-4
lines changed

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 1/Assignment 1- Neural Machine Translation.ipynb

+2,233
Large diffs are not rendered by default.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 1/w1_unittest.py

+590
Large diffs are not rendered by default.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/Assignment 2 - Transformer Summarizer.ipynb

+2,952
Large diffs are not rendered by default.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-test.tfrecord-00000-of-00001

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00000-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00001-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00002-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00003-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00004-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00005-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00006-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00007-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00008-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00009-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00010-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00011-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00012-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00013-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00014-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00015-of-00016

Whitespace-only changes.

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-validation.tfrecord-00000-of-00001

Whitespace-only changes.
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,138 @@
1+
{
2+
"citation": "@article{DBLP:journals/corr/SeeLM17,\n author = {Abigail See and\n Peter J. Liu and\n Christopher D. Manning},\n title = {Get To The Point: Summarization with Pointer-Generator Networks},\n journal = {CoRR},\n volume = {abs/1704.04368},\n year = {2017},\n url = {http://arxiv.org/abs/1704.04368},\n archivePrefix = {arXiv},\n eprint = {1704.04368},\n timestamp = {Mon, 13 Aug 2018 16:46:08 +0200},\n biburl = {https://dblp.org/rec/bib/journals/corr/SeeLM17},\n bibsource = {dblp computer science bibliography, https://dblp.org}\n}\n\n@inproceedings{hermann2015teaching,\n title={Teaching machines to read and comprehend},\n author={Hermann, Karl Moritz and Kocisky, Tomas and Grefenstette, Edward and Espeholt, Lasse and Kay, Will and Suleyman, Mustafa and Blunsom, Phil},\n booktitle={Advances in neural information processing systems},\n pages={1693--1701},\n year={2015}\n}",
3+
"description": "CNN/DailyMail non-anonymized summarization dataset.\n\nThere are two features:\n - article: text of news article, used as the document to be summarized\n - highlights: joined text of highlights with <s> and </s> around each\n highlight, which is the target summary",
4+
"downloadSize": "585439472",
5+
"location": {
6+
"urls": [
7+
"https://github.com/abisee/cnn-dailymail"
8+
]
9+
},
10+
"name": "cnn_dailymail",
11+
"schema": {
12+
"feature": [
13+
{
14+
"name": "article",
15+
"type": "BYTES"
16+
},
17+
{
18+
"name": "highlights",
19+
"type": "BYTES"
20+
}
21+
]
22+
},
23+
"splits": [
24+
{
25+
"name": "test",
26+
"numBytes": "49879392",
27+
"numShards": "10",
28+
"shardLengths": [
29+
"11490"
30+
],
31+
"statistics": {
32+
"features": [
33+
{
34+
"bytesStats": {
35+
"commonStats": {
36+
"numNonMissing": "11490"
37+
}
38+
},
39+
"name": "article",
40+
"type": "BYTES"
41+
},
42+
{
43+
"bytesStats": {
44+
"commonStats": {
45+
"numNonMissing": "11490"
46+
}
47+
},
48+
"name": "highlights",
49+
"type": "BYTES"
50+
}
51+
],
52+
"numExamples": "11490"
53+
}
54+
},
55+
{
56+
"name": "train",
57+
"numBytes": "1260529260",
58+
"numShards": "100",
59+
"shardLengths": [
60+
"17945",
61+
"17944",
62+
"17945",
63+
"17944",
64+
"17945",
65+
"17944",
66+
"17945",
67+
"17944",
68+
"17945",
69+
"17945",
70+
"17944",
71+
"17945",
72+
"17944",
73+
"17945",
74+
"17944",
75+
"17945"
76+
],
77+
"statistics": {
78+
"features": [
79+
{
80+
"bytesStats": {
81+
"commonStats": {
82+
"numNonMissing": "287113"
83+
}
84+
},
85+
"name": "article",
86+
"type": "BYTES"
87+
},
88+
{
89+
"bytesStats": {
90+
"commonStats": {
91+
"numNonMissing": "287113"
92+
}
93+
},
94+
"name": "highlights",
95+
"type": "BYTES"
96+
}
97+
],
98+
"numExamples": "287113"
99+
}
100+
},
101+
{
102+
"name": "validation",
103+
"numBytes": "57678453",
104+
"numShards": "10",
105+
"shardLengths": [
106+
"13368"
107+
],
108+
"statistics": {
109+
"features": [
110+
{
111+
"bytesStats": {
112+
"commonStats": {
113+
"numNonMissing": "13368"
114+
}
115+
},
116+
"name": "article",
117+
"type": "BYTES"
118+
},
119+
{
120+
"bytesStats": {
121+
"commonStats": {
122+
"numNonMissing": "13368"
123+
}
124+
},
125+
"name": "highlights",
126+
"type": "BYTES"
127+
}
128+
],
129+
"numExamples": "13368"
130+
}
131+
}
132+
],
133+
"supervisedKeys": {
134+
"input": "article",
135+
"output": "highlights"
136+
},
137+
"version": "3.0.0"
138+
}
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,19 @@
1+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-test.tfrecord-00000-of-00001
2+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00000-of-00016
3+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00001-of-00016
4+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00002-of-00016
5+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00003-of-00016
6+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00004-of-00016
7+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00005-of-00016
8+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00006-of-00016
9+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00007-of-00016
10+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00008-of-00016
11+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00009-of-00016
12+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00010-of-00016
13+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00011-of-00016
14+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00012-of-00016
15+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00013-of-00016
16+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00014-of-00016
17+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-train.tfrecord-00015-of-00016
18+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/cnn_dailymail-validation.tfrecord-00000-of-00001
19+
https://storage.googleapis.com/tensor2tensor-data/tmp/data/cnn_dailymail/plain_text/3.0.0/dataset_info.json

Diff for: Natural Language Processing/Course 4 - Natural Language Processing with attention models/Assignments/Week 2/model.pkl.gz

Whitespace-only changes.

0 commit comments

Comments
 (0)