- Pick a video to transcribe (if the column "Transcriber" has a "?", that means it needs a transcriber).
- Open an issue and indicate which video you are working on.
- You can start with the given YouTube preliminary text.
- Attention to detail is a very important requirement for this task.
- Good spelling and grammar are critical to this task.
- Being comfortable reading, writing, and listening to English is important for this task.
- Helpful: Some familiarity with Markdown.
- Resource: Mastering Markdown
- Helpful: Knowing Git, so that edits can be submitted as a pull request (PR).
- Resource: Git Intro Workshop
Here are some examples of transcripts that are well-edited:
- scikit-learn Pull Request Example: 2020/06-reshama-shaikh-sklearn-pr.md
- Emily Robinson Career Talk: 2020/10-emily-robinson-career.md
Here is what the raw markdown file looks like:
https://raw.githubusercontent.com/data-umbrella/event-transcripts/main/2020/06-reshama-shaikh-sklearn-pr.md
You can submit edits in one of these formats:
- GitHub Pull Request: Edit the file on GitHub and submit a PR that way.
- Pull Request: Use a code editor and submit a PR (Markdown file) using the command line.
- Capitalize the first letter of each sentence.
- Example: change a lowercase "i", as in "i think", to a capital letter: "I think ..."
- Break the text into paragraphs (not one unbroken block for the whole file).
- Add timestamps to the sections so people know where to find each one in the video. Breaking it down every 10 minutes or so is acceptable. (That would be 6 timestamps for a 1-hour talk.)
- Example of a sub-section heading with a timestamp:
### Set Up Repository (3:46)
- Add a period to the end of each sentence.
- Fix spelling.
- Example: PY Ladies ===> PyLadies
- Example: "data umbrella" ===> Data Umbrella
- Example: "reishmas" ===> Reshama
- When I did this, I listened to the video for a couple of sentences and then paused it. Then I edited the text and restarted the video for another sentence or two.
- It can take a good 2 to 4 hours to edit a transcript for a 1-hour video.
- It is not necessary to include screenshots of slides.
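Some of the mechanical fixes above (capitalizing "i", capitalizing the first letter, adding a final period, correcting known misspellings) could be given a first pass with a small script before the manual listen-and-edit pass. The following is only a minimal sketch: the function name `clean_sentence` and the `SPELLING_FIXES` table are hypothetical and are not part of this repository, and the table holds only the two generic examples listed above.

```python
import re

# Hypothetical spelling fixes, seeded from the examples in the guidelines.
# Extend this table per video (speaker names, library names, and so on).
SPELLING_FIXES = {
    r"\bpy\s*ladies\b": "PyLadies",
    r"\bdata umbrella\b": "Data Umbrella",
}


def clean_sentence(sentence: str) -> str:
    """Apply the mechanical fixes from the guidelines to one sentence."""
    text = sentence.strip()
    if not text:
        return text
    # Capitalize the standalone pronoun "i" (also covers "i'm", "i'll").
    text = re.sub(r"\bi\b", "I", text)
    # Apply the known spelling fixes, ignoring case.
    for pattern, replacement in SPELLING_FIXES.items():
        text = re.sub(pattern, replacement, text, flags=re.IGNORECASE)
    # Capitalize the first letter of the sentence.
    text = text[0].upper() + text[1:]
    # Add a period if the sentence has no closing punctuation.
    if text[-1] not in ".?!":
        text += "."
    return text


print(clean_sentence("i think data umbrella is great"))
print(clean_sentence("welcome to PY Ladies"))
```

A pass like this does not replace listening to the video: simple patterns can misfire (for example, on "i.e."), and paragraph breaks, timestamps, and speaker-specific names still need a human editor.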
Here is an example of a standard header in a transcript file:
# Ty Shaikh: Scraping Poshmark Data with Python
## Key Links
- Transcript: https://github.com/data-umbrella/event-transcripts/blob/main/2020/03-ty-shaikh-webscraping.md
- Meetup Event: https://www.meetup.com/nyc-data-umbrella/events/270285046/
- Video: https://youtu.be/0L1uM_18TTA
- Slides: not applicable
- GitHub repo: https://github.com/ty-shaikh/scraping_poshmark_webinar
- Binder: https://mybinder.org/v2/gh/ty-shaikh/scraping_poshmark_webinar/master?filepath=1-scraping_poshmark_listings.ipynb
- Transcriber: ? [needs a transcriber]
## Transcript
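If it helps when starting a new transcript file, the header layout shown above could be generated from its fields. This is a sketch only: `build_header` is a hypothetical helper, not a tool provided by this repository, and the link labels are simply whatever is passed in.

```python
def build_header(title: str, links: dict[str, str]) -> str:
    """Return a transcript header: title, key links, then the transcript heading."""
    lines = [f"# {title}", "## Key Links"]
    # One bullet per link, in the order given.
    lines += [f"- {label}: {value}" for label, value in links.items()]
    lines.append("## Transcript")
    return "\n".join(lines)


header = build_header(
    "Ty Shaikh: Scraping Poshmark Data with Python",
    {
        "Video": "https://youtu.be/0L1uM_18TTA",
        "Slides": "not applicable",
        "Transcriber": "? [needs a transcriber]",
    },
)
print(header)
```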
Our videos typically run 1 hour in recording duration and can take 2 to 4 hours to edit.
It is perfectly OK, and very much welcome and appreciated, to submit edits in 10-minute segments.
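To see how a recording divides into 10-minute segments, the start time of each segment can be computed as in the sketch below. The function names are hypothetical; the `M:SS` format follows the `(3:46)` heading example in the guidelines, and the `H:MM:SS` form for videos longer than an hour is an assumption.

```python
def format_timestamp(total_seconds: int) -> str:
    """Format seconds as M:SS, or H:MM:SS once past an hour (assumed format)."""
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{seconds:02d}"
    return f"{minutes}:{seconds:02d}"


def segment_starts(duration_minutes: int, segment_minutes: int = 10) -> list[str]:
    """Return the start timestamp of each segment in the video."""
    return [
        format_timestamp(start * 60)
        for start in range(0, duration_minutes, segment_minutes)
    ]


print(segment_starts(60))
# ['0:00', '10:00', '20:00', '30:00', '40:00', '50:00']
```

For a 1-hour video this gives six segments, any one of which can be edited and submitted on its own.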