downloading subtitles from youtube to help studying japanese
intro
As you may know, I’ve been studying Japanese for a while. It’s a super difficult language for us European language speakers. There’s the alphabets (3!) to learn, the grammar, vocabulary etc.
Right now I am at a point where I don’t have to look up every single vocabulary and have some basic grasp of grammar. I haven’t taken the official test but I am pretty sure I can pass the JLPT N5 test. For all practical purposes, I am as a fluent as a 3 years old native Japanese child.
As I am studying just for fun, ie there’s no real rush on learning the language (as opposed to someone that’s going to live there, or work in a Japanese-speaking environment), I tend to slack a bit.
During my learning journey, I’ve came across the AJATT method, MIA (now called Refold) and other methods. Their point is to immerse as much as possible in the target language, by (preferably) consuming media in the native language, since it’s the most fun.
One thing I do in my Japanese classes is to watch comedy skits. I like comedy because a) it’s fun b) it shows a bit of the culture and c) doesn’t require super an advanced vocabulary.
My favourite is the さんま御殿(Sanma Goten). They even upload their episodes on YouTube.
So at class we go over 1 or 2 skits, for most skits I tend to understand almost everything, except the punch. Luckily my teacher is there to explain the cultural nuances, the slang and whatnot.
Then later I review the episode, the grammar, vocabulary and sometimes create anki cards for them.
But sometimes I simply can’t understand what they are saying. Or I can understand, but can’t find in the dictionary, which is pretty annoying.
Recently I came up with a strategy to deal with that: use youtube auto-generated subtitles to guide me through dialogue. Instead of trying to type out weird vocabulary in the dictionary, I can simply copy paste the word from the subtitle.
meat
For that we need to download the subtitle, surprisingly (or not) youtube-dl supports it.
Given a video, for example this one
It’s just a matter of calling
|
|
Which creates a file
|
|
The flags mostly self explain themselves, the only thing is that to figure out the available languages you first need to run --list-subs
. For example:
|
|
Yields
|
|
Then I can just manipulate that subtitle file as normal:
Or better yet, opening in the browser and leveraging yomichan