Convert .srt to text paragraphs

UPDATED

in

by

Tags:


Have captions? Need paragraphs? Lost your .txt file from transcribeme? Here’s the method I found.

Note: SRTs don’t have actual “paragraph” indicators. All the newlines in an SRT work to make it function as a caption — what line shows with what other line, at what time. All semantically meaningful paragraph returns got stripped out when the text was originally made into an SRT. If you want the resulting transcript to read like any kind of prose, you will need to edit some paragraphs.

Method 1: One sentence per paragraph

  1. Paste your SRT file in here: Extract Text only from subtitle and remove timestamps
  2. Copy that result into a text editor that can do a Regex find and replace (I did this in TextMate)
  3. Find: (?<![\n.?!]"|[\n.?!])\n+
  4. Replace with a single space.

Optional if you have empty lines you want to remove

  1. Find: \n\n
  2. Replace with \n

Method 2: Wall of text

  1. Paste your SRT file in here: Extract Text only from subtitle and remove timestamps
  2. Paste the resulting text in here: Line break removal tool