- Transcription should represent all words as spoken – including hesitations, filler words, and false starts.
- Transcriptions must be orthographic, not phonetic. Refer to American Heritage Dictionary for reference: https://ahdictionary.com/
- Transcription should include only upper and lowercase letters, apostrophes, periods, question marks, commas, and spaces. No numbers or other special characters.
- If you cannot understand what the speaker says and the speech is unintelligible, use the footnote [INAUDIBLE hh:mm:ss] as described in this article.
Speech event transcription.
Use orthographic spelling:
“Call your representive.” = “Call your representative.”
“Issall well n’ good darlin’.” = “It’s all well and good darling.”
“The volcano said: I lava you.” = “The volcano said I lava you.”
“He’s 6 ft 2!” = “He’s six foot two.”
“I live in Cambridge, Mass.” = “I live in Cambridge, Mass.”
“Talk to Dr. Smith at Cal.” = “Talk to Doctor Smith at Cal.”
Stumbled speech and corrections.
“Directions to the… to the… the hotel” = “Directions to the… To the… The hotel.”
“Ale… Alexa play Janet Jackson… no wait…” = “Ale*… Alexa play Janet Jackson… No, wait.”
- Use end-punctuations (full stop, question mark) to indicate the end of a complete sentence.
- Use punctuation symbols that are an essential part of the word, such as apostrophes.
- Use commas to break up long stretches of speech. This is to facilitate reading comprehension.
- AVOID semi-colons.
Acronyms or Spelled Out Words.
“My name is John – jay, oh, eich, en”. = “My name is John J O H N.”
“I work at IBM” = “I work at I B M.”
“I work at NASA” = “I work at NASA.”
- To separate items in a list of three or more, using the serial (aka Oxford) comma (i.e., the comma before the conjunction that joins the last two elements:
I enjoy skydiving, snowboarding, and mountain biking.
- To set off a direct address:
Maryam, listen to me carefully.
I'm not calling you, my friends, just to whine about my life.
- To break up the compound and complex sentences:
I would like to join you, but I'm afraid I have class at that time.
Marcos and I couldn't go to the jazz concert, so we watched it on TV instead.
- To set off introductory words and phrases:
Therefore, they cancelled their trip.
After taking a break, the team resumed their meeting.
- Around parenthetical phrases:
That report on the New York Times was, to say the least, a bombshell.
Getting a hotel by the sea, like the one we stayed last year, would be superb.
“That’s where it’s at” = “That’s where it’s at.”
“Project Q’s timeline” = “Project Q’s timeline.”
“Sinead O’Connor” = “Sinead O’Connor.”
“Eleven o’clock” = “Eleven o’clock.”
“Read Jess’ email” = “Read Jess email.”
“…exa, stop the mu…” = “*Exa stop the mu*…”
“Ale… alexa … stop the mu… the music.” = “Ale*… Alexa stop the mu*… The music”
“I have like $0” = “I have like zero dollars.”
“It was great/weird” = “It was great slash weird.”
“… and +, she didn’t know!” = “and plus she didn’t know.”
“My email is m-golden@...” = “My email is M dash golden at.”
“http://www.amazon.com” = “H T T P colon slash slash W W W dot Amazon dot com”
“http://www.wikipedia.org” = “H T T P colon slash slash W W W dot Wikipedia dot O R G”
“I want to visit Oregon” = “I want to visit Oregon.”
“I work at IBM” = “I work at I B M”
“George W Bush paints now” = “George W Bush paints now.”
“I’m going to Mexico on Thursday” = “I’m going to Mexico on Thursday.”
"5" should be transcribed as five.
“5th” should be transcribed as fifth.
“306” = should be: three hundred and six, three O six, or three zero six depending on how it was pronounced.
“Play radio 109.4 FM” = “play radio one O nine point four F M”.
“Beverly Hills, 90210” = “Beverly Hills nine O two one oh”.
“25” = twenty five.
“I work for IBM.” = “I work for I B M”
“I work for NASA.” = “I work for NASA”
“Check it out on IMDB” = “Check it out on I M D B”
“I like ZZ Top.” = “I like Z Z Top.”
“The SATs are nerve-wracking.” = “The S A Ts are nerve wracking.”
“He’s from Washington DC’s downtown.” = “He’s from Washington D C’s downtown.”
Unintelligible words and phrases.
“Alexa play ???? on spotify” = “Alexa play [INAUDIBLE 00:22:13] on Spotify”
“Alexa read ????? from audible.” = “Alexa read Cat In The Hat [sic 00:00:07] from audible”
Non-speech sound inventory.
When a nonverbal sound occurs such as a yawning, applauses, music, interjections, or a filler word you should always register it by using the F8 key.
These are the nonverbal sounds you must register (categorized):
- ah -- Expression of surprise, pain, etc.
- eh -- For example: "Eh, you"
- er -- The speaker is thinking
- ew -- Expression of disgust or disgust
- jeez -- Expression of surprise or annoyance
- mm -- for example: "Mm, is delicious"
- nah -- The speaker is saying no
- oh -- Expression of surprise, pain, etc.
- uh -- Question or confusion expression
- uh-huh -- The speaker is nodding
- uh-oh -- The speaker is saying no
- um -- The speaker is thinking
- whew -- Relief expression
- whoa -- Surprise expression
- yay -- Expression of happiness
- yep -- The speaker is nodding
Other human sounds:
- Yawn, breath, or sigh
- Sneeze, cough, or throat
- Crying or sob
- Cheers -- Hip hip hurra
- Other human sounds - different to all listed
- Beep -- Answering machine
- Telephone dialing
- Static - Continuous background noise
- Voice recorder answering machine
- Music or singing
- Keyboard sounds
- Telephone ring
- Other ambient sounds - different to all listed
“Wait … *click* *click click* *click* there.” = “wait [click hh:mm:ss] there”
“I will abso-*ring*-lutely open it!” = “I will [ring hh:mm:ss] absolutely open it”
Check this article for full details.