P2TK, the Penn Phonetics Toolkit, developed in the Penn Linguistics Phonetics Lab, is a collection of Python and Praat scripts and other tools to aid speech research.
Unless otherwise noted, scripts here can be freely copied under the terms of the MIT open source license and should be cited by their name, the last modified date of the script, and as a part of the package "P2TK: Penn Phonetics Toolkit". We have a SourceForge project page.
Suggestions and patches with improvements to the scripts are welcome, sent to the contact person associated with each script.
The scripts are generally fairly simple and documentation for how to use them is located in the comment block at the top of the script.
syllabifier is a Python script to perform
automated syllabification of phonemic transcriptions
in English. A syllabified version of cmudict is available.
Contact:
freq is a Python script to build a frequency or conditional frequency table and then to convert that to probabilities or conditional probabilities.
Praat-Py is an extension to Praat that allows scripts to be written in Python, rather than in Praat's own scripting language.
quickTrans: a simple shell interface for transcription. The audio file is first broken up into small utterance groups which can be easily transcribed one at a time. Requires Praat and Python. Contact:
extractFormants: a tool for extracting vowel formants from a large corpus of phoneme-aligned speech. Requires Python and either Praat or ESPSContact: