This post is about phrase-structure grammars, which can be both entertaining and educational.
If you're a linguistics student, you will be interested in this. We’re going to learn how to define
a little set of rules for a made up language, and then generate possible sentences in that
language based on the rules. We can also use it to test if something is grammatical in our
You may already be familiar with phrase structure from linguistics class, or parsing in programming. Regardless, this introduction is accessible for everyone - including novices.
We will first learn the basics of these little rules, and then illustrate by generating random plot summaries for possible episodes of the TV show Midsomer Murders
(à la the Midsomer Murders Bot on twitter) and also Beatles lyrics.
Even Barnaby can see the templatic nature of the show. How many nas do we need to generate this song? Nearley parser We will be using the Nearley parser, a computer program that helps parse se…
Hedvig here. I'm currently writing up my PhD thesis, hence the lack of writing here. Hopefully I'll be able to pick it up after submission, there's a lot of drafts lying on blogger waiting for completion. If you really, really miss me in particular, you could listen to my ramble at Talk the Talk - a weekly show about linguistics.
Now that the shameless plug and excuses are done with, let's get down and talk about:
THE TRANSCRIPTION CHALLENGE!
In this blog post, I will focus on a part of this challenge¹ - the workflow for segmenting and transcribing audio material. This is a rough guide, if it turns out people appreciate something like this I'll re-write it more thoroughly. This is a bit sloppily written in places, but trust me - if I do this "properly" right now I will lose days of work time that I should be spending on my thesis... so, I'll only do it if people really want it - and I might wait a while until I do. Sorry, but it is wh…