Category Archives: preliminaries

Interrogative frequencies in DASG

One aspect of Gaelic I want to look at more closely is interrogatives. Just as all the wh- words in English (who, when, why, what, how) go to the front of the sentence, so do all the c- words in …

Posted in grammar, preliminaries

Three sorts of PP

Le means “with”, roughly, but if you want to say “with X”, there are three different ways of doing it. le Alasdair: “with Alasdair”. This is the form used before a noun phrase that doesn’t begin with a definite article. …

Posted in grammar, preliminaries

Why do we bother with grammatical frameworks?

Most natural languages, like English, French, Chukchi, Basque, Gaelic, Italian, Russian, Latgalian, Finnish, Tamil and so forth, can be reasonably well modelled by a context-free grammar, which is the sort of grammar that people use to define computer languages. Parsers for …

Posted in grammar, preliminaries

Getting OpenCCG to work on the Mac

OpenCCG is a Java/Python toolkit for working on combinatory categorial grammar, so it is ideal for this exercise. Update 2014-07-14: if you’re using OpenCCG 0.95, the latest version, on Mac OS X 10.6.8, then as long as you have Python 2.x …

Posted in other people's code, preliminaries

But what can we tell from the 100 top word tokens?

26 are prepositions of some sort
23 are nouns
10 are conjunctions
10 are verbs
5 are articles
7 are adjectives
7 are pronouns
4 are preverbal particles
2 are adverbs

The number of prepositions is unusually high and indicates …

Posted in grammar, preliminaries

What kind of language is this? The top 100 word tokens in Gaelic

I downloaded all of the Gaelic Wikipedia. This is not hard. It is at http://dumps.wikimedia.org/gdwiki/latest/ and you probably want gdwiki-latest-pages-articles.xml.bz2, which contains all the text. Now I can do word-token counts on it, using terrible code like the following: #!/usr/bin/perl …
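For readers who want to try the same kind of count, here is a minimal sketch in Python. It is not the Perl script from the post (which is cut off in this excerpt), and the tokenisation rule, the function names and the choice to read the compressed dump directly are all assumptions made for illustration. It also makes no attempt to strip XML or wiki markup, so tag and template names will show up among the tokens.

```python
import bz2
import re
from collections import Counter

# Assumed tokenisation: runs of Unicode letters, optionally joined by an
# apostrophe or hyphen (so "dh'fhalbh" stays one token). This is a guess
# at a reasonable rule, not the rule used in the original post.
TOKEN_RE = re.compile(r"[^\W\d_]+(?:['’-][^\W\d_]+)*")


def count_tokens(lines):
    """Count lower-cased word tokens over any iterable of text lines."""
    counts = Counter()
    for line in lines:
        counts.update(TOKEN_RE.findall(line.lower()))
    return counts


def top_tokens_in_dump(path, n=100):
    """Return the n most frequent tokens in a bz2-compressed text dump.

    The file is read line by line, so the whole dump is never held in
    memory at once.
    """
    with bz2.open(path, mode="rt", encoding="utf-8") as handle:
        return count_tokens(handle).most_common(n)
```

Calling `top_tokens_in_dump("gdwiki-latest-pages-articles.xml.bz2")` on a downloaded copy of the dump would give a list of (token, count) pairs, most frequent first.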

Posted in grammar, preliminaries