Author Archives: Colin Batchelor

piuthar

The immediate family members in Scottish Gaelic are màthair, athair, bràthair, all of which are clearly related to other familiar European languages, and piuthar, “sister”, which looks odd. Irish is yet odder at first glance, with deartháir meaning “brother” and deirfiúr meaning “sister”. I’ve … Continue reading

Posted in Uncategorized | Leave a comment

Second Celtic Language Technology Workshop revised deadline April the 20th

… which is next Wednesday rather than this Friday. Or if you’re in the UK or Ireland it’s very early next Thursday, but clearly nobody reading this would leave submission till the last moment. No.

Posted in conferences | Leave a comment

Numbers

(1) Dìreach aona mìos deug roimhe sin… “Just eleven months before that”. In my annotation guidelines I have blithely stated “Attributive numbers are N/N“, which is fine for aona, but less so for deug, which I am going to treat … Continue reading

Posted in grammar | Leave a comment

Second Celtic Language Technology Workshop deadline April the 15th

I have partly been quiet here because I have been hard at work putting together something for this: http://www.lattice.cnrs.fr/CLTW/index-en.html and clearly I should not prejudice the double-blindness of the refereeing too much. Ahem.

Posted in conferences | Leave a comment

Resumptivity resumed

I said (four years ago) that Gaelic doesn’t have resumptive pronouns. However, while scouring William Lamb’s Scottish Gaelic for unusual uses of agus, I found these examples, with the resumptive bit in bold. sin an gille a shuidh Cèit air (that is the … Continue reading

Posted in grammar | 2 Comments

Interrogative frequencies in DASG

One aspect of Gaelic I want to look at more closely is interrogatives. Just as all the wh- words in English (who, when, why, what, how) go to the front of the sentence, so do all the c- words in … Continue reading

Posted in grammar, preliminaries | Leave a comment

DASG and the second comparative

If you haven’t come across Dachaigh airson Stòras na Gàidhlig/Digital Archive of Scottish Gaelic you should stop what reading this and go straight there. … Welcome back. It contains eight and a half million words and is a resource I keep … Continue reading

Posted in grammar | 2 Comments

Training a dependency parser on gdbank

A very quick note to say that I’ve trained maltparser, a dependency parser, with the current gdbank sentences (a mere 1223 tokens spread across 70-odd sentences), the Universal POS tagging scheme and the current Universal-ish gdbank dependency annotation scheme, and then … Continue reading

Posted in dependency parsing, maltparser | Leave a comment

MaltParser cheat mode

If you train MaltParser using the learnwo flowchart in place of learn, it does all the same things, except that it writes out the sentences as it reads them in. This means that if you have, ahem, misformatted any of … Continue reading

Posted in dependency parsing, maltparser | Leave a comment

Installing MaltParser on Mac OS X 10.6.8

MaltParser is a dependency parser and it’s available here: http://www.maltparser.org/download.html If you try to run the ready-built jar under Mac OS X 10.6.8 and you haven’t updated to Java 1.7, you’ll get a major.minor version number error. However, if you … Continue reading

Posted in dependency parsing, maltparser | Leave a comment