Drawing syntactic trees...
Many students and colleagues have asked me how to generate nice-looking trees for presentations, assignments, papers, etc. Here is a small summary of tools I have tried or seen. If you want to...
Yet another comment related to Lexc, XFST and compilation
You can use Helsinki Finite-State Transducer Technology HFST3 and Foma to compile XFST or Lexc defined morphologies and transducers…
Updated Python code and tools
The Charty parser code is updated to Python 3.x (implementing an Earley parser for context-free grammars), and a compact module, TextStat.py, with some useful functions for N-gram models, frequency...
It took a while...
to settle down in Ann Arbor and start teaching at EMU, but now we will get back to the project work…
ELS 2012
The European Lisp Symposium 2012 will be held at the University of Zadar. I am on the organizing committee and will be participating as well. Stay tuned…
SNLTK
An update to the Scheme Natural Language Toolkit (SNLTK) is on its way (and an update of Scheme is coming soon as well), and the SNLTK is also being ported to common Racket.
Ilse Lehiste Memorial Symposium: Melody and Meter
I’ll be at the Ilse Lehiste Memorial Symposium: Melody and Meter at the Ohio State University on the 11th of November 2011.
The Schemers become active again...
The Schemers and Racketeers are meeting again. Join us; see the SNLTK pages…
Building the Google V8 JavaScript engine as a Shell interpreter for Mac OS X
Here are instructions for building the Google V8 JavaScript engine on Mac OS X as a shell tool for testing: http://kourge.net/node/123 Just keep in mind, when you want to build it for Mac OS X Lion, the...
Intensive Python class for Linguists (for corpus linguistics, language data...
I am offering an intensive class for the LING519 students, all the Linguist List people, and whoever might be interested, this Saturday 19th of Nov. 2011 at 10 AM Eastern Time in Cooper, the...
Some DrRacket videos...
Here are some introductory video clips for DrRacket: http://www.youtube.com/playlist?list=PLD0EB7BC8D7CF739A Thanks to John Clements. DC
Scheme and Racket meeting at ILIT (Cooper building)
The Schemers at EMU meet on Thursday 31st of Nov. at 3 PM Eastern Time in the Cooper building for an initial 1.5-hour intro and coordination meeting. If you would like to participate, bring your...
Scheme and Racket implementation of a parser
The GUI-based Charty implementation (agenda-based chart parser for CFGs) is finally available on the SNLTK pages.
Computational Approaches to Slavic Languages 2012
Computational Formal Approaches to Slavic Languages 2012. Slavic Computational Linguistics: Computational Formal Approaches to Slavic Languages (10-11 May 2012, Bloomington, Indiana); Co-located with:...
C-FASL 2012, you should join it...
You should submit a paper to Computational Formal Approaches to Slavic Languages (C-FASL) 2012: http://cl.indiana.edu/~cfasl/
The linguistic Wolfram Demonstrations Projects
Check out these demonstrations from the Wolfram Demonstrations Project: Collocation by Chi Square; Collocation by Symmetric Conditional Probability; Multilanguage Word Lengths; Zipf's Law Applied to Word and...
Just restored the pages from backups...
I just restored a bunch of web pages of summer schools and workshops. Some had interesting material on them, in particular pictures. Check out the JSSECL 2006 event… Fourth Annual Meeting of the Slavic...
Online tool for IPA transcription
Here is an online tool for IPA transcription, i2speak: http://www.i2speak.com/
TikZ-dependency graph LaTeX library
The TikZ-dependency graph library for LaTeX can be found here…
Dictionaries for Mac OS X
Here are some of the dictionaries for the OS X Dictionary.app: the dict.cc dictionary plugin (English-German, German-English); the Tekl.de German Thesaurus and English-German dictionary.
Using Antconc: Notes 1
Here are short instructions on using Antconc for simple statistical analysis.
Language Technology Lab (LTL) up
The Language Technology Lab (LTL) (ILIT and EMU) is up, check it out: http://ltl.emich.edu/ More content to come in the next days and weeks… stay tuned!
Changed Privacy Policy
Since privacy policy changes seem to be all around now, here is one by me for the pages here: If you want to make your web experience somewhat more private, and prevent me from being able to read out...
LREC 2012 workshop on Challenges in the management of large corpora
You should really consider joining this LREC 2012 workshop on Challenges in the management of large corpora!
Stanford-CoreNLP corenlp.sh script on Mac OS X Lion
To make the Stanford CoreNLP tools work on your Mac OS X 10.7.x (Lion) distribution with the included bash script do this...
Charty in JavaScript...
Ben Cool ported Charty (CFG-based chart parser) to JavaScript for a class project and, in one version, added feature augmentation and unification. You can test it online. This is running on mobile...
Text analyzed and parsed to TEI XML wrapper
I set up a simple testing page for a wrapper of raw text to TEI XML. It uses in this version just the Stanford CoreNLP tools to tokenize, recognize sentences, part of speech annotate and lemmatize the...
LINGUIST List Fund Drive 2012 has started
Please consider supporting LINGUIST List: just go to the Fund Drive 2012 pages and donate!
Lithuanian Morphology and LFG-Grammar...
The poster for the DGfS annual meeting 2012 on a Lithuanian Morphology and LFG Grammar is done. This was the result of a grad course at the University of Konstanz on rule-based natural language...
TEI online converter: OxGarage Converter
The online OxGarage Converter on the TEI pages converts almost anything to something else, in particular to TEI XML. This is obviously using the OpenOffice filters and converters in the backend as...
The LTL corpus
The first version of the small LTL corpus with a couple of million tokens is online. It contains TEI P5 XML encoded books from the public domain. See here…
Working with the Philologic interface on the LTL corpora
Here is a brief first introduction to the Philologic interface for the LTL corpora and the LINGUIST List corpus.
The LINGUIST List corpus
The LINGUIST List corpora can be found here: http://ltl.emich.edu/llc/ There you can find the LINGUIST List mailings converted to TEI P5 XML. The linguistically annotated version will be available in...
Tokenization, frequency profiles and N-gram models in Python 3
This is a brief description of how to use the Python 3 scripts to generate N-gram models for word tokens and characters from text. I expect you to have a Python 3 interpreter installed on your system.
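As a rough illustration of what word and character N-gram frequency profiles look like, here is a minimal sketch. It is not the code from the scripts mentioned above: the function names (tokenize, ngrams, frequency_profile) and the naive whitespace tokenizer are assumptions made for this example only.

```python
from collections import Counter


def tokenize(text):
    """Lowercase the text and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def ngrams(sequence, n):
    """Return all contiguous n-grams of a sequence as a list of tuples."""
    return [tuple(sequence[i:i + n]) for i in range(len(sequence) - n + 1)]


def frequency_profile(sequence, n):
    """Count how often each n-gram occurs in the sequence."""
    return Counter(ngrams(sequence, n))


# Hypothetical sample text, used only for illustration.
text = "the cat sat on the mat and the cat slept"
tokens = tokenize(text)

# Word bigrams are built over the token list.
word_bigrams = frequency_profile(tokens, 2)

# Character trigrams are built over the raw string (spaces included).
char_trigrams = frequency_profile(text, 3)

print(word_bigrams.most_common(3))
print(char_trigrams.most_common(3))
```

The same ngrams function serves both cases because a list of tokens and a string are both sequences; a real tokenizer would also need to handle punctuation.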
Talk: M. Cavar "On the influence of L1 on the L2 perception: The case of...
Date: April 13th, 2012
Time: 1:30 PM
Location: Cooper Building, Suite 104, EMU, 2000 Huron River Drive, Ypsilanti
Directions: Take Washtenaw heading east from Ann Arbor toward Ypsilanti. Go past Hwy 23,...
Talk: Piotr Banski "TEI XML for Linguists"
Please join us for a talk by: Dr. Piotr Banski (Institute for German Language/Institut fuer Deutsche Sprache, Mannheim, Germany)
Title: "TEI XML for Linguists"
Time: Friday, April 20, 2012 at 2:00...
Course at LSA Institute 2013: Python 3 for Linguists
Malgosia and I will be teaching a course at the LSA Institute 2013 at the University of Michigan in Ann Arbor: Python 3 for Linguists. Thanks to the Institute Steering Committee for accepting our proposal!
Talk at the IDS 8th of May
Tomorrow, 8th of May 2012, I will be presenting at the Institute of German Language in Mannheim, and there is the last day of Maimarkt… I might meet U there???
Clozure CL on Mac App Store
Clozure CL, an open source and free implementation of Common Lisp for Mac, is available on the App Store: http://itunes.apple.com/us/app/clozure-cl/id489900618?mt=12
Endangered languages is up
The Endangered Languages site has been launched today: http://www.endangeredlanguages.com/
Java programming sessions for the ILIT group
We are meeting Fridays at 9 AM in the Cooper building for Java programming. You might want to prepare your machine by installing: 1. the Java SE 7u7...
WSU talk: info on corpora and tech that will be discussed
I’ll give a talk on corpora and relevant technologies at Wayne State University in Detroit on the 19th of October at 11 AM. Here are some links, papers and slides that might be interesting for...
XFST: Python 3 script to convert prolog file to DOT-graph
If you write out a stack (or network) in XFST to a prolog file:
write prolog > mymorph.plg
and you want to convert it to DOT and visualize it in Graphviz, here is a Python 3.x script to do so: Download...
LibreOffice and TEI Stylesheets for file conversion
If you want to batch convert a lot of files to some more accessible format (for example ODT or DOCX to HTML or TEI XML), you can start with LibreOffice. Here is a brief introduction how to batch...
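As a rough sketch of what such a batch conversion can look like, the following Python 3 snippet builds LibreOffice command lines for headless conversion to HTML. It is not the procedure from the full article: the function names, the default folder name "converted", and the assumption that the LibreOffice binary is reachable as soffice on your PATH are all hypothetical (on Mac OS X the binary usually sits inside the application bundle). The TEI XML step via the TEI Stylesheets is not covered here.

```python
import subprocess
from pathlib import Path


def build_convert_command(source, target_format="html", outdir="converted"):
    """Build a headless LibreOffice conversion command for one file.

    Assumption: the LibreOffice executable is available as 'soffice'.
    """
    return [
        "soffice",
        "--headless",
        "--convert-to", target_format,
        "--outdir", str(outdir),
        str(source),
    ]


def batch_convert(folder, pattern="*.odt", target_format="html",
                  outdir="converted", dry_run=True):
    """Build (and, unless dry_run is set, execute) one command per matching file."""
    commands = [
        build_convert_command(path, target_format, outdir)
        for path in sorted(Path(folder).glob(pattern))
    ]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if LibreOffice reports a failure.
            subprocess.run(command, check=True)
    return commands


# Show the command for a single hypothetical file without running LibreOffice.
print(" ".join(build_convert_command("paper.odt")))
```

Calling batch_convert with dry_run=False actually runs the conversions; the default only returns the commands so they can be inspected first.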
Some old files about the Linguistics Program at the University of Zadar
Since I was asked many times about this MA program and the original text that went to the accreditation committee in Croatia (where we got one very nasty and absolutely irrelevant review, if I find it,...
Moving projects and code to GitHub
I am moving code and project folders to GitHub. I don’t know whether this is a good idea; it just turns out to be easier to use… :-) This port includes the SNLTK code, all kinds of Python 3 projects,...
AARDVARC Workshop May 2013
AARDVARC - Automatically Annotated Repository of Digital Audio and Video Resources Community. NSF-sponsored workshops at ILIT/EMU and CUNY.
Python 3 for Linguists at the LSA Summer Institute 2013 Course Material
The course material for the LSA Summer Institute 2013 course Python 3 for Linguists will be made available at: Python for Linguists Wiki (LTL, EMU); Python 3 for Linguists (Dropbox). There is a (currently...
Midwest Speech and Language Days 2013
The Midwest Speech and Language Days 2013 at the Toyota Technological Institute at Chicago are happening on the 2nd and 3rd of May 2013.