LKB system back online
Posted: Wed Oct 14, 2009 10:25 pm
This morning I gave a talk to my department on my work on this website and future plans for it, including my forthcoming master's thesis work, which will probably be on extending the computational grammar of the Thai language that I started last spring. As part of this talk, I resurrected the experimental Thai syntax parsing system I was working on back then.
The set of sentences that are included in the system's vocabulary are listed here, along with links to a version of the dictionary entry that includes live syntactic parsing of the sentence. A green "bead" on that page indicates either:
Yellow highlighting indicates grammatical sentences that the system was not able to parse (false negative).
Red highlighting indicates ungrammatical sentences that the system found one or more parses for (false positive).
If the backend connection to my Linux system which runs the LKB parsing engine is working, then clicking on the id number in the left column brings you to a special dictionary page which includes a live re-parse of the sentence. If parses are found, then this section includes three parts:
Much of this information is technical and requires familiarity with the HPSG grammar formalism, but over time I expect to make the information more accessible. At this point, I'm just documenting--for the curious--an obscure backwater of this website which one day may become more prominent.
The set of sentences that are included in the system's vocabulary are listed here, along with links to a version of the dictionary entry that includes live syntactic parsing of the sentence. A green "bead" on that page indicates either:
- a sentence that is expected parse (i.e. is syntactically correct), and for which the system found one or more parses (green highlight), or
- a sentence that is not expected to parse (i.e. that is syntactically incorrect), and which the system correctly failed to parse (white highlight).
Yellow highlighting indicates grammatical sentences that the system was not able to parse (false negative).
Red highlighting indicates ungrammatical sentences that the system found one or more parses for (false positive).
If the backend connection to my Linux system which runs the LKB parsing engine is working, then clicking on the id number in the left column brings you to a special dictionary page which includes a live re-parse of the sentence. If parses are found, then this section includes three parts:
- one or more parse trees showing how the parts of the sentence fit together into a syntax tree
- the meaning of the sentence, expressed in Minimal Recursion Semantics (MRS) (Copestake 1999)
- full Attribute-Value Matricies (AVMs) for each parse.
Much of this information is technical and requires familiarity with the HPSG grammar formalism, but over time I expect to make the information more accessible. At this point, I'm just documenting--for the curious--an obscure backwater of this website which one day may become more prominent.