On Fri, Jul 1, 2011 at 11:58 PM, Rogan Creswick <creswick@gmail.com> wrote:
On Fri, Jul 1, 2011 at 12:38 PM, Dmitri O.Kondratiev <dokondr@gmail.com> wrote:
> On Fri, Jul 1, 2011 at 9:34 PM, Rogan Creswick <creswick@gmail.com> wrote:
>>
>> On Fri, Jul 1, 2011 at 3:31 AM, Dmitri O.Kondratiev <dokondr@gmail.com>
>> wrote:> First of all I need:
>
> Unfortunately 'cabal install' fails with toktok:
>
> tools/ExtractLexicon.hs:5:35:
>     Module `PGF' does not export `getLexicon'
> cabal: Error: some packages failed to install:
> toktok-0.5 failed during the building phase. The exception was:
> ExitFailure 1

Oh, right - I ran into this problem too, and forgot about it (I should
have reported a bug...) I think this fails because of (relatively)
recent changes in GF, which isn't constrained to specific versions in
the toktok cabal file...

--Rogan

Any other then 'toktok' Haskell word tokenizer that compiles and works?
I need something like:
http://nltk.googlecode.com/svn/trunk/doc/api/nltk.tokenize.regexp.WordPunctTokenizer-class.html

Thanks!