Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regression: Frog no longer outputs both FoLiA XML and columned output when requested #103

Closed
proycon opened this issue Nov 24, 2023 · 2 comments
Assignees
Labels

Comments

@proycon
Copy link
Member

proycon commented Nov 24, 2023

Invoking frog as follows:

frog --language=nld -t test.txt -n --skip=npc--id=test -X test.xml -o test.out

This used to produce both a test.out file and a text.xml file. Now it only produces columned output and disregards the FoLiA.

The frog webservice relies on this behaviour to produce both files in a single pass and now breaks on this.

@kosloot
Copy link
Collaborator

kosloot commented Nov 25, 2023

Well, I cannot reproduce this:

$ cat  > test.txt
Dit is een test.
^D
$ frog --language=nld -t test.txt -n --skip=npc --id=test -X test.xml -o test.out
frog 0.32 (c) CLST, ILK 1998 - 2023
CLST  - Centre for Language and Speech Technology,Radboud University
ILK   - Induction of Linguistic Knowledge Research Group,Tilburg University
based on [ucto 0.31, libfolia 2.18, timbl 6.10, ticcutils 0.35, mbt 3.11]
...
...
frog-:Frogging in total took: 0 seconds, 6 milliseconds and 822 microseconds
frog-:FoLiA stored in test.xml
frog-:results stored in test.out
frog-:Sat Nov 25 09:02:45 2023 Frog finished

$ more test.out
1       Dit     dit     [dit]   VNW(aanw,pron,stan,vol,3o,ev)   0.777085                                                                    
2       is      zijn    [zijn]  WW(pv,tgw,ev)   0.999891                                                                    
3       een     een     [een]   LID(onbep,stan,agr)     0.999113                                                                    
4       test    test    [test]  N(soort,ev,basis,zijd,stan)     0.903055                                                                    
5       .       .       [.]     LET()   1.000000                                                              
$ more test.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="folia.xsl"?>
<FoLiA xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://ilk.uvt.nl/folia" xml:id="test" generator="libfolia-v2.18" version="2.5.4">
  <metadata type="native">
    <annotations>

So both output formats are produced!
What version of Frog and other tools do you use?

@kosloot
Copy link
Collaborator

kosloot commented Nov 27, 2023

Ok there are some problems in the MBMA data-files .
I implemented a quick fix, but it should be better to fix the data. (which is a tedious job)

NOTE: in fact this is not a bug, Frog is unable to handle (part of) an input and continues with the next file.
But this is a surprise for the user. NO FoLia can be created, but some partial Tabbed output is possible (up to the problem)

@proycon proycon closed this as completed Dec 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants