Skip to content
Browse files

Added LGPL license

  • Loading branch information...
1 parent 8159da6 commit d9cebdb02a25a37838dca62a157a15415e01fe9b @gangeli committed Aug 4, 2012
View
165 COPYING.LESSER
@@ -0,0 +1,165 @@
+ GNU LESSER GENERAL PUBLIC LICENSE
+ Version 3, 29 June 2007
+
+ Copyright (C) 2007 Free Software Foundation, Inc. <http://fsf.org/>
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+
+ This version of the GNU Lesser General Public License incorporates
+the terms and conditions of version 3 of the GNU General Public
+License, supplemented by the additional permissions listed below.
+
+ 0. Additional Definitions.
+
+ As used herein, "this License" refers to version 3 of the GNU Lesser
+General Public License, and the "GNU GPL" refers to version 3 of the GNU
+General Public License.
+
+ "The Library" refers to a covered work governed by this License,
+other than an Application or a Combined Work as defined below.
+
+ An "Application" is any work that makes use of an interface provided
+by the Library, but which is not otherwise based on the Library.
+Defining a subclass of a class defined by the Library is deemed a mode
+of using an interface provided by the Library.
+
+ A "Combined Work" is a work produced by combining or linking an
+Application with the Library. The particular version of the Library
+with which the Combined Work was made is also called the "Linked
+Version".
+
+ The "Minimal Corresponding Source" for a Combined Work means the
+Corresponding Source for the Combined Work, excluding any source code
+for portions of the Combined Work that, considered in isolation, are
+based on the Application, and not on the Linked Version.
+
+ The "Corresponding Application Code" for a Combined Work means the
+object code and/or source code for the Application, including any data
+and utility programs needed for reproducing the Combined Work from the
+Application, but excluding the System Libraries of the Combined Work.
+
+ 1. Exception to Section 3 of the GNU GPL.
+
+ You may convey a covered work under sections 3 and 4 of this License
+without being bound by section 3 of the GNU GPL.
+
+ 2. Conveying Modified Versions.
+
+ If you modify a copy of the Library, and, in your modifications, a
+facility refers to a function or data to be supplied by an Application
+that uses the facility (other than as an argument passed when the
+facility is invoked), then you may convey a copy of the modified
+version:
+
+ a) under this License, provided that you make a good faith effort to
+ ensure that, in the event an Application does not supply the
+ function or data, the facility still operates, and performs
+ whatever part of its purpose remains meaningful, or
+
+ b) under the GNU GPL, with none of the additional permissions of
+ this License applicable to that copy.
+
+ 3. Object Code Incorporating Material from Library Header Files.
+
+ The object code form of an Application may incorporate material from
+a header file that is part of the Library. You may convey such object
+code under terms of your choice, provided that, if the incorporated
+material is not limited to numerical parameters, data structure
+layouts and accessors, or small macros, inline functions and templates
+(ten or fewer lines in length), you do both of the following:
+
+ a) Give prominent notice with each copy of the object code that the
+ Library is used in it and that the Library and its use are
+ covered by this License.
+
+ b) Accompany the object code with a copy of the GNU GPL and this license
+ document.
+
+ 4. Combined Works.
+
+ You may convey a Combined Work under terms of your choice that,
+taken together, effectively do not restrict modification of the
+portions of the Library contained in the Combined Work and reverse
+engineering for debugging such modifications, if you also do each of
+the following:
+
+ a) Give prominent notice with each copy of the Combined Work that
+ the Library is used in it and that the Library and its use are
+ covered by this License.
+
+ b) Accompany the Combined Work with a copy of the GNU GPL and this license
+ document.
+
+ c) For a Combined Work that displays copyright notices during
+ execution, include the copyright notice for the Library among
+ these notices, as well as a reference directing the user to the
+ copies of the GNU GPL and this license document.
+
+ d) Do one of the following:
+
+ 0) Convey the Minimal Corresponding Source under the terms of this
+ License, and the Corresponding Application Code in a form
+ suitable for, and under terms that permit, the user to
+ recombine or relink the Application with a modified version of
+ the Linked Version to produce a modified Combined Work, in the
+ manner specified by section 6 of the GNU GPL for conveying
+ Corresponding Source.
+
+ 1) Use a suitable shared library mechanism for linking with the
+ Library. A suitable mechanism is one that (a) uses at run time
+ a copy of the Library already present on the user's computer
+ system, and (b) will operate properly with a modified version
+ of the Library that is interface-compatible with the Linked
+ Version.
+
+ e) Provide Installation Information, but only if you would otherwise
+ be required to provide such information under section 6 of the
+ GNU GPL, and only to the extent that such information is
+ necessary to install and execute a modified version of the
+ Combined Work produced by recombining or relinking the
+ Application with a modified version of the Linked Version. (If
+ you use option 4d0, the Installation Information must accompany
+ the Minimal Corresponding Source and Corresponding Application
+ Code. If you use option 4d1, you must provide the Installation
+ Information in the manner specified by section 6 of the GNU GPL
+ for conveying Corresponding Source.)
+
+ 5. Combined Libraries.
+
+ You may place library facilities that are a work based on the
+Library side by side in a single library together with other library
+facilities that are not Applications and are not covered by this
+License, and convey such a combined library under terms of your
+choice, if you do both of the following:
+
+ a) Accompany the combined library with a copy of the same work based
+ on the Library, uncombined with any other library facilities,
+ conveyed under the terms of this License.
+
+ b) Give prominent notice with the combined library that part of it
+ is a work based on the Library, and explaining where to find the
+ accompanying uncombined form of the same work.
+
+ 6. Revised Versions of the GNU Lesser General Public License.
+
+ The Free Software Foundation may publish revised and/or new versions
+of the GNU Lesser General Public License from time to time. Such new
+versions will be similar in spirit to the present version, but may
+differ in detail to address new problems or concerns.
+
+ Each version is given a distinguishing version number. If the
+Library as you received it specifies that a certain numbered version
+of the GNU Lesser General Public License "or any later version"
+applies to it, you have the option of following the terms and
+conditions either of that published version or of any later version
+published by the Free Software Foundation. If the Library as you
+received it does not specify a version number of the GNU Lesser
+General Public License, you may choose any version of the GNU Lesser
+General Public License ever published by the Free Software Foundation.
+
+ If the Library as you received it specifies that a proxy can decide
+whether future versions of the GNU Lesser General Public License shall
+apply, that proxy's public statement of acceptance of any version is
+permanent authorization for you to choose that version for the
+Library.
View
BIN aux/coremap/tempeval2-english
Binary file not shown.
View
BIN aux/coremap/tempeval2-english-retok
Binary file not shown.
View
BIN aux/coremap/tempeval2-english-retok-numbers
Binary file not shown.
View
BIN dist/time.jar
Binary file not shown.
View
2 src/time/Entry.scala
@@ -760,7 +760,7 @@ object Entry {
// val pipeline = new StanfordCoreNLP(props)
val pipeline = new AnnotationPipeline()
- pipeline.addAnnotator(new WhitespaceTokenizerAnnotator)
+ pipeline.addAnnotator(new WhitespaceTokenizerAnnotator(new Properties))
pipeline.addAnnotator(new WordsToSentencesAnnotator(false))
pipeline.addAnnotator(new POSTaggerAnnotator(
System.getenv("HOME") +
View
121 src/time/Tempeval2.scala
@@ -198,14 +198,14 @@ object TempEval2 {
var textGloss = new HashMap[Int,String]
var timeExpressions = List[CoreMap]()
def mkGloss:String = {
- val tokenRange = if(lang.toLowerCase == "spanish") (1 to textGloss.keys.size)
+ val tokenRange = if(lang.toLowerCase == "espanol") (1 to textGloss.keys.size)
else (0 until textGloss.keys.size);
tokenRange.map{ textGloss(_) }
.toArray
.mkString(" ")
}
def mkTokens:JList[CoreLabel] = {
- val tokenRange = if(lang.toLowerCase == "spanish") (1 to tokens.keys.size)
+ val tokenRange = if(lang.toLowerCase == "espanol") (1 to tokens.keys.size)
else (0 until tokens.keys.size);
seqAsJavaList(tokenRange.map{ tokens(_) })
}
@@ -272,7 +272,7 @@ object TempEval2 {
offset = 0
}
//(cycle words)
- val tokenRange = if(lang.toLowerCase == "spanish") (1 to sentInfo.tokens.keys.size)
+ val tokenRange = if(lang.toLowerCase == "espanol") (1 to sentInfo.tokens.keys.size)
else (0 until sentInfo.tokens.keys.size);
tokenRange.foreach{ (wordI:Int) =>
if (!sentInfo.tokens.contains(wordI)) {
@@ -332,7 +332,7 @@ object TempEval2 {
sentMap.set(classOf[TokensAnnotation], sentInfo.mkTokens)
sentMap.set(classOf[TimeExpressionsAnnotation], sentInfo.mkTimes)
sentMap.set(classOf[CharacterOffsetBeginAnnotation],
- sentInfo.tokens(if(lang == "spanish") 1 else 0).get[JInt,CharacterOffsetBeginAnnotation](
+ sentInfo.tokens(if(lang == "espanol") 1 else 0).get[JInt,CharacterOffsetBeginAnnotation](
classOf[CharacterOffsetBeginAnnotation]))
sentMap.set(classOf[CharacterOffsetEndAnnotation],
sentInfo.tokens(sentInfo.tokens.keySet.size-1).get[JInt,CharacterOffsetEndAnnotation](
@@ -364,25 +364,25 @@ object TempEval2 {
}
endTrack("Pass 4 (global timexes)")
//--Pipeline Annotate
- forceTrack("Pass 5 (pipeline)")
- //(annotators)
- val lemma = new MorphaAnnotator(false)
- val pos = new POSTaggerAnnotator(
- System.getenv("HOME") +
- "/lib/data/bidirectional-distsim-wsj-0-18.tagger", //TODO hard coded
- false)
- //(run pipeline)
- threadAndRun(
- docs.zipWithIndex.map{ case (doc:Annotation,i:Int) => new Runnable{
- override def run {
- pos.annotate(doc)
- lemma.annotate(doc)
- log("("+(i+1)+"/"+docs.length+ ") Annotated "
- + doc.get[String,DocIDAnnotation](classOf[DocIDAnnotation]))
- }
- } }
- )
- endTrack("Pass 5 (pipeline)")
+// forceTrack("Pass 5 (pipeline)")
+// //(annotators)
+// val lemma = new MorphaAnnotator(false)
+// val pos = new POSTaggerAnnotator(
+// System.getenv("HOME") +
+// "/lib/data/bidirectional-distsim-wsj-0-18.tagger", //TODO hard coded
+// false)
+// //(run pipeline)
+// threadAndRun(
+// docs.zipWithIndex.map{ case (doc:Annotation,i:Int) => new Runnable{
+// override def run {
+// pos.annotate(doc)
+// lemma.annotate(doc)
+// log("("+(i+1)+"/"+docs.length+ ") Annotated "
+// + doc.get[String,DocIDAnnotation](classOf[DocIDAnnotation]))
+// }
+// } }
+// )
+// endTrack("Pass 5 (pipeline)")
//--Return
endTrack("Making CoreMaps")
docs.reverse.toArray
@@ -506,46 +506,51 @@ object TempEval2 {
}
//<<retok>>
- def retokFromInit(init:SerializedCoreMapDataset
+ def retokFromInit(init:SerializedCoreMapDataset, lang:String
):SerializedCoreMapDataset = {
- init.saveAs("aux/coremap/tempeval2-english-retok")
+ init.saveAs("aux/coremap/tempeval2-"+lang+"-retok")
(new TempEval2RetokTask).perform(init)
init.save
init
}
- def retokFromInit(source:String):SerializedCoreMapDataset = {
- retokFromInit( new SerializedCoreMapDataset(source) )
+ def retokFromInit(source:String, lang:String):SerializedCoreMapDataset = {
+ retokFromInit( new SerializedCoreMapDataset(source), lang )
}
def retok(dir:String,lang:String):SerializedCoreMapDataset = {
- retokFromInit( apply(dir,lang) )
+ retokFromInit( apply(dir,lang), lang )
}
//<<normalize>>
- def normalizeFromRetok(retok:SerializedCoreMapDataset
+ def normalizeFromRetok(retok:SerializedCoreMapDataset, lang:String
):SerializedCoreMapDataset = {
- retok.saveAs("aux/coremap/tempeval2-english-retok-numbers")
+ retok.saveAs("aux/coremap/tempeval2-"+lang+"-retok-numbers")
(new TempEval2NumberNormalizeTask).perform(retok)
retok.save
retok
}
- def normalizeFromRetok(source:String):SerializedCoreMapDataset = {
- normalizeFromRetok( new SerializedCoreMapDataset(source) )
+ def normalizeFromRetok(source:String, lang:String):SerializedCoreMapDataset = {
+ normalizeFromRetok( new SerializedCoreMapDataset(source), lang )
}
- def normalizeFromInit(source:String):SerializedCoreMapDataset = {
- normalizeFromRetok( retokFromInit(source) )
+ def normalizeFromInit(source:String, lang:String):SerializedCoreMapDataset = {
+ normalizeFromRetok( retokFromInit(source, lang), lang )
}
- def normalizeFromInit(init:SerializedCoreMapDataset
+ def normalizeFromInit(init:SerializedCoreMapDataset, lang:String
):SerializedCoreMapDataset = {
- normalizeFromRetok( retokFromInit(init) )
+ normalizeFromRetok( retokFromInit(init, lang), lang )
}
def normalize(dir:String,lang:String):SerializedCoreMapDataset = {
- val x = normalizeFromInit( apply(dir,lang) )
+ val x = normalizeFromInit( apply(dir, lang), lang )
x
}
def toReadableFile(input:String, output:String):Unit = {
val data = new SerializedCoreMapDataset(input)
val contents = new StringBuilder();
+ def write(typ:String, value:String, date:String, text:String) = {
+ contents.append(typ).append("\t").append(value).append("\t")
+ .append(date).append("\t")
+ .append(text).append("\n")
+ }
//--Info
var currentId = ""
var words = List[String]()
@@ -554,7 +559,7 @@ object TempEval2 {
//--Iterate Over Data
data.foreach{ (datum:CoreMapDatum) =>
val cal = datum.get[java.util.Calendar,CalendarAnnotation](classOf[CalendarAnnotation])
- val format = new java.text.SimpleDateFormat("yyyy-MM-dd.hh:mm:ss")
+ val format = new java.text.SimpleDateFormat("yyyy-MM-dd hh:mm:ss")
val docdate = format.format(cal.getTime)
val sentences = datum.get[JList[CoreMap],SentencesAnnotation](classOf[SentencesAnnotation])
sentences.foreach{ (sent:CoreMap) =>
@@ -567,22 +572,30 @@ object TempEval2 {
val value = token.get[String,OriginalTimeValueAnnotation](classOf[OriginalTimeValueAnnotation])
val typ = token.get[String,OriginalTimeTypeAnnotation](classOf[OriginalTimeTypeAnnotation])
val id = token.get[String,TimeIdentifierAnnotation](classOf[TimeIdentifierAnnotation])
- //(check for change)
- if (currentId != null && id != null && id != currentId) {
- //(print old info)
- if (currentId != null) {
- contents.append(currentType).append("\t").append(currentValue).append("\t")
- .append(docdate).append("\t")
- .append(words.reverse.mkString(" ")).append("\n")
- }
- //(set new fields)
- currentId = id
- words = List[String](word)
+ if (id != null && currentId != null && id == currentId) {
+ //(case: still in a time)
+ words = word :: words
+ }
+ if (id == null && currentId != "") {
+ //(case: left a time)
+ write(currentType, currentValue, docdate, words.reverse.mkString(" "))
+ currentId = ""
+ }
+ if (currentId == "" && id != null) {
+ //(case: entered a time)
+ words = List[String](word)
+ currentId = id;
+ currentValue = value
+ currentType = typ
+ }
+ if (id != null && currentId != "" && id != currentId) {
+ //(case: switching times)
+ write(currentType, currentValue, docdate, words.reverse.mkString(" "))
+ words = List[String](word)
+ currentId = id;
currentValue = value
currentType = typ
- } else if (id != null) {
- words = word :: words;
- }
+ }
}
}
}
@@ -613,9 +626,9 @@ object TempEval2 {
// Dump a language's data into readable form
val language = args(0)
println("Processing " + language)
- retok("aux/tempeval2-cleaned", language)
+ apply("aux/tempeval2-cleaned", language)
toReadableFile(
- "aux/coremap/tempeval2-"+language+"-retok",
+ "aux/coremap/tempeval2-"+language,
"tmp/tempeval-"+language+".dat"
)
}

0 comments on commit d9cebdb

Please sign in to comment.
Something went wrong with that request. Please try again.