Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Misc. improvements #17

Open
wants to merge 42 commits into
base: master
Choose a base branch
from
Open

Misc. improvements #17

wants to merge 42 commits into from

Commits on Oct 26, 2016

  1. added indri stopping (treats stopwords as OOV)

    Garrick Sherman committed Oct 26, 2016
    Configuration menu
    Copy the full SHA
    aba8971 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2016

  1. added IndexBackedSearchHits to help alleviate unexpected missing info…

    … in SearchHits
    Garrick Sherman committed Nov 9, 2016
    Configuration menu
    Copy the full SHA
    296332b View commit details
    Browse the repository at this point in the history

Commits on Nov 10, 2016

  1. added utility class IndexBackedSearchHits

    Garrick Sherman committed Nov 10, 2016
    Configuration menu
    Copy the full SHA
    7675cd0 View commit details
    Browse the repository at this point in the history

Commits on Nov 30, 2016

  1. IndexWrappers return IndexBackedSearchHits by default

    Garrick Sherman committed Nov 30, 2016
    Configuration menu
    Copy the full SHA
    ec2f69b View commit details
    Browse the repository at this point in the history

Commits on Dec 13, 2016

  1. added toString, equals, and hashCode to SearchHit

    Garrick Sherman committed Dec 13, 2016
    Configuration menu
    Copy the full SHA
    cec1ba6 View commit details
    Browse the repository at this point in the history

Commits on Dec 15, 2016

  1. add equals() and hashCode() to GQuery

    Two queries are equal if they have the same title; maybe not a
    good choice?
    Garrick Sherman committed Dec 15, 2016
    Configuration menu
    Copy the full SHA
    cfa9e0e View commit details
    Browse the repository at this point in the history
  2. add some "implements Iterable" statements

    Garrick Sherman committed Dec 15, 2016
    Configuration menu
    Copy the full SHA
    873a9bb View commit details
    Browse the repository at this point in the history

Commits on Dec 20, 2016

  1. big help to GC but slows things down a lot

    Garrick Sherman committed Dec 20, 2016
    Configuration menu
    Copy the full SHA
    1304a1b View commit details
    Browse the repository at this point in the history

Commits on Jul 18, 2017

  1. set mu in Indri

    Garrick Sherman committed Jul 18, 2017
    Configuration menu
    Copy the full SHA
    6b08e94 View commit details
    Browse the repository at this point in the history
  2. thread safe FormattedOutputTrecEval

    Garrick Sherman committed Jul 18, 2017
    Configuration menu
    Copy the full SHA
    8422926 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    837cc69 View commit details
    Browse the repository at this point in the history
  4. added new scoring framework

    Note the package for the new framework is edu.gslis.scoring (as opposed
    to edu.gslis.docscoring for the existing scorers) to allow us to
    maintain the old scorers for backwards compatibility but keep the new
    scorers distinct.
    Garrick Sherman committed Jul 18, 2017
    Configuration menu
    Copy the full SHA
    c77d3a1 View commit details
    Browse the repository at this point in the history

Commits on Jul 25, 2017

  1. Configuration menu
    Copy the full SHA
    b915d33 View commit details
    Browse the repository at this point in the history

Commits on Jul 26, 2017

  1. Configuration menu
    Copy the full SHA
    59cd1de View commit details
    Browse the repository at this point in the history

Commits on Jul 27, 2017

  1. Create LICENSE

    craig-willis authored Jul 27, 2017
    Configuration menu
    Copy the full SHA
    21c14c9 View commit details
    Browse the repository at this point in the history

Commits on Jul 28, 2017

  1. Merge pull request uiucGSLIS#19 from bodom0015/NDS-975

    NDS-975: Updated POM for Maven Central sync requirements
    bodom0015 authored Jul 28, 2017
    Configuration menu
    Copy the full SHA
    2c66559 View commit details
    Browse the repository at this point in the history

Commits on Jul 29, 2017

  1. Configuration menu
    Copy the full SHA
    3e7dd44 View commit details
    Browse the repository at this point in the history
  2. Merge pull request uiucGSLIS#20 from bodom0015/master

    NDS-975: Added license/developer info to POM
    bodom0015 authored Jul 29, 2017
    Configuration menu
    Copy the full SHA
    f15cfb9 View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2017

  1. added BM25DocScorer

    Garrick Sherman committed Aug 1, 2017
    Configuration menu
    Copy the full SHA
    e9acabc View commit details
    Browse the repository at this point in the history
  2. added new scoring framework

    Note the package for the new framework is edu.gslis.scoring (as opposed
    to edu.gslis.docscoring for the existing scorers) to allow us to
    maintain the old scorers for backwards compatibility but keep the new
    scorers distinct.
    Garrick Sherman committed Aug 1, 2017
    Configuration menu
    Copy the full SHA
    c8006f8 View commit details
    Browse the repository at this point in the history
  3. added BM25DocScorer

    Garrick Sherman committed Aug 1, 2017
    Configuration menu
    Copy the full SHA
    51a7ce2 View commit details
    Browse the repository at this point in the history
  4. Merge branch 'scoring-lib' of https://github.com/gtsherman/ir-tools-1

    …into scoring-lib
    
    Conflicts:
    	pom.xml
    Garrick Sherman committed Aug 1, 2017
    Configuration menu
    Copy the full SHA
    9c78349 View commit details
    Browse the repository at this point in the history
  5. added guava to pom

    Garrick Sherman committed Aug 1, 2017
    Configuration menu
    Copy the full SHA
    2925e82 View commit details
    Browse the repository at this point in the history

Commits on Aug 2, 2017

  1. added DoNothingDocScorer

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    cfcf0d4 View commit details
    Browse the repository at this point in the history
  2. added indri stopping (treats stopwords as OOV)

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    f3cef07 View commit details
    Browse the repository at this point in the history
  3. added IndexBackedSearchHits to help alleviate unexpected missing info…

    … in SearchHits
    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    8dd22e8 View commit details
    Browse the repository at this point in the history
  4. added utility class IndexBackedSearchHits

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    43a9e94 View commit details
    Browse the repository at this point in the history
  5. IndexWrappers return IndexBackedSearchHits by default

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    93af026 View commit details
    Browse the repository at this point in the history
  6. added toString, equals, and hashCode to SearchHit

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    3ab6b7f View commit details
    Browse the repository at this point in the history
  7. add equals() and hashCode() to GQuery

    Two queries are equal if they have the same title; maybe not a
    good choice?
    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    7d5fbf1 View commit details
    Browse the repository at this point in the history
  8. add some "implements Iterable" statements

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    2005da5 View commit details
    Browse the repository at this point in the history
  9. big help to GC but slows things down a lot

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    006e886 View commit details
    Browse the repository at this point in the history
  10. set mu in Indri

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    b82439f View commit details
    Browse the repository at this point in the history
  11. thread safe FormattedOutputTrecEval

    Garrick Sherman committed Aug 2, 2017
    Configuration menu
    Copy the full SHA
    b36ae9e View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    6c5b122 View commit details
    Browse the repository at this point in the history

Commits on Oct 17, 2017

  1. apply stopper feature vector

    Garrick Sherman committed Oct 17, 2017
    Configuration menu
    Copy the full SHA
    579d6cd View commit details
    Browse the repository at this point in the history

Commits on Feb 7, 2018

  1. removed unwanted logging from GQueriesFactory

    Garrick Sherman committed Feb 7, 2018
    Configuration menu
    Copy the full SHA
    bc89660 View commit details
    Browse the repository at this point in the history
  2. changed FeatureVector storage

    - Revert to not saving FeatureVector by default when querying
    - Remove SoftReferences for storing FeatureVectors in
    IndexBackedSearchHits
    Garrick Sherman committed Feb 7, 2018
    Configuration menu
    Copy the full SHA
    bb844c8 View commit details
    Browse the repository at this point in the history
  3. relevance model improvements

    - StandardRM1Builder now:
    	- Does not enforce DocScorer type
            - Creates DocScorers lazily (when buildRelevanceModel is called)
            - Creates the QueryScorer only once
    - RM3Builder now:
            - Can be built with a pre-computed RM1 vector
    Garrick Sherman committed Feb 7, 2018
    Configuration menu
    Copy the full SHA
    8a3f638 View commit details
    Browse the repository at this point in the history
  4. Merge branch 'garrick' of https://github.com/gtsherman/ir-tools-1 int…

    …o garrick
    
    Conflicts:
    	src/main/java/edu/gslis/indexes/IndexWrapperLuceneImpl.java
    	src/main/java/edu/gslis/searchhits/IndexBackedSearchHit.java
    Garrick Sherman committed Feb 7, 2018
    Configuration menu
    Copy the full SHA
    8b02cb2 View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2018

  1. changes too old for me to remember

    Garrick Sherman committed Oct 18, 2018
    Configuration menu
    Copy the full SHA
    4f1f92c View commit details
    Browse the repository at this point in the history

Commits on Jan 14, 2019

  1. deal with docID == -1

    It turns out the IndexWrapperIndriImpl returns -1 if a docno isn't
    found. This was causing crashes when fetching the feature vector for
    unknown documents. These slight adjustments fix crashes.
    Garrick Sherman committed Jan 14, 2019
    Configuration menu
    Copy the full SHA
    a6e887a View commit details
    Browse the repository at this point in the history