Skip to content
Permalink
Branch: master
Commits on May 5, 2019
Commits on May 2, 2019
Commits on Apr 29, 2019
Commits on Apr 20, 2019
  1. Add relevance transfer package (castorini#14)

    achyudh committed Apr 20, 2019
    * Add TREC relevance datasets
    
    * Add relevance transfer trainer and evaluator
    
    * Add re-ranking module
    
    * Add ImbalancedDatasetSampler
    
    * Add relevance transfer package
    
    * Fix import in classification trainer
Commits on Apr 19, 2019
  1. Add ImbalancedDatasetSampler

    achyudh committed Apr 19, 2019
  2. Add re-ranking module

    achyudh committed Apr 19, 2019
  3. Add TREC relevance datasets

    achyudh committed Apr 19, 2019
Commits on Apr 14, 2019
  1. Integrate BERT into Hedwig (#29) (castorini#11)

    achyudh authored and Ashutosh-Adhikari committed Apr 14, 2019
    * Fix package imports
    
    * Update README.md
    
    * Fix bug due to TAR/AR attribute check
    
    * Add BERT models
    
    * Add BERT tokenizer
    
    * Return logits from the model.py
    
    * Remove unused classes in models/bert
    
    * Return logits from the model.py (castorini#12)
    
    * Remove unused classes in models/bert (castorini#13)
    
    * Add initial main file
    
    * Add args for BERT
    
    * Add partial support for BERT
    
    * Initialize training and optimization
    
    * Draft the structure of Trainers for BERT
    
    * Remove duplicate tokenizer
    
    * Add utils
    
    * Move optimization to utils
    
    * Add more structure for trainer
    
    * Refactor the trainer (castorini#15)
    
    * Refactor the trainer
    
    * Add more edits
    
    * Add support for our datasets
    
    * Add evaluator
    
    * Split data4bert module into multiple processors
    
    * Refactor BERT tokenizer
    
    * Integrate BERT into Castor framework (castorini#17)
    
    * Remove unused classes in models/bert
    
    * Split data4bert module into multiple processors
    
    * Refactor BERT tokenizer
    
    * Add multilabel support in BertTrainer
    
    * Add multilabel support in BertEvaluator
    
    * Add get_test_samples method in dataset processors
    
    * Fix args.py for BERT
    
    * Add support for Reuters, IMDB datasets for BERT
    
    * Revert "Integrate BERT into Castor framework (castorini#17)"
    
    This reverts commit e4244ec.
    
    * Fix paths to datasets in dataset classes and args
    
    * Add SST dataset
    
    * Add hedwig-data instructions to README.md
    
    * Fix KimCNN README
    
    * Fix RegLSTM README
    
    * Fix typos in README
    
    * Remove trec_eval from README
    
    * Add tensorboardX to requirements.txt
    
    * Rename processors module to bert_processors
    
    * Add method to print metrics after training
    
    * Add model check-pointing and early stopping for BERT
    
    * Add logos
    
    * Update README.md
    
    * Fix code comments in classification trainer
    
    * Add support for AAPD, Sogou, AGNews and Yelp2014
    
    * Fix bug that deleted saved models
    
    * Update README for HAN
    
    * Update README for XML-CNN
    
    * Remove redundant TODOs from the READMEs
    
    * Fix logo in README.md
    
    * Update README for Char-CNN
    
    * Fix all the READMEs
    
    * Resolve conflict
    
    * Fix Typos
    
    * Re-Add SST2 Processor
    
    * Add support for evaluating trained model
    
    * Update args.py
    
    * Resolve issues due to DataParallel wrapper on saved model
    
    * Remove redundant Yelp processor
    
    * Fix bug for safely creating the saving directory
    
    * Change checkpoint paths to timestamps
    
    * Remove unwanted string.strip() from tokenizer
    
    * Create save path if it doesn't exist
    
    * Decouple model checkpoints from code
    
    * Remove model choice restrictions for BERT
    
    * Remove model/distill driver
    
    * Simplify checkpoint directory creation
  2. Integrate BERT into Hedwig (#29)

    achyudh authored and Ashutosh-Adhikari committed Apr 14, 2019
    * Fix package imports
    
    * Update README.md
    
    * Fix bug due to TAR/AR attribute check
    
    * Add BERT models
    
    * Add BERT tokenizer
    
    * Return logits from the model.py
    
    * Remove unused classes in models/bert
    
    * Return logits from the model.py (castorini#12)
    
    * Remove unused classes in models/bert (castorini#13)
    
    * Add initial main file
    
    * Add args for BERT
    
    * Add partial support for BERT
    
    * Initialize training and optimization
    
    * Draft the structure of Trainers for BERT
    
    * Remove duplicate tokenizer
    
    * Add utils
    
    * Move optimization to utils
    
    * Add more structure for trainer
    
    * Refactor the trainer (castorini#15)
    
    * Refactor the trainer
    
    * Add more edits
    
    * Add support for our datasets
    
    * Add evaluator
    
    * Split data4bert module into multiple processors
    
    * Refactor BERT tokenizer
    
    * Integrate BERT into Castor framework (castorini#17)
    
    * Remove unused classes in models/bert
    
    * Split data4bert module into multiple processors
    
    * Refactor BERT tokenizer
    
    * Add multilabel support in BertTrainer
    
    * Add multilabel support in BertEvaluator
    
    * Add get_test_samples method in dataset processors
    
    * Fix args.py for BERT
    
    * Add support for Reuters, IMDB datasets for BERT
    
    * Revert "Integrate BERT into Castor framework (castorini#17)"
    
    This reverts commit e4244ec.
    
    * Fix paths to datasets in dataset classes and args
    
    * Add SST dataset
    
    * Add hedwig-data instructions to README.md
    
    * Fix KimCNN README
    
    * Fix RegLSTM README
    
    * Fix typos in README
    
    * Remove trec_eval from README
    
    * Add tensorboardX to requirements.txt
    
    * Rename processors module to bert_processors
    
    * Add method to print metrics after training
    
    * Add model check-pointing and early stopping for BERT
    
    * Add logos
    
    * Update README.md
    
    * Fix code comments in classification trainer
    
    * Add support for AAPD, Sogou, AGNews and Yelp2014
    
    * Fix bug that deleted saved models
    
    * Update README for HAN
    
    * Update README for XML-CNN
    
    * Remove redundant TODOs from the READMEs
    
    * Fix logo in README.md
    
    * Update README for Char-CNN
    
    * Fix all the READMEs
    
    * Resolve conflict
    
    * Fix Typos
    
    * Re-Add SST2 Processor
    
    * Add support for evaluating trained model
    
    * Update args.py
    
    * Resolve issues due to DataParallel wrapper on saved model
    
    * Remove redundant Yelp processor
    
    * Fix bug for safely creating the saving directory
    
    * Change checkpoint paths to timestamps
    
    * Remove unwanted string.strip() from tokenizer
    
    * Create save path if it doesn't exist
    
    * Decouple model checkpoints from code
    
    * Remove model choice restrictions for BERT
    
    * Remove model/distill driver
    
    * Simplify checkpoint directory creation
Commits on Apr 8, 2019
  1. Update README.md

    lintool committed Apr 8, 2019
Commits on Mar 28, 2019
  1. Merge pull request #5 from achyudh/master

    achyudh committed Mar 28, 2019
     Fix bug due to TAR/AR attribute check
Commits on Mar 24, 2019
Commits on Mar 21, 2019
  1. Merge pull request #2 from castorini/daemon/patch-requirements

    Ashutosh-Adhikari committed Mar 21, 2019
    Remove extraneous package requirements
  2. Remove tqdm from requirements

    daemon committed Mar 21, 2019
  3. Update README.md

    achyudh committed Mar 21, 2019
  4. Fix package imports

    achyudh committed Mar 21, 2019
  5. Update setup.py

    achyudh committed Mar 21, 2019
  6. Merge pull request castorini#10 from achyudh/master

    achyudh committed Mar 21, 2019
    Refactor XML-CNN and HAN
Commits on Mar 20, 2019
Older
You can’t perform that action at this time.