brendano / awkspeed

Speed testing for a data munging task

http://anyall.org/blog/?p=652

fix flushing in Java version 
DRMacIver (author)
Thu Sep 10 06:05:04 -0700 2009
commit  474264d36c92f2ab6e7f9195a9afe8119ab2ccf9
tree    7537526b60e86396fdfa78fa66a3c98e2757efea
parent  4f672dfbff8e20a7de0587a6e15627e64c7c1560
awkspeed / 2num_3line.awk
100644 8 lines (6 sloc) 0.359 kb
# USAGE: awk -f 2num_3line.awk file1 file2 file3
# Take (item, feat, value) triplets and numberize them as sparse matrices.
# Item numbers restart at 1 within each file, but feat numbers are global.
 
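# First time an item appears in the current file: give it that file's next item number.
!imap[FILENAME,$1] {imap[FILENAME,$1] = ++I[FILENAME]}
# First time a feature appears in any file: give it the next global number and log its name to "vocab".
!jmap[$2] {jmap[$2] = ++J; print $2 > "vocab"}
# Write the numberized triplet to an output file named after the input plus an "n" suffix.
{ print imap[FILENAME,$1], jmap[$2], $3 > (FILENAME "n")}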
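
A small worked run may make the numbering scheme easier to follow. The sketch below copies the three rules into a scratch directory and applies them to two tiny input files. The file names `train` and `test` and their contents are made up for illustration and are not part of the repository.

```shell
#!/bin/sh
# Illustrative run of the script on hypothetical sample data.
set -e
cd "$(mktemp -d)"

# The three rules from 2num_3line.awk, copied verbatim.
cat > 2num_3line.awk <<'EOF'
!imap[FILENAME,$1] {imap[FILENAME,$1] = ++I[FILENAME]}
!jmap[$2] {jmap[$2] = ++J; print $2 > "vocab"}
{ print imap[FILENAME,$1], jmap[$2], $3 > (FILENAME "n")}
EOF

# Hypothetical (item, feat, value) triplets, whitespace-separated.
printf 'doc_a apple 2\ndoc_a pear 1\ndoc_b apple 5\n' > train
printf 'doc_x pear 3\ndoc_x kiwi 4\n' > test

awk -f 2num_3line.awk train test

cat trainn   # 1 1 2 / 1 2 1 / 2 1 5
cat testn    # 1 2 3 / 1 3 4
cat vocab    # apple / pear / kiwi
```

Note that `doc_x` becomes item 1 in `testn` because item numbering restarts per file, while `pear` keeps feature number 2 in both outputs because the feature map is shared. Line k of `vocab` holds the name of feature k.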
 