Operations on Q&A pairs from Stack Overflow
Purpose:
- Read the post.xml file from Stack Overflow.
- Store its content in a Lucene index.
- Extract the required content and write it to a fixed-format .xml file.
To read and store the posts, start with "/src/com/xml/sax/XmlSax_Test.java". To extract content, see "src/GetQApairsLinks/ExtracData.java".
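
The reading step can be illustrated with a minimal SAX sketch. This is not the code in XmlSax_Test.java. The class name PostRowReader and the inline sample are hypothetical, and the attribute names (Id, PostTypeId, Title, Body, ParentId) are assumed from the layout commonly seen in the public Stack Overflow dump, where each post is a `row` element and PostTypeId="1" marks a question. The Lucene indexing step is left out so the example runs with the JDK alone.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical class name; a sketch only, not the repository's implementation.
public class PostRowReader {

    // Tiny inline sample standing in for post.xml (assumed layout).
    static final String SAMPLE =
        "<posts>"
        + "<row Id=\"1\" PostTypeId=\"1\" Title=\"How do I parse XML?\" Body=\"q\"/>"
        + "<row Id=\"2\" PostTypeId=\"2\" ParentId=\"1\" Body=\"Use SAX.\"/>"
        + "</posts>";

    // Streams the XML and collects the Title of every row that is a question.
    static List<String> readQuestionTitles(InputStream in) throws Exception {
        List<String> titles = new ArrayList<>();
        SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
        parser.parse(in, new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes attrs) {
                // Assumption: PostTypeId "1" means question, "2" means answer.
                if ("row".equals(qName) && "1".equals(attrs.getValue("PostTypeId"))) {
                    titles.add(attrs.getValue("Title"));
                }
            }
        });
        return titles;
    }

    public static void main(String[] args) throws Exception {
        InputStream in = new ByteArrayInputStream(SAMPLE.getBytes(StandardCharsets.UTF_8));
        for (String title : readQuestionTitles(in)) {
            System.out.println(title);
        }
    }
}
```

SAX handles one element at a time instead of loading the whole document, which suits a file as large as the full post dump. In a real run, the ByteArrayInputStream would be replaced by a FileInputStream on the post.xml path.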