qingfeng edited this page Sep 13, 2010 · 2 revisions
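  1. Walk the directory and collect the text files (HTML, PHP, HTM), using the `glob` module and `os.path`.
  2. Parse the HTML (convert it to plain text, extract the title, and so on).
  3. Write the result to the database. KEY: directory + file name, hashed to an MD5 hex digest (`import hashlib`, then `hashlib.md5("filename").hexdigest()`; on Python 3 the argument must be bytes, e.g. `"filename".encode("utf-8")`).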
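
The three steps above can be sketched roughly as follows. This is only a minimal illustration under several assumptions that the page does not state: Python 3, the standard-library `html.parser` for the HTML step, and SQLite (via `sqlite3`) as the database. The names `iter_files`, `TextExtractor`, `parse_html`, `make_key`, `index_directory` and the `pages` table are hypothetical and chosen only for this example.

```python
import glob
import hashlib
import os
import sqlite3
from html.parser import HTMLParser

EXTENSIONS = (".html", ".htm", ".php")  # file types named in step 1


def iter_files(root):
    """Step 1: return matching files under root (recursive), sorted."""
    paths = []
    for ext in EXTENSIONS:
        pattern = os.path.join(root, "**", "*" + ext)
        paths.extend(glob.glob(pattern, recursive=True))
    return sorted(paths)


class TextExtractor(HTMLParser):
    """Step 2: collect the <title> and the visible text of a document."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.chunks = []
        self._in_title = False
        self._skip = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip and data.strip():
            self.chunks.append(data.strip())


def parse_html(markup):
    """Return (title, plain_text) for an HTML string."""
    parser = TextExtractor()
    parser.feed(markup)
    parser.close()
    return parser.title.strip(), " ".join(parser.chunks)


def make_key(path):
    """Step 3 key: MD5 hex digest of directory + file name."""
    return hashlib.md5(path.encode("utf-8")).hexdigest()


def index_directory(root, conn):
    """Run all three steps and store one row per file (assumed schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages "
        "(key TEXT PRIMARY KEY, path TEXT, title TEXT, body TEXT)"
    )
    for path in iter_files(root):
        with open(path, encoding="utf-8", errors="replace") as fh:
            title, body = parse_html(fh.read())
        conn.execute(
            "INSERT OR REPLACE INTO pages VALUES (?, ?, ?, ?)",
            (make_key(path), path, title, body),
        )
    conn.commit()
```

A possible call would be `index_directory("docs", sqlite3.connect("pages.db"))`, where both the directory and the database file name are placeholders. Using the MD5 digest as the primary key means that indexing the same path again overwrites its row instead of adding a duplicate.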