Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
Currently, each extractor calls sanitize_title in it's own and even makes up its own
stitle. We should do both steps inprocess_infofor all extractors.titleshould be the original title (and we canassert isinstance(title, unicode)).sanitized_titleshould be a title one can use in file names (i.e. without slash, and on Windows, without multiple RTL/LTR marks)We can then make up a simplified title
ascii_titlefor maximum compatibility.