Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
The Pornhub extractor first tries to extract the title from the flashvars. Only if that doesn't work it tries to extract the title from the h1-tag.
Problem: The value in flashvars might be spaces only when the title consists of UTF-8 chars.
Example url: http://www.pornhub.com/view_video.php?viewkey=1331683002
Solution:
Maybe use meta tag for title extraction and do not depend on flashvars/h1-tag: