Skip to content

Commit

Permalink
first
Browse files Browse the repository at this point in the history
  • Loading branch information
brainsqueezer committed Jun 14, 2011
0 parents commit f6bcc1c
Show file tree
Hide file tree
Showing 531 changed files with 77,500 additions and 0 deletions.
1 change: 1 addition & 0 deletions .cvsignore
@@ -0,0 +1 @@
bin
11 changes: 11 additions & 0 deletions .project
@@ -0,0 +1,11 @@
<?xml version="1.0" encoding="UTF-8"?>
<projectDescription>
<name>htmlparser</name>
<comment></comment>
<projects>
</projects>
<buildSpec>
</buildSpec>
<natures>
</natures>
</projectDescription>
11 changes: 11 additions & 0 deletions CVS/Entries
@@ -0,0 +1,11 @@
/.cvsignore/1.1/Wed Jun 2 22:47:07 2004//
D/.ssh////
D/bin////
/build.xml/1.85/Sun Jun 4 19:17:21 2006/-kb/
D/docs////
D/lib////
D/release////
D/resources////
D/results////
D/specs////
D/src////
1 change: 1 addition & 0 deletions CVS/Repository
@@ -0,0 +1 @@
htmlparser
1 change: 1 addition & 0 deletions CVS/Root
@@ -0,0 +1 @@
:pserver:anonymous@htmlparser.cvs.sourceforge.net:/cvsroot/htmlparser
Empty file added README.txt
Empty file.
552 changes: 552 additions & 0 deletions build.xml

Large diffs are not rendered by default.

20 changes: 20 additions & 0 deletions docs/CVS/Entries
@@ -0,0 +1,20 @@
D/articles////
/bug.html/1.3/Sun Mar 19 22:03:56 2006/-ko/
/changes.txt/1.211/Sat Jun 10 15:11:31 2006//
/contributors.html/1.22/Sun Mar 19 15:01:22 2006//
D/design////
D/docs////
/faq.html/1.1/Sat Jun 10 14:39:57 2006//
/htmlparser.jpg/1.1/Mon May 31 22:27:09 2004/-kb/
/htmlparserlogo.jpg/1.2/Mon May 31 22:27:09 2004/-kb/
/index.html/1.4/Thu Jun 3 01:20:16 2004/-kb/
/joinus.html/1.1/Sun Jan 4 03:23:08 2004//
/mailinglists.html/1.2/Sun Jan 4 03:23:08 2004/-kb/
/main.html/1.9/Sun Apr 24 17:48:06 2005/-kb/
/panel.html/1.10/Sun Mar 19 22:13:52 2006/-kb/
D/pics////
/release.txt/1.75/Sat Jun 10 15:11:32 2006//
D/samples////
/samples.html/1.3/Wed Apr 6 10:28:02 2005//
/support.html/1.2/Sun Jan 4 03:23:08 2004/-ko/
D/wiki////
1 change: 1 addition & 0 deletions docs/CVS/Repository
@@ -0,0 +1 @@
htmlparser/docs
1 change: 1 addition & 0 deletions docs/CVS/Root
@@ -0,0 +1 @@
:pserver:anonymous@htmlparser.cvs.sourceforge.net:/cvsroot/htmlparser
2 changes: 2 additions & 0 deletions docs/articles/CVS/Entries
@@ -0,0 +1,2 @@
/index.html/1.2/Sun Jan 4 03:23:08 2004/-ko/
/quest.html/1.2/Sun Jan 4 03:23:08 2004/-ko/
1 change: 1 addition & 0 deletions docs/articles/CVS/Repository
@@ -0,0 +1 @@
htmlparser/docs/articles
1 change: 1 addition & 0 deletions docs/articles/CVS/Root
@@ -0,0 +1 @@
:pserver:anonymous@htmlparser.cvs.sourceforge.net:/cvsroot/htmlparser
13 changes: 13 additions & 0 deletions docs/articles/index.html
@@ -0,0 +1,13 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>HTMLParser Articles</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<link REL ="stylesheet" TYPE="text/css" HREF="../javadoc/stylesheet.css" TITLE="Style">
</head>

<body>
<h2>Articles </h2>
<p><a href="quest.html">The Quest for HTMLParser</a> - <a href="../contributors.html#dhaval"><em>Dhaval Udani</em></a></p>
</body>
</html>
58 changes: 58 additions & 0 deletions docs/articles/quest.html
@@ -0,0 +1,58 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>The Quest for HTMLParser</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<link REL ="stylesheet" TYPE="text/css" HREF="../javadoc/stylesheet.css" TITLE="Style">
</head>
<body>
<h2><strong>The Quest for HTMLParser</strong></h2>
<p>by <a href="../contributors.html#dhaval">Dhaval Udani</a><br>
</p>
<table width="75%" border="0">
<tr>
<td><p>In 1984, Citicorp Overseas Software Limited(COSL) was created by Citibank
to produce low cost software for its various banking operations. Citicorp
Information Technologies India Ltd.(CITIL), now know ans i-Flex, was formed
out of this company around 10 years back to service non-Citi clients.
In 2001, COSL was merged with another arm of Citibank, India known as
Global Support Unit(GSU) to form OrbiTech Solutions Ltd which in turn
merged with Polaris Software Labs in 2002. With its expertise in the banking
domain, OrbiTech undertook to develop a suite of banking products. However
with several players in the market, it needed something innovative and
fast. With an aim of increasing productivity, an initiative was started
to develop tools, code generators and reusable components to be used within
the organization. It is in this aspect that I got involved with HTMLParser.</p>
<p>We were developing an MVC-based framework for performing static maintenance
of information like bank accounts, customer records etc. To simplify development
for users, we were asking our users to develop simple static HTML pages
which we would convert to JSP pages capable of showing dynamic data. It
is towards this goal that I required a tool which could parse HTML tags
and allow me to play with them. I searched high and low for various options.
One of them was the HTML DOM standard and APIs of W3C. However their inability
to process JSP tags and inability to change the tags and reproduce them
meant I had to discard it. Another implementation of the DOM standard
was provided by NekoHTML. </p>
<p>However it had similar problems and was too complex. These factors drew
me to HTMLParser. Initially it was difficlt to understand but once I had
written my first parsing routine, it was too easy. I especially love the
easy manner in which scanners are registered and removed so that scanning
is enabled or disabled for particular tags. This feature is absolutely
fantastic. Having to search for tags which were not written in the original
HTMLParser caused a slight flutter in my heart. However Somik encouraged
me not to give up and write my own tag-scanner pairs.<br>
<br>
This was the toughest activity because it meant not only delving deep
in the code but also the psyche behind the design. Somehow I got through
the first one and then it just flowed. I have now written 5 tag-scanner
pairs. Its just too simple once you get the hang of it. The constant ongoing
development and effort at bug-fixing also meant that any bugs reported
by me would be fixed and a release would be available soon.<br>
<br>
<a href="../contributors.html#dhaval"><em>Dhaval Udani</em></a><em>
is a Senior Analyst at Orbitech Solutions Ltd. and a developer on the
HTMLParser project. </em></p></td>
</tr>
</table>
</body>
</html>
28 changes: 28 additions & 0 deletions docs/bug.html
@@ -0,0 +1,28 @@
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<title>Bug Reports</title>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<link REL ="stylesheet" TYPE="text/css" HREF="javadoc/stylesheet.css" TITLE="Style">
</head>
<body>
<h2>Bug Reports </h2>
<p>You can submit bug reports here. Before you proceed, please note that you
<strong>must</strong>
<a href="http://sourceforge.net/account/login.php">login to sourceforge</a>.
This is required so that bug status reports can be forwarded to you.
If you do not have a sourceforge login, you can get one
<a href="http://sourceforge.net/account/register.php">here</a>.</p>
<p>Checklist <strong>BEFORE</strong> you submit your bug report :</p>
<ul>
<li>Have you pretty much isolated the problem to the HTML Parser component.</li>
<li>Have you <a href="http://sourceforge.net/tracker/?func=browse&group_id=24399&atid=381399">checked
the list of older bug reports</a></li>
<li>Have you written a testcase to simulate your bug?
We do take reports without testcases, but please note that such reports may take
longer for us to respond to.</li>
</ul>
<p>Once you are ready, <a href="http://sourceforge.net/tracker/?func=add&group_id=24399&atid=381399">click
here to submit your report</a> (upload your testcase as a file attachment).</p>
</body>
</html>

0 comments on commit f6bcc1c

Please sign in to comment.