HTML-Parser

HTML link Parser. Go exercise

The goal of this project is to parse an HTML file and extract all of the links (<a href="">...</a> tags). For each extracted link it should return a data structure that includes both the href, as well as the text inside the link. Any HTML inside of the link can be stripped out, along with any extra whitespace including newlines, back-to-back spaces, etc.

Input:

<a href="/dog">
  <span>Something in a span</span>
  Text not in a span
  <b>Bold text!</b>
</a>

Output:

Link{
  Href: "/dog",
  Text: "Something in a span Text not in a span Bold text!",
}

To run this project.

git clone https://github.com/niranjan-n/HTML-Parser.git

go run example1/main.go

go run example2/main.go

go run example3/main.go

go run example4/main.go

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
example1		example1
example2		example2
example3		example3
example4		example4
link		link
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HTML-Parser

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

niranjan-n/HTML-Parser

Folders and files

Latest commit

History

Repository files navigation

HTML-Parser

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages