Skip to content

its-kayes/web_scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

#lines.select('#mw-toc-heading')

#for i in lines.select('#mw-toc-heading'):


lines.select('.kn')
for i in lines.select('.kn'):
    print(i.text)



for link in sup.find_all('a', herf = True):
    print(link['herf'])


text-white fadeInLeft

kn
href=True


res.text

https://web.learncodeonline.in


mw-headline
https://en.wikipedia.org/wiki/Web_scraping






for link in soup.find_all('a', href=True):

    first_two_char = link['href'][:2]

    if "#" not in link['href']:
        if first_two_char != '//':
            print(link['href'])

    if(first_two_char == '//'):
        base_link = 'www.google.co.in/'
        final_link = base_link + link['href'][2:]
        print(final_link)



About

Get all links from a webpage and find by class

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages