# Biotech VC Monitoring

> **Goal:** This notebook tracks the portfolio investments of selected biotech VCs. <br>
> **Notebook Owner(s):** Time Bioventures <br>
> **Date:** July 2024 <br>

***
## Import Libraries

In [60]:
import os
import re
import sys
import html
import requests
import pandas as pd
from bs4 import BeautifulSoup
from collections import defaultdict
from utils.webpage_scraping import test_connection

***
## 5AM Ventures

In [99]:
def fiveam_scrape():
	'''Scrape 5AM Ventures portfolio page for company information.'''
	fiveam_url = 'https://5amventures.com/portfolio'
	response = test_connection(fiveam_url)
	if response.status_code == 200:
		# Fetch the page itself; the timeout keeps a stalled request from hanging the notebook
		page = requests.get(fiveam_url, timeout=30)
		soup = BeautifulSoup(page.content, 'html.parser')
		portfolio = soup.find_all('li', class_='company-click')
		companies = defaultdict(dict)
		for company_soup in portfolio:
			# The logo's alt text holds the company name:
			# <img alt="Achaogen" src="https://5amventures.com/wp-content/uploads/2018/03/achaogen.png"/> -> Achaogen
			logo = company_soup.find('img')
			company_name = logo['alt']
			companies[company_name]['desc'] = company_soup.find('p').get_text().strip()
			# Pull the raw data-facts attribute value out of the element's markup
			facts_raw = re.findall(r'data-facts="(.*?)"', str(company_soup))
			# Unescape HTML entities, then split on </li> so each chunk holds one fact;
			# the chunk after the last </li> is leftover list syntax and is dropped below
			company_facts_str = html.unescape(str(facts_raw)).split('</li>')
			# Strip the <li> tag and the [ and ' characters left over from str(list)
			company_facts = [fact.replace('<li>', '').strip().replace('[', '').replace('\'', '') for fact in company_facts_str[:-1]]
			companies[company_name]['facts'] = company_facts
			companies[company_name]['img'] = logo['src']
		# One row per company, with desc, facts, and img as columns
		fiveam_df = pd.DataFrame(companies).T
		return fiveam_df
	else:
		print(f'5AM Ventures portfolio page not found. Response: {response}')
		print(f'  - Check if {fiveam_url} is valid.')
		return None

fiveam_df = fiveam_scrape()
fiveam_df

| | desc | facts | img |
|---|---|---|---|
| Achaogen | Achaogen (AKAO) is a biopharmaceutical company... | [Founded in 2002, IPO in March 2014, Based in ... | https://5amventures.com/wp-content/uploads/201... |
| Akouos | Akouos (AKUS) is a biotechnology company devel... | [Founded in 2016, IPO in June 2020, Acquired i... | https://5amventures.com/wp-content/uploads/201... |
| Alexza | Alexza (ALXA) is a biopharmaceutical company d... | [Founded in 2000, IPO in March 2006, Based in ... | https://5amventures.com/wp-content/uploads/201... |
| Ambrx | Ambrx is a biopharmaceutical company which int... | [Founded in 2003, Acquired in June 2015, Based... | https://5amventures.com/wp-content/uploads/201... |
| Aprea | Aprea (APRE) is a biopharmaceutical company de... | [Founded in 2003, IPO in October 2019, Based i... | https://5amventures.com/wp-content/uploads/201... |
| ... | ... | ... | ... |
| TMRW | TMRW is the world’s first and only integrated ... | [Founded in 2018, Based in New York City, NY, ... | https://5amventures.com/wp-content/uploads/201... |
| VBI Vaccines | VBI Vaccines is a biopharmaceutical company de... | [Founded in 2001, Merged in 2014, Based in Cam... | https://5amventures.com/wp-content/uploads/201... |
| Viveve | Viveve (VIVE) is a medical device company comm... | [Founded in 2005, Merged in 2014, Based in Eng... | https://5amventures.com/wp-content/uploads/201... |
| Vor Biopharma | Vor Biopharma (VOR) is a biotechnology company... | [Founded in 2016, IPO in February 2021, Based ... | https://5amventures.com/wp-content/uploads/201... |
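
The facts extraction in `fiveam_scrape` runs a regex over the stringified element and then strips leftover list syntax, which also removes any apostrophe inside a fact. A possible alternative, sketched below, reads the `data-facts` attribute through BeautifulSoup and parses its value as an HTML fragment. The helper name `parse_facts` and the sample markup are hypothetical: the markup is only inferred from what the regex implies, and the real page may be structured differently.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, modeled on what the regex in fiveam_scrape implies:
# data-facts holds entity-escaped <li> items. The real page may differ.
SAMPLE_HTML = '''
<li class="company-click" data-facts="&lt;li&gt;Founded in 2002&lt;/li&gt;&lt;li&gt;Based in Boston's Seaport&lt;/li&gt;">
  <img alt="ExampleCo" src="https://example.com/logo.png"/>
  <p>ExampleCo is a made-up company.</p>
</li>
'''

def parse_facts(company_soup):
    """Return the facts of one portfolio entry as a list of plain strings."""
    # Assumption: the attribute sits on the <li> itself or on a descendant
    if company_soup.has_attr('data-facts'):
        node = company_soup
    else:
        node = company_soup.find(attrs={'data-facts': True})
    if node is None:
        return []
    # BeautifulSoup has already unescaped the attribute value, so it can be
    # parsed as an HTML fragment in its own right
    facts_fragment = BeautifulSoup(node['data-facts'], 'html.parser')
    return [li.get_text(strip=True) for li in facts_fragment.find_all('li')]

company = BeautifulSoup(SAMPLE_HTML, 'html.parser').find('li', class_='company-click')
facts = parse_facts(company)
print(facts)
```

On the sample markup this yields one string per fact and keeps the apostrophe in the second one. If the approach holds up against the live page, `parse_facts(company_soup)` could stand in for the regex and clean-up lines inside the loop.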
