2.Symbol: Ticker code or listed company code on NY stock exchange
3.Open: Intra-day opening price for each listed company
4.Close: Intra-day closing price for each listed company
5.Low: Intra-day lowest price for each listed company
6.High: Intra-day highest price for each listed company
7.Volume: Number of shares traded per day per company
1.Ticker_Symbol: Country to which the customer belongs
2.Security: Legal name of the listed company
3.Sector: Business vertical of the listed company
4.Sub_Industry: Business domain of the listed company within a Sector.
5.Headquarter: Headquarters address
1) Create a data pipeline using sqoop to pull the data from the table below from MYSQL server into Hive.
i. Stock_prices
ii. Stock_companies
Check the TABLE description:
- Create a new hive table with the following fields by joining the above two hive tables.
Please use appropriate Hive built-in functions for columns (a,b,e and h to l).
- Trading_year: Should contain YYYY for each record
- Trading_month: Should contain MM or MMM for each record
- Symbol: Ticker code
- CompanyName: Legal name of the listed company
- State: State to be extracted from headquarters value.
- Sector: Business vertical of the listed company
- Sub_Industry: Business domain of the listed company within a sector
- Open: Average of intra-day opening price by month and year for each listed company
- Close: Average of intra-day closing price by month and year for each listed company
- Low: Average of intra-day lowest price by month and year for each listed company
- High: Average of intra-day highest price by month and year for each listed company
- Volume: Average of number of shares traded by month and year for each listed company
3) Find the top five companies that are good for investment
4) Show the best-growing industry by each state, having at least two or more industries mapped.
5) For each sector find the following.
a.) Worst year
b.) Best year
c.) Stable year