# **Project: Social Media Analytics**

### **Problem Statement:**

You have been assigned a social media analytics project to analyze a comprehensive dataset from a popular social media platform. The dataset comprises information about users, posts, comments, and likes. Your primary objective is to extract meaningful insights to assist the platform in understanding user behavior, engagement patterns, and overall social media trends.

### **Dataset Description:**
The dataset includes the following tables:

1. **Users:**
   - UserID (unique identifier)
   - UserName
   - RegistrationDate

2. **Posts:**
   - PostID (unique identifier)
   - UserID (foreign key to Users table)
   - PostText
   - PostDate

3. **Comments:**
   - CommentID (unique identifier)
   - PostID (foreign key to Posts table)
   - UserID (foreign key to Users table)
   - CommentText
   - CommentDate

4. **Likes:**
   - LikeID (unique identifier)
   - PostID (foreign key to Posts table)
   - UserID (foreign key to Users table)
   - LikeDate
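
The four tables above can be sketched as an SQLite schema created from Python. This is one possible setup, not a required one: the column types, the `NOT NULL` constraints, the in-memory database, and the helper name `create_schema` are all assumptions, since the description only lists column names and key relationships.

```python
import sqlite3

# Assumed types: integer keys, text for names and content, ISO-8601 text for dates.
SCHEMA = """
CREATE TABLE Users (
    UserID           INTEGER PRIMARY KEY,
    UserName         TEXT NOT NULL,
    RegistrationDate TEXT NOT NULL
);
CREATE TABLE Posts (
    PostID   INTEGER PRIMARY KEY,
    UserID   INTEGER NOT NULL REFERENCES Users(UserID),
    PostText TEXT NOT NULL,
    PostDate TEXT NOT NULL
);
CREATE TABLE Comments (
    CommentID   INTEGER PRIMARY KEY,
    PostID      INTEGER NOT NULL REFERENCES Posts(PostID),
    UserID      INTEGER NOT NULL REFERENCES Users(UserID),
    CommentText TEXT NOT NULL,
    CommentDate TEXT NOT NULL
);
CREATE TABLE Likes (
    LikeID   INTEGER PRIMARY KEY,
    PostID   INTEGER NOT NULL REFERENCES Posts(PostID),
    UserID   INTEGER NOT NULL REFERENCES Users(UserID),
    LikeDate TEXT NOT NULL
);
"""


def create_schema(conn):
    """Create the four tables (hypothetical helper, not part of the brief)."""
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)


conn = sqlite3.connect(":memory:")
create_schema(conn)

# List the tables that now exist, in alphabetical order.
tables = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
]
print(tables)
```

Storing dates as ISO-8601 text keeps them sortable as plain strings and usable with SQLite's date functions.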

### **Questions:**

1. **User Engagement:**
   - How many users are registered on the platform?
   - What is the average registration date of users?
   - Identify users who have been registered for more than a year.

2. **Post Analysis:**
   - How many posts are there in the dataset?
   - What is the average number of words per post?
   - Identify the top three users with the highest average post length.

3. **Comment Analysis:**
   - How many comments are there in the dataset?
   - What is the average number of words per comment?
   - Identify the top three users with the highest average comment length.

4. **Like Analysis:**
   - How many likes are there in the dataset?
   - What is the average number of likes per user?
   - Identify the top three users with the highest average likes received.

5. **User Interaction:**
   - Identify users who have posted, commented, and liked.
   - Identify posts that have both comments and likes.
   - Identify users who have posted multiple times in a day.

6. **Temporal Analysis:**
   - Determine the average time gap between posting and receiving the first comment.
   - Identify the days of the week with the highest user activity.

7. **Engagement Trends:**
   - Analyze the trend of user engagement over the months.

8. **Influence Analysis:**
   - Identify users with the highest cumulative engagement (posts, comments, and likes).
   - Determine the most engaging post based on a combination of comments and likes.

9. **User Activity Patterns:**
   - Determine the average number of posts made by users per month.


10. **Post Engagement Analysis:**
    - Calculate the average number of comments and likes per post.
    - Identify posts that have a high ratio of comments to likes.

11. **Comment Length Distribution:**
    - Analyze the distribution of comment lengths and identify any outliers.
    - Determine the average comment length for each user.
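
A few of the questions above (average words per post or comment, the time gap to the first comment, and activity by day of the week) can be sketched in plain Python. This is a minimal illustration rather than a prescribed method. It embeds the first two posts and first three comments from the sample data. The function names are hypothetical, and "activity" is assumed here to mean posts plus comments.

```python
from collections import Counter
from datetime import datetime
from statistics import mean

# (PostID, UserID, PostText, PostDate)
posts = [
    (1, 1, "Excited about my weekend plans!", "2023-01-10 08:30:00"),
    (2, 2, "Just finished a great book!", "2023-01-11 12:45:00"),
]
# (CommentID, PostID, UserID, CommentText, CommentDate)
comments = [
    (1, 1, 2, "Sounds like fun!", "2023-01-10 09:00:00"),
    (2, 1, 3, "Wish I could join!", "2023-01-10 09:30:00"),
    (3, 2, 1, "What book did you read?", "2023-01-11 13:00:00"),
]

DAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday"]


def parse(timestamp):
    """Parse the 'YYYY-MM-DD HH:MM:SS' format used in the sample tables."""
    return datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S")


def average_words(texts):
    """Average number of whitespace-separated words per text."""
    return mean(len(text.split()) for text in texts)


def average_first_comment_gap_minutes(posts, comments):
    """Average minutes between a post and its earliest comment.

    Posts without any comment are skipped.
    """
    first_comment = {}
    for _, post_id, _, _, comment_date in comments:
        ts = parse(comment_date)
        if post_id not in first_comment or ts < first_comment[post_id]:
            first_comment[post_id] = ts
    gaps = [
        (first_comment[post_id] - parse(post_date)).total_seconds() / 60
        for post_id, _, _, post_date in posts
        if post_id in first_comment
    ]
    return mean(gaps)


def activity_by_weekday(posts, comments):
    """Count posts and comments per day of the week."""
    dates = [p[3] for p in posts] + [c[4] for c in comments]
    return Counter(DAYS[parse(d).weekday()] for d in dates)


avg_post_words = average_words(p[2] for p in posts)
avg_comment_words = average_words(c[3] for c in comments)
avg_gap = average_first_comment_gap_minutes(posts, comments)
activity = activity_by_weekday(posts, comments)

print(avg_post_words, avg_comment_words)  # 5 4
print(avg_gap)                            # 22.5
print(activity.most_common(1))            # [('Tuesday', 3)]
```

Counting words with `str.split()` treats punctuation as part of a word, which is usually adequate for length comparisons. A stricter tokenizer would change the counts slightly.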



Below is a sample of data that you can use to populate the Users, Posts, Comments, and Likes tables for your social media analytics project:

**Note:** For a more thorough analysis, add 25 more rows to each of the four tables (Users, Posts, Comments, and Likes). Vary user behavior, post content, and engagement levels so the data is more realistic. Keep the foreign key relationships between tables valid, and spread the dates across different timeframes.
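
When adding rows by hand, it is easy to reference a PostID or UserID that does not exist. The sketch below is one way to check for such orphaned references with a `LEFT JOIN` per foreign key. The helper name `find_orphans` and the deliberately broken second like row are made up for illustration and are not part of the sample data.

```python
import sqlite3

# (child table, foreign-key column, parent table, parent key column)
FOREIGN_KEYS = [
    ("Posts", "UserID", "Users", "UserID"),
    ("Comments", "PostID", "Posts", "PostID"),
    ("Comments", "UserID", "Users", "UserID"),
    ("Likes", "PostID", "Posts", "PostID"),
    ("Likes", "UserID", "Users", "UserID"),
]


def find_orphans(conn):
    """Return (table, column, value) for each foreign-key value with no parent row."""
    orphans = []
    for child, col, parent, key in FOREIGN_KEYS:
        query = (
            f"SELECT DISTINCT c.{col} FROM {child} AS c "
            f"LEFT JOIN {parent} AS p ON c.{col} = p.{key} "
            f"WHERE p.{key} IS NULL"
        )
        orphans += [(child, col, row[0]) for row in conn.execute(query)]
    return orphans


# Tiny demo database; the second like is intentionally invalid.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Users (UserID INTEGER PRIMARY KEY, UserName TEXT, RegistrationDate TEXT);
CREATE TABLE Posts (PostID INTEGER PRIMARY KEY, UserID INTEGER, PostText TEXT, PostDate TEXT);
CREATE TABLE Comments (CommentID INTEGER PRIMARY KEY, PostID INTEGER, UserID INTEGER,
                       CommentText TEXT, CommentDate TEXT);
CREATE TABLE Likes (LikeID INTEGER PRIMARY KEY, PostID INTEGER, UserID INTEGER, LikeDate TEXT);
INSERT INTO Users VALUES (1, 'Alice', '2023-01-01');
INSERT INTO Posts VALUES (1, 1, 'Excited about my weekend plans!', '2023-01-10 08:30:00');
INSERT INTO Likes VALUES (1, 1, 1, '2023-01-10 09:15:00');
INSERT INTO Likes VALUES (2, 42, 1, '2023-01-10 09:20:00');  -- PostID 42 does not exist
""")

orphans = find_orphans(conn)
print(orphans)  # [('Likes', 'PostID', 42)]
```

An empty result means every foreign key in the expanded dataset points at an existing row.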

**Users Table:**

| UserID | UserName    | RegistrationDate |
|--------|-------------|-------------------|
| 1      | Alice       | 2023-01-01        |
| 2      | Bob         | 2023-01-02        |
| 3      | Charlie     | 2023-01-03        |
| 4      | David       | 2023-01-04        |
| 5      | Emily       | 2023-01-05        |

**Posts Table:**

| PostID | UserID | PostText                            | PostDate            |
|--------|--------|-------------------------------------|---------------------|
| 1      | 1      | Excited about my weekend plans!     | 2023-01-10 08:30:00 |
| 2      | 2      | Just finished a great book!         | 2023-01-11 12:45:00 |
| 3      | 3      | Exploring new hiking trails today.  | 2023-01-12 15:20:00 |
| 4      | 4      | Trying out a new recipe for dinner. | 2023-01-13 18:10:00 |
| 5      | 5      | Movie night with friends!           | 2023-01-14 21:05:00 |

**Comments Table:**

| CommentID | PostID | UserID | CommentText                             | CommentDate         |
|-----------|--------|--------|-----------------------------------------|---------------------|
| 1         | 1      | 2      | Sounds like fun!                        | 2023-01-10 09:00:00 |
| 2         | 1      | 3      | Wish I could join!                      | 2023-01-10 09:30:00 |
| 3         | 2      | 1      | What book did you read?                 | 2023-01-11 13:00:00 |
| 4         | 3      | 2      | Let me know if you find any hidden gems | 2023-01-12 16:00:00 |
| 5         | 4      | 5      | Share the recipe, please!               | 2023-01-13 19:00:00 |

**Likes Table:**

| LikeID | PostID | UserID | LikeDate             |
|--------|--------|--------|----------------------|
| 1      | 1      | 3      | 2023-01-10 09:15:00  |
| 2      | 2      | 4      | 2023-01-11 13:30:00  |
| 3      | 3      | 1      | 2023-01-12 16:30:00  |
| 4      | 4      | 2      | 2023-01-13 19:30:00  |
| 5      | 5      | 3      | 2023-01-14 21:30:00  |

This sample data provides a starting point for your social media analytics project. Feel free to customize or expand upon it based on the specific requirements and scenarios you'd like to explore in your project.
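
As a starting point, the sample rows above can be loaded into an in-memory SQLite database and queried from Python. This is a hedged sketch, not a reference solution. The helper `load_sample`, the untyped columns, and the choice to score engagement as comments plus likes are assumptions made for illustration.

```python
import sqlite3

USERS = [
    (1, "Alice", "2023-01-01"), (2, "Bob", "2023-01-02"), (3, "Charlie", "2023-01-03"),
    (4, "David", "2023-01-04"), (5, "Emily", "2023-01-05"),
]
POSTS = [
    (1, 1, "Excited about my weekend plans!", "2023-01-10 08:30:00"),
    (2, 2, "Just finished a great book!", "2023-01-11 12:45:00"),
    (3, 3, "Exploring new hiking trails today.", "2023-01-12 15:20:00"),
    (4, 4, "Trying out a new recipe for dinner.", "2023-01-13 18:10:00"),
    (5, 5, "Movie night with friends!", "2023-01-14 21:05:00"),
]
COMMENTS = [
    (1, 1, 2, "Sounds like fun!", "2023-01-10 09:00:00"),
    (2, 1, 3, "Wish I could join!", "2023-01-10 09:30:00"),
    (3, 2, 1, "What book did you read?", "2023-01-11 13:00:00"),
    (4, 3, 2, "Let me know if you find any hidden gems", "2023-01-12 16:00:00"),
    (5, 4, 5, "Share the recipe, please!", "2023-01-13 19:00:00"),
]
LIKES = [
    (1, 1, 3, "2023-01-10 09:15:00"), (2, 2, 4, "2023-01-11 13:30:00"),
    (3, 3, 1, "2023-01-12 16:30:00"), (4, 4, 2, "2023-01-13 19:30:00"),
    (5, 5, 3, "2023-01-14 21:30:00"),
]


def load_sample(conn):
    """Create minimal tables and insert the sample rows (hypothetical helper)."""
    conn.executescript("""
    CREATE TABLE Users (UserID INTEGER PRIMARY KEY, UserName, RegistrationDate);
    CREATE TABLE Posts (PostID INTEGER PRIMARY KEY, UserID, PostText, PostDate);
    CREATE TABLE Comments (CommentID INTEGER PRIMARY KEY, PostID, UserID,
                           CommentText, CommentDate);
    CREATE TABLE Likes (LikeID INTEGER PRIMARY KEY, PostID, UserID, LikeDate);
    """)
    conn.executemany("INSERT INTO Users VALUES (?, ?, ?)", USERS)
    conn.executemany("INSERT INTO Posts VALUES (?, ?, ?, ?)", POSTS)
    conn.executemany("INSERT INTO Comments VALUES (?, ?, ?, ?, ?)", COMMENTS)
    conn.executemany("INSERT INTO Likes VALUES (?, ?, ?, ?)", LIKES)
    conn.commit()


conn = sqlite3.connect(":memory:")
load_sample(conn)

# Row counts per table.
row_counts = {
    table: conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    for table in ("Users", "Posts", "Comments", "Likes")
}

# Users who have posted, commented, and liked.
active_users = [row[0] for row in conn.execute("""
    SELECT UserName FROM Users
    WHERE UserID IN (SELECT UserID FROM Posts)
      AND UserID IN (SELECT UserID FROM Comments)
      AND UserID IN (SELECT UserID FROM Likes)
    ORDER BY UserID
""")]

# Posts that have both comments and likes.
posts_with_both = [row[0] for row in conn.execute("""
    SELECT PostID FROM Posts
    WHERE PostID IN (SELECT PostID FROM Comments)
      AND PostID IN (SELECT PostID FROM Likes)
    ORDER BY PostID
""")]

# Most engaging post, scored as comments + likes (ties broken by lowest PostID).
top_post = conn.execute("""
    SELECT p.PostID,
           (SELECT COUNT(*) FROM Comments c WHERE c.PostID = p.PostID)
         + (SELECT COUNT(*) FROM Likes l WHERE l.PostID = p.PostID) AS score
    FROM Posts p
    ORDER BY score DESC, p.PostID
    LIMIT 1
""").fetchone()

print(row_counts)       # {'Users': 5, 'Posts': 5, 'Comments': 5, 'Likes': 5}
print(active_users)     # ['Alice', 'Bob', 'Charlie']
print(posts_with_both)  # [1, 2, 3, 4]
print(top_post)         # (1, 3)
```

The engagement score uses correlated subqueries rather than joining Comments and Likes together, because joining both tables to Posts at once would multiply rows and inflate the counts.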