In [1]:
import numpy as np
from scipy.fft import fft
import matplotlib.pyplot as plt
import pandas as pd

In [7]:
dummyDataFile = pd.read_excel("dummyData.xlsx")

dummyDataFile

Unnamed: 0,DATE,TEMP,HUMI,hello,Unnamed: 4,Unnamed: 5
0,,25,28,,,
1,,28,31,,,
2,,30,26,,,
3,,26,29,,,aaaaaaaaaaaaaaaaaaaaaaaa
4,,28,32,,,
...,...,...,...,...,...,...
2866,,29,28,,,
2867,,25,26,,,
2868,,29,30,,,
2869,,26,27,,,


In [24]:
# Read only the TEMP and HUMI columns (Excel columns B and C) from Sheet1
temphumiDF = pd.read_excel("dummyData.xlsx", sheet_name="Sheet1", nrows=2871, usecols="B:C")

temphumiDF

Unnamed: 0,TEMP,HUMI
0,25,28
1,28,31
2,30,26
3,26,29
4,28,32
...,...,...
2866,29,28
2867,25,26
2868,29,30
2869,26,27


In [25]:
tempOnly = temphumiDF['TEMP'].tolist()
humiOnly = temphumiDF['HUMI'].tolist()

# tempOnly
# humiOnly

In [26]:
tempVector = np.array(tempOnly)
humiVector = np.array(humiOnly)

print(tempVector)
print(humiVector)

[25 28 30 ... 29 26 28]
[28 31 26 ... 30 27 28]


Analyzing two weeks' worth of temperature and humidity data can provide valuable insights into weather patterns and environmental conditions. Here are some possible analyses you can perform:

1. **Trend Analysis:**
   - Identify any overall trend in temperature and humidity across the two-week period. Are the values increasing, decreasing, or remaining stable?

2. **Daily Variations:**
   - Examine the daily temperature and humidity patterns to see if there are any consistent variations, such as temperature highs and lows.

3. **Weekly Patterns:**
   - Look for weekly cycles or patterns. Some places experience different weather conditions on weekends compared to weekdays.

4. **Descriptive Statistics:**
   - Calculate summary statistics, such as mean, median, minimum, and maximum values for temperature and humidity during the two-week period.

5. **Histograms and Distributions:**
   - Create histograms to visualize the distribution of temperature and humidity data. Are they normally distributed or skewed?

6. **Correlation Analysis:**
   - Examine the relationship between temperature and humidity. Is there a correlation between the two variables?

7. **Anomalies and Outliers:**
   - Identify any unusual or extreme temperature or humidity values that deviate significantly from the norm.

8. **Comparative Analysis:**
   - Compare the data from the current two-week period to historical data to identify any anomalies or deviations from the norm.

9. **Weather Events:**
   - Look for patterns that might be associated with specific weather events, such as rain, storms, or heatwaves.

10. **Spatial Analysis:**
    - If you have data from multiple locations, compare temperature and humidity between those locations to analyze spatial variations.

11. **Time Series Analysis:**
    - Use time series analysis techniques to identify seasonality and cyclic patterns in the data.

12. **Predictive Modeling:**
    - Use the data to build predictive models for future temperature and humidity based on historical patterns and weather forecasts.

13. **Data Visualization:**
    - Create visualizations, such as line graphs, scatter plots, or heatmaps, to help convey the data's insights.

14. **Cross-Source Comparison:**
    - If you have access to data from other sensors or sources covering the same conditions, compare them with the two-week data to see if there are any significant differences.

15. **Weather Impact Analysis:**
    - Assess how the temperature and humidity data might be impacting various aspects of the environment, such as plant growth, energy consumption, or human comfort.

16. **Health and Safety Implications:**
    - Consider the implications of temperature and humidity on human health, such as heat stress or risk of mold growth in homes.

17. **Energy Efficiency Analysis:**
    - Explore how temperature and humidity affect energy consumption, especially in buildings with HVAC systems.

18. **Agricultural and Environmental Implications:**
    - Analyze the data to understand how temperature and humidity affect crop growth, irrigation needs, and local ecosystems.

19. **Climate Change Trends:**
    - Two weeks of data cannot show a climate trend on their own. If you have access to long-term historical data, check whether the two-week readings are consistent with the longer-term trends in that record.

20. **Threshold Analysis:**
    - Determine if temperature or humidity levels have crossed specific thresholds that trigger certain actions or alerts (e.g., frost warnings, heat advisories).

The specific analysis you choose to perform will depend on your goals and the context in which the data is collected. Additionally, you may use statistical software, programming languages, or data visualization tools to aid in your analysis.
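
A few of the analyses listed above (descriptive statistics, correlation, and outlier detection) can be sketched with pandas. This is a minimal illustration on made-up readings, not the notebook's actual results: the `summarize` helper, the `demo` frame, and the z-score cutoff of 3 are all assumptions introduced here.

```python
import numpy as np
import pandas as pd


def summarize(df, z_cutoff=3.0):
    """Return summary statistics, the TEMP/HUMI correlation, and outlier rows.

    Hypothetical helper: the z-score cutoff is an illustrative choice.
    """
    cols = df[["TEMP", "HUMI"]]
    stats = cols.agg(["mean", "median", "min", "max", "std"])
    corr = df["TEMP"].corr(df["HUMI"])  # Pearson correlation coefficient
    # z-score: distance from the column mean in units of standard deviation
    z = (cols - cols.mean()) / cols.std()
    outliers = df[(z.abs() > z_cutoff).any(axis=1)]
    return stats, corr, outliers


# Made-up readings standing in for temphumiDF
rng = np.random.default_rng(0)
demo = pd.DataFrame({
    "TEMP": rng.integers(25, 31, size=200),
    "HUMI": rng.integers(26, 33, size=200),
})
demo.loc[10, "TEMP"] = 60  # inject one obviously extreme value

stats, corr, outliers = summarize(demo)
print(stats)
print(f"TEMP/HUMI correlation: {corr:.3f}")
print(outliers)
```

With the real data, `summarize(temphumiDF)` would be the equivalent call. A z-score cutoff is only one way to flag outliers and works best when the readings are roughly symmetric around their mean.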

Measures of central tendency (mean, median, and mode) summarize what is typical in the data. They can be computed per time of day, per day of the week, or over the whole dataset. Here's what each measure means in each context:

**1. Time of Day:**

- **Mean:** The mean (average) value for a specific time of day is the typical or expected value of the data at that time. For example, the mean of all 2:00 PM temperatures over a two-week period tells you what temperature to expect at 2:00 PM on a typical day.

- **Median:** The median at a specific time of day represents the middle value when the data is ordered. It can help identify the typical or central value that is not skewed by extreme outliers.

- **Mode:** The mode at a particular time of day is the most frequently occurring value. It can reveal the most common conditions or situations at that time.

**2. Day of the Week:**

- **Mean:** Calculating the mean for a specific day of the week provides insight into the average value for that particular day. For example, the mean temperature on Sundays over two weeks helps identify the typical Sunday temperature.

- **Median:** The median on a specific day of the week helps identify the middle value for that day, which can be useful for understanding what's typical without being influenced by outliers.

- **Mode:** The mode for a particular day of the week indicates the most common value, which can be related to recurring patterns or events on that day.

**3. In General (Overall):**

- **Mean:** The overall mean is a measure of central tendency for the entire dataset. It is the average value across every time of day and day of the week that the data covers.

- **Median:** The overall median identifies the middle value of the entire dataset, which can be useful for understanding the central tendency of the data without being significantly influenced by extreme values.

- **Mode:** The overall mode is the most frequently occurring value in the entire dataset. It highlights the most common conditions or situations in the data.

Central tendencies are important for summarizing and understanding data across different time intervals, as they provide a way to describe what's typical, average, or most common. These statistics can be particularly useful when analyzing time-series data, such as temperature and humidity, as they help uncover patterns, trends, and variations over time or within specific time segments.
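
The three groupings described above can be sketched with a pandas `groupby`. The DATE column in the loaded sheet appears to be empty, so this sketch assumes evenly spaced readings and attaches placeholder timestamps. The start date, the 10-minute interval, the synthetic temperatures, and the `first_mode` helper are all assumptions made for illustration.

```python
import numpy as np
import pandas as pd

# ASSUMPTION: readings are evenly spaced. The start date and the 10-minute
# interval are placeholders, not values taken from dummyData.xlsx.
index = pd.date_range("2023-01-02 00:00", periods=14 * 24 * 6, freq="10min")

# Synthetic temperatures with a daily cycle, standing in for tempVector
hours = index.hour.to_numpy() + index.minute.to_numpy() / 60
temp = np.round(27 + 3 * np.sin(2 * np.pi * (hours - 9) / 24)).astype(int)
series = pd.Series(temp, index=index, name="TEMP")


def first_mode(values):
    """Most frequent value; if several tie, return the smallest."""
    return values.mode().iloc[0]


# 1. Time of day: one row per hour (0-23)
by_hour = series.groupby(series.index.hour).agg(["mean", "median", first_mode])

# 2. Day of the week: one row per weekday name
by_weekday = series.groupby(series.index.day_name()).agg(["mean", "median", first_mode])

# 3. Overall: a single value per measure
overall = pd.Series({
    "mean": series.mean(),
    "median": series.median(),
    "first_mode": first_mode(series),
})

print(by_hour.head())
print(by_weekday)
print(overall)
```

Because a dataset can have several modes, the helper returns only the smallest one; listing all of them with `values.mode()` may be more informative for real readings.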

Analyzing the collected data, with a focus on central tendencies at different time intervals, involves several steps. Here's a general process to follow:

1. **Data Collection:**
   Collect and organize your temperature and humidity data. Ensure that the data is timestamped, indicating the time and date of each measurement.

2. **Data Cleaning:**
   Before analysis, clean the data to handle missing values, outliers, and inconsistencies. This step is crucial to ensure the data's quality.

3. **Time Segmentation:**
   Depending on your analysis goals, segment the data into the relevant time intervals. This can include time of day (e.g., hourly), day of the week, or specific date ranges.

4. **Calculate Central Tendencies:**
   For each time interval or segment, calculate the central tendencies you're interested in:

   - **Mean:** Sum all values in the interval and divide by the number of data points.
   - **Median:** Arrange the data points in ascending order and take the middle value (or the average of the two middle values when the count is even).
   - **Mode:** Identify the most frequently occurring value in the interval.

5. **Visualize the Data:**
   Create plots and graphs to visualize the central tendencies over time. Line plots, bar charts, and histograms can help you see trends and variations.

6. **Statistical Analysis:**
   Perform statistical tests to determine the significance of central tendency differences between time intervals or days of the week. For example, you can use t-tests or ANOVA to compare means.

7. **Interpret the Results:**
   Analyze the central tendencies and statistical results to draw conclusions. Determine whether there are any patterns or significant differences between time intervals or days.

8. **Inferential Analysis:**
   Depending on your data, you may want to make inferences or predictions. Time series forecasting, regression analysis, or other modeling techniques can be used for this purpose.

9. **Report and Visualization:**
   Present your findings in a clear and understandable way. Use visualizations, tables, and written explanations to communicate your results effectively.

10. **Consider Additional Factors:**
    While central tendencies are important, consider other factors that may influence the data, such as external events (e.g., holidays, weather events) or seasonal variations.

11. **Repeat as Needed:**
    If your analysis requires deeper investigation or comparison over longer periods, repeat the process with different time intervals or subsets of the data.

12. **Document Your Methodology:**
    Keep detailed records of your data cleaning, analysis steps, and the tools and techniques you used. This documentation is crucial for transparency and reproducibility.

13. **Make Informed Decisions:**
    Finally, use your analysis to make informed decisions or take actions based on the insights gained from the central tendencies of your data.

The specific tools and software you use for data analysis may vary depending on your preferences and expertise. Common tools for data analysis include Microsoft Excel, Python with libraries like NumPy, pandas, and Matplotlib, R, and specialized statistical software packages.
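
As an illustration of the statistical comparison in step 6, the sketch below runs a one-way ANOVA across days of the week with `scipy.stats.f_oneway`. The data is synthetic (hourly readings with artificially warmer weekends), and the timestamps are assumed. Consecutive readings in a real time series are usually correlated, which violates the independence assumption of ANOVA, so a p-value from real data should be treated as a rough indication only.

```python
import numpy as np
import pandas as pd
from scipy import stats

# ASSUMPTION: one reading per hour for two weeks, starting on a Monday.
index = pd.date_range("2023-01-02", periods=14 * 24, freq="60min")

# Synthetic temperatures; weekends are made 2 degrees warmer on purpose
# so that the test has a real difference to detect.
rng = np.random.default_rng(42)
weekend = index.dayofweek >= 5
values = 27 + rng.normal(0, 1.5, size=len(index)) + 2.0 * weekend
temp = pd.Series(values, index=index, name="TEMP")

# One group of readings per day of the week (0 = Monday ... 6 = Sunday)
groups = [g.to_numpy() for _, g in temp.groupby(temp.index.dayofweek)]

# One-way ANOVA: the null hypothesis is that all seven group means are equal
f_stat, p_value = stats.f_oneway(*groups)
print(f"F = {f_stat:.2f}, p = {p_value:.4g}")
```

A small p-value suggests that at least one day's mean differs from the others; it does not say which day. Comparing two specific days would call for a follow-up test such as `scipy.stats.ttest_ind`.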

The periodicity of data refers to the presence of recurring patterns or cycles at regular intervals within the data. In other words, it signifies that the data exhibits a predictable and repetitive behavior over time. Periodic data can be found in various fields, including finance, economics, meteorology, and many others.

Key characteristics of periodic data include:

1. **Regular Intervals:** Periodic data displays patterns or cycles that occur at consistent time intervals. For example, daily temperature data often exhibits a 24-hour periodicity with daytime highs and nighttime lows.

2. **Predictability:** Periodic data allows you to make predictions about future values based on the observed patterns. If you identify a recurring cycle, you can anticipate when certain events or conditions are likely to repeat.

3. **Seasonality:** Seasonal patterns are a common form of periodicity. For instance, retail sales tend to spike during holiday seasons every year, demonstrating an annual periodicity.

4. **Frequency:** The frequency of a periodic pattern refers to how often it repeats within a given time frame. For example, a daily temperature cycle has a frequency of one cycle per day, while a weekly pattern repeats once every seven days.

5. **Amplitude and Phase:** A periodic pattern is characterized by its amplitude (how far values swing above and below the cycle's midpoint) and its phase (where in the cycle the pattern starts, i.e., its timing or alignment).

6. **Waveform Analysis:** Periodic data can often be described using waveform functions, such as sine or cosine waves, to model the underlying patterns.

Understanding the periodicity of data is crucial for various applications, including:

- **Forecasting:** Identifying and characterizing periodic patterns helps in making predictions about future data points, especially in time series analysis.

- **Anomaly Detection:** Detecting deviations from expected periodic patterns can help in identifying unusual events or outliers.

- **Resource Allocation:** Businesses and organizations can use knowledge of periodic patterns to allocate resources effectively, such as staffing during peak demand times.

- **Optimization:** Periodic data can inform decisions on scheduling, inventory management, and maintenance activities.

Analyzing periodic data may involve using techniques like Fourier analysis, spectral analysis, or time series modeling to extract and quantify the periodic components. Understanding the periodicity of data can lead to more informed decision-making and improved modeling of systems with cyclical behaviors.
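
As a rough sketch of Fourier analysis, the example below uses `scipy.fft` (which the notebook already imports from) to find the dominant period in a synthetic signal. The 10-minute sampling interval and the signal itself are assumptions; the real readings would replace `signal` once their sampling interval is known.

```python
import numpy as np
from scipy.fft import rfft, rfftfreq

# ASSUMPTION: one reading every 10 minutes for 14 days; the signal is synthetic.
dt_hours = 10 / 60
t = np.arange(14 * 24 * 6) * dt_hours  # elapsed time in hours
rng = np.random.default_rng(1)
signal = 27 + 3 * np.sin(2 * np.pi * t / 24) + rng.normal(0, 0.5, t.size)

# Remove the mean so the zero-frequency bin does not dominate the spectrum
centered = signal - signal.mean()

spectrum = np.abs(rfft(centered)) ** 2       # unnormalized power per frequency bin
freqs = rfftfreq(centered.size, d=dt_hours)  # frequencies in cycles per hour

peak = np.argmax(spectrum[1:]) + 1  # skip the zero-frequency bin
dominant_period_hours = 1 / freqs[peak]
print(f"Dominant period: {dominant_period_hours:.1f} hours")
```

`rfft` is used instead of `fft` because the input is real-valued, so only the non-negative frequencies are needed. The frequency resolution is one over the total duration, so two weeks of data can separate a daily cycle cleanly but can only coarsely resolve a weekly one.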

Analyzing the periodicity of collected temperature and humidity data involves identifying and characterizing recurring patterns in the data. To do this, you can follow these steps:

1. **Data Collection and Preparation:**
   - Ensure you have collected and organized your temperature and humidity data with timestamps.

2. **Data Cleaning:**
   - Clean the data to handle missing values, outliers, and inconsistencies. This is a crucial step to ensure the data's quality.

3. **Time Series Plot:**
   - Create a time series plot to visualize the entire dataset. This can help you identify any visible patterns or trends.

4. **Resampling:**
   - If the readings are not evenly spaced, resample them to a consistent time interval (e.g., hourly or daily). FFT-based methods assume uniform sampling.

5. **Periodogram Analysis:**
   - Compute a periodogram, an estimate of how the signal's power is distributed across frequencies. Plotting it helps identify dominant periodicities. The Fast Fourier Transform (FFT) is the usual way to compute it.

6. **Peak Detection:**
   - Identify the peaks in the periodogram. These peaks correspond to the dominant periodic frequencies in your data.

7. **Waveform Analysis:**
   - For each dominant periodicity identified, fit a sinusoidal wave (e.g., sine or cosine function) to the data to describe the periodic pattern. Determine the amplitude, phase, and frequency of the wave.

8. **Visualization:**
   - Plot the fitted sinusoidal waves on top of the original data to visualize how well they capture the periodic patterns.

9. **Statistical Analysis:**
   - Use statistical tests to assess the significance of the periodic patterns. You can conduct hypothesis tests to determine if the periodicity is statistically significant.

10. **Modeling:**
    - Depending on the complexity of the periodic patterns, you may consider time series modeling techniques like seasonal decomposition (e.g., STL decomposition) to separate the data into trend, seasonal, and residual components.

11. **Further Analysis:**
    - If there are multiple periodic patterns (e.g., daily and weekly), analyze each separately and examine how they interact.

12. **Interpretation:**
    - Interpret the results in the context of your analysis goals. For example, in this temperature and humidity data you might find daily temperature variations and weekly humidity patterns.

13. **Usefulness:**
    - Consider how the knowledge of periodicity can be used for various applications, such as weather forecasting, energy management, or resource allocation.

14. **Reporting:**
    - Document your analysis process, findings, and any actions or decisions based on the identified periodicity.

15. **Further Research:**
    - If you want to explore the data further, you can investigate the relationships between periodicity and external factors (e.g., weather events, seasonality) that may influence the observed patterns.

The choice of tools and techniques for periodicity analysis may depend on the complexity of the data and your expertise. Common options include Python with NumPy, pandas, Matplotlib, and SciPy (for signal processing), or other environments such as R or MATLAB.
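
Steps 5 through 8 above (periodogram, peak detection, and sinusoid fitting) can be sketched with `scipy.signal` and a linear least-squares fit. Everything here is illustrative: the data is synthetic, the sampling rate of six readings per hour is assumed, and the prominence threshold used for peak detection is a heuristic.

```python
import numpy as np
from scipy.signal import find_peaks, periodogram

# ASSUMPTION: evenly spaced readings, six per hour, for 14 days (synthetic data)
fs = 6.0                              # samples per hour
t = np.arange(14 * 24 * 6) / fs       # elapsed time in hours
rng = np.random.default_rng(7)
temp = 27 + 3 * np.cos(2 * np.pi * t / 24 - 1.0) + rng.normal(0, 0.5, t.size)

# Step 5: periodogram (power spectral density estimate), mean removed first
freqs, power = periodogram(temp, fs=fs, detrend="constant")

# Step 6: keep only peaks that stand out; the threshold is a heuristic
peaks, _ = find_peaks(power, prominence=0.1 * power.max())
f0 = freqs[peaks[np.argmax(power[peaks])]]  # strongest peak, cycles per hour

# Step 7: fit mean + a*cos(2*pi*f0*t) + b*sin(2*pi*f0*t) by least squares
design = np.column_stack([
    np.ones_like(t),
    np.cos(2 * np.pi * f0 * t),
    np.sin(2 * np.pi * f0 * t),
])
(mean, a, b), *_ = np.linalg.lstsq(design, temp, rcond=None)
amplitude = np.hypot(a, b)
phase = np.arctan2(b, a)  # fit equals mean + amplitude*cos(2*pi*f0*t - phase)

# Step 8 (numeric stand-in for the overlay plot): how much is left unexplained
fitted = design @ np.array([mean, a, b])
residual_std = np.std(temp - fitted)

print(f"period = {1 / f0:.1f} h, amplitude = {amplitude:.2f}, phase = {phase:.2f} rad")
print(f"residual standard deviation = {residual_std:.2f}")
```

Writing the sinusoid as a sum of a cosine and a sine term keeps the fit linear in its unknowns, so no initial guesses are needed. Plotting `fitted` over `temp` with Matplotlib would give the visual check described in step 8.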