Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MAPREDUCE-7400. New MapReduce example - Sentiment Analysis #4808

Open
wants to merge 1 commit into
base: trunk
Choose a base branch
from

Conversation

MeetuPatel
Copy link

Description of PR

I am looking to add a new map reduce example, i.e, sentiment analysis. Sentiment analysis map reduce job helps in determining the sentiment score for a user. It takes each tweet made by an user and assigns a sentiment score for that tweet/sentence for a particular user and then aggregates the sentiment scores for all tweets made by all users.

This example takes the twitter dataset which contains users and the tweets made by users and gives the output as <username, sentiment score>. For each user, the sentiment score is calculated for all the tweets made by that particular user.

How was this patch tested?

This patch was tested using the twitter dataset on a single node Hadoop cluster in pseudo-distributed mode.

For code changes:

  • Does the title or this PR starts with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
  • Object storage: have the integration tests been executed and the endpoint declared according to the connector-specific documentation?
  • If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under ASF 2.0?
  • If applicable, have you updated the LICENSE, LICENSE-binary, NOTICE-binary files?

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Logfile Comment
+0 🆗 reexec 1m 2s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 1s No case conflicting files found.
+0 🆗 codespell 0m 0s codespell was not available.
+0 🆗 detsecrets 0m 0s detect-secrets was not available.
+0 🆗 xmllint 0m 0s xmllint was not available.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
+1 💚 test4tests 0m 0s The patch appears to include 1 new or modified test files.
_ trunk Compile Tests _
+1 💚 mvninstall 38m 19s trunk passed
+1 💚 compile 0m 39s trunk passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 compile 0m 36s trunk passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 checkstyle 0m 44s trunk passed
+1 💚 mvnsite 0m 42s trunk passed
+1 💚 javadoc 0m 53s trunk passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javadoc 0m 44s trunk passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 spotbugs 1m 11s trunk passed
+1 💚 shadedclient 20m 18s branch has no errors when building and testing our client artifacts.
_ Patch Compile Tests _
+1 💚 mvninstall 0m 26s the patch passed
+1 💚 compile 0m 28s the patch passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javac 0m 28s the patch passed
+1 💚 compile 0m 24s the patch passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
+1 💚 javac 0m 24s the patch passed
+1 💚 blanks 0m 0s The patch has no blanks issues.
-0 ⚠️ checkstyle 0m 22s /results-checkstyle-hadoop-mapreduce-project_hadoop-mapreduce-examples.txt hadoop-mapreduce-project/hadoop-mapreduce-examples: The patch generated 181 new + 15 unchanged - 0 fixed = 196 total (was 15)
+1 💚 mvnsite 0m 30s the patch passed
+1 💚 javadoc 0m 25s the patch passed with JDK Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1
+1 💚 javadoc 0m 24s the patch passed with JDK Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
-1 ❌ spotbugs 0m 56s /new-spotbugs-hadoop-mapreduce-project_hadoop-mapreduce-examples.html hadoop-mapreduce-project/hadoop-mapreduce-examples generated 2 new + 0 unchanged - 0 fixed = 2 total (was 0)
+1 💚 shadedclient 19m 59s patch has no errors when building and testing our client artifacts.
_ Other Tests _
-1 ❌ unit 0m 57s /patch-unit-hadoop-mapreduce-project_hadoop-mapreduce-examples.txt hadoop-mapreduce-examples in the patch passed.
-1 ❌ asflicense 0m 43s /results-asflicense.txt The patch generated 1 ASF License warnings.
92m 19s
Reason Tests
SpotBugs module:hadoop-mapreduce-project/hadoop-mapreduce-examples
Found reliance on default encoding in org.apache.hadoop.examples.SentimentAnalysis$SentimentMapper.getList(Mapper$Context):in org.apache.hadoop.examples.SentimentAnalysis$SentimentMapper.getList(Mapper$Context): new java.io.InputStreamReader(InputStream) At SentimentAnalysis.java:[line 84]
org.apache.hadoop.examples.SentimentAnalysis$SentimentMapper.getList(Mapper$Context) invokes System.exit(...), which shuts down the entire virtual machine At SentimentAnalysis.java:down the entire virtual machine At SentimentAnalysis.java:[line 92]
Failed junit tests hadoop.examples.TestSentiment
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4808/1/artifact/out/Dockerfile
GITHUB PR #4808
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient codespell detsecrets xmllint spotbugs checkstyle
uname Linux f43454b9fe70 4.15.0-191-generic #202-Ubuntu SMP Thu Aug 4 01:49:29 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision trunk / cf1463c
Default Java Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Private Build-11.0.15+10-Ubuntu-0ubuntu0.20.04.1 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_312-8u312-b07-0ubuntu1~20.04-b07
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4808/1/testReport/
Max. process+thread count 727 (vs. ulimit of 5500)
modules C: hadoop-mapreduce-project/hadoop-mapreduce-examples U: hadoop-mapreduce-project/hadoop-mapreduce-examples
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-4808/1/console
versions git=2.25.1 maven=3.6.3 spotbugs=4.2.2
Powered by Apache Yetus 0.14.0 https://yetus.apache.org

This message was automatically generated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants