Update quickstart.md by findinpath · Pull Request #270 · apache/accumulo-website

findinpath · 2021-04-07T14:10:09Z

Update quickstart.md to contain explicitly all the necessary commands needed to setup the accumulo environment on a single node.
When starting only tserver, some of the commands (e.g. : createtable mytable )executed on the accumulo shell will hang undefinitely

Update quickstart.md to contain explicitly all the necessary commands needed to setup the accumulo environment on a single node. When starting only tserver, some of the commands (e.g. : `createtable mytable` )executed on the accumulo shell will hang undefinitely

EdColeman · 2021-04-07T15:11:08Z

_docs-2/getting-started/quickstart.md


+    accumulo-service master start
    accumulo-service tserver start
+    accumulo-service monitor start


You can use the accumulo-cluster cmd to start all of the services either on a single node or on multiple nodes - these commands are only necessary if you elect to run each manually (or say you need to restart just one) Maybe it would be more user-friendly if the accumulo-cluster command section came before this section?

Also, for a complete installation I believe that you might be missing necessary services (gc). Then maybe this could be
accumulo [service-name] or accumulo-service [service-name] start | stop with a more complete list of the required service names?

@EdColeman i'm a newbie with accumulo and while trying to create a table i simply didn't know why it hangs.
I've spent more than an hour debugging and pondering about it, and afterwards I came to see the fact that there are more services to start, not only the tserver.

Obviously my modification is quite naive because there are probably more services. So what you are suggesting with accumulo [service-name] fits much better.

The quick start for me wasn't so quick :)

I came across the sample code https://github.com/apache/accumulo-examples/blob/main/docs/sample.md which gives a few pointers on how to work with the accumulo shell - I find it quite good for a quick start - to get a feeling about what accumulo actually does.

The docs start with consider using fluo-uno - that's the easiest on ramp that I know of. Is there a reason that you did not try it? Would changing / adding wording there have made you more likely to use fluo-uno?

Moving the cluster command before the individual commands might have made your experience easier - with accumulo-cluster start, things should have been started and removes the beginner from needing five commands instead of one.

But with that, uno fetch accumulo, uno start accumulo is way easier than setting up accumulo from scratch.

A bit unrelated, but another small issue I've encountered was the ClassNotFoundError regarding zookeeper's KeeperException class.

Cause of it :
accumulo-env.sh

CLASSPATH="${CLASSPATH}:${lib}/*:${HADOOP_CONF_DIR}:${ZOOKEEPER_HOME}/*:${HADOOP_HOME}/share/hadoop/client/*"

replaced it with

CLASSPATH="${CLASSPATH}:${lib}/*:${HADOOP_CONF_DIR}:${ZOOKEEPER_HOME}/lib/*:${HADOOP_HOME}/share/hadoop/client/*"

(added lib after ZOOKEEPER_HOME)

But with that, uno fetch accumulo, uno start accumulo is way easier than setting up accumulo from scratch.

I'm looking now through the code of uno and see that it downloads everything what is needed. I am sorry, but due to the fact that I have already zookeeper and hadoop on my machine I thought to opt to simply install the bin archive of accumulo.

For the zookeeper issue we'd need to know the zookeeper and accumlo versions. There were changes in ZooKeeper 3.x series that modified where zookeeper store jars and separated the jute (zk comms layer) into a separate jar. That's one benefit of uno - it should download compatible versions of things - it also allows you to point at you local reop and build / run accumulo with your changes if you think you might go down that route in the future.

I tried with both apache zookeeper 3.5.9 and zookeeper 3.7.0
In both of them, the .jar libraries are located under ZOOKEEPER_HOME/lib directory.

What version of Accumulo?

Since that is unrelated, let's not bog this issue down with that issue. See apache/accumulo#1530 for more.

ctubbsii · 2021-04-07T16:24:35Z

_docs-2/getting-started/quickstart.md

 Start Accumulo processes (tserver, master, monitor, etc) using command below:

+    accumulo master
    accumulo tserver
+    accumulo monitor


I think this was an example explaining how to start an individual service, generally. Rather than add the other services as an example, it would probably be better to reword the instructions to make it clear that this is just one example.

(similar comment below)

I made some suggested improvements in #271 . Please take a look and see if they help alleviate the concerns you were trying to address in this PR.

It looks good to me, but now I am bit biased because I read in the meantime more about the architecture of accumulo and know now a bit how to use it.

I'd highlight which of the services need to run on the local installation at the very least. I don't run for example GC and it's still fine. You may say that I should have used Uno, but maybe some other users will try their luck also by doing directly a local installation of accumulo.

Also important is a link to the wonderful accumulo-examples repository. This is a gem containing quite useful stuff to get started with accumulo.

Troubleshooting is also quite welcome. Without troubleshooting tips some of the users will stop early after trying to setup accumulo.

findinpath · 2021-04-08T15:05:25Z

Latest 2.0.1

…

On Thu, Apr 8, 2021 at 4:36 PM EdColeman ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In _docs-2/getting-started/quickstart.md <#270 (comment)> : > accumulo-service tserver start + accumulo-service monitor start What version of Accumulo? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#270 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ANU3TAP2KO54YS3ZBONKBHTTHW5O3ANCNFSM42Q32WMA> .

EdColeman · 2021-04-08T16:06:19Z

The pom for 2.0.1 specifies zookeeper 3.4.14.

For the next release (main branch), the pom currently has ZooKeeper version 3.5.9 and it looks like the accumulo-env.sh has been updated with the zookeeper jar locations ($ZOOKEEPER_HOME/lib/) for later zookeeper releases.

findinpath · 2021-04-08T19:31:02Z

Thank you @EdColeman and @ctubbsii for taking the time to talk with me on the quick start documentation topic.

Update quickstart.md

da40ece

Update quickstart.md to contain explicitly all the necessary commands needed to setup the accumulo environment on a single node. When starting only tserver, some of the commands (e.g. : `createtable mytable` )executed on the accumulo shell will hang undefinitely

EdColeman reviewed Apr 7, 2021

View reviewed changes

ctubbsii requested changes Apr 7, 2021

View reviewed changes

findinpath closed this Apr 8, 2021

Comments

Conversation

findinpath commented Apr 7, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

EdColeman Apr 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

findinpath commented Apr 8, 2021 via email

Uh oh!

EdColeman commented Apr 8, 2021

Uh oh!

findinpath commented Apr 8, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

EdColeman Apr 7, 2021 •

edited

Loading