-
Notifications
You must be signed in to change notification settings - Fork 187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Any documentation? #257
Comments
There is a LizardFS whitepaper available on our site. Currently there is no other documentation but we are happy to answer any of Your questions either here or on IRC channel - |
As referred in #205, some documentation will hopefully be created at github wiki soon. For now, besides the whitepaper, you can also refer to our man pages. |
Comparing to alternative solutions it's kinda frustrating... :( Let's say I have a bunch of VPS and I want to create a distributed FS setup to share storage between them. Is that possible with LizardFS? Assuming that it's possible, how can I setup LizardFS across my servers? What services I need to configure and run? How could I mount LizardFS volumes via fstab? And how can I configure transport-layer encryption for LizardFS? |
Hi Ashald, First of all, I started working on a quick start guide: https://github.com/lizardfs/lizardfs/wiki/Quick-Start-Guide Secondly, how do you wish to share storage between your VPS? Example fstab entry (assuming mfsmaster is in /etc/hosts and default master port): |
@psarna great, thanks! I think it will be very useful for LizardFS to have a quick start guide! Well my servers are run on SSD though, CPU and memory are somehow limited. On the other hand the total disks capacity not so big and I'm planing to use distributed FS to store files from several MBs up to several GBs. I think I'll give it a try and then can post test results and comparison with xtreemfs, if you interested in it. |
@ashald any feedback will be welcome:) |
What alternative solutions? There is only one -- GfarmFS unless you are talking about worthless crap like Ceph or XtreemFS. RozoFS and QuantcastFS are so buggy and immature that they are not to be taken seriously and OrangeFS is not there yet.
Yes.
We need quick start guide -- the one which was just created is not very quick and not too accurate...
You will need to choose IP for master and name it "mfsmaster" in your DNS configuration. Install master, then CGI to monitor it. Then add chunkservers one by one as many as you need. Master does not need much RAM -- it uses only about 3GiB for 5 million files. |
@psarna, thanks very much for making quick start page in Wiki. Remove hardware requirements (IMHO it's best described in standalone page) and avoid suggesting that SSD or 32 GiB of RAM is necessary. It is not true and may scare away our users. Thanks. |
I'm also interested to read about transport-layer encryption in LizardFS. Anyone? |
@ashald remember to use man pages and common sense. Setup of LizardFS is the easiest of all what you called alternatives. ;) |
Haven't heard about it. :)
Yes, I'm talking about these 2. I discarded 1st because it seems that they not provide anything for transport layer security (as I said, I want to encrypt traffic between my servers). And as for the 2nd one - maybe it's not the best solution but it has quickstart guide and I was able to make it working by following that guide. IMO, that's one of the advantages of XtreemFS compared to LizardFS. Though, it can be easily fixed by adding quckstart guide for LizardFS. :) I'm about to try LizardFS and hope I will be able to make it work with that draft of quickstart guide. I will share my feedback afterwards. |
As I said in my previous comment, it's one of my requirements for distributed FS so I'm highly interested in how LizardFS deal with it. |
Yeah, will do. But I also wanted some kind of quickstart so I can try the solution before diving deep into man pages. :) |
I recommend to avoid Ceph and XtreemFS for too many reasons to describe here in details but primarily due to their gross disregard to data integrity, design flaws and intolerable amount of hopeless critical bugs. Besides Ceph and XtreemFS are much slower than LizardFS. |
I have great hopes for LizardFS. :) |
So I was able to do a minimal setup of LizardFS and it's pretty fast, but... It seems that there are no transport layer encryption at all. :( Let's try IPSec... |
Any comments on Gfarmfs?, seems interesting, but also the project seems Regards,
|
To @ciroiriarte:
Great reliable system; keeps metadata in PostgreSQL; optional file-based data integrity.
Dead? You must be kidding, they just released version 2.6.1 earlier this month (March 2015). |
On Wed, 18 Mar 2015 17:28:10 Ashald wrote:
No worries, don't expect too much from a file system... |
So, here is my results. I did basic setup with 4 VPS servers - each of them contains chunk server and client. And one of them hosts master. As I need to encrypt my traffic, I figured that there are 2 possible ways to do that: IPSec or VPN. Since my VPS provider uses OpenVZ with old kernel I cannot use IPSec so I ended up using OpenVPN (just followed this guide https://www.digitalocean.com/community/tutorials/how-to-secure-traffic-between-vps-using-openvpn) and everything is awesome. I even didn't notice performance degradation. Though I have some questions and issues. And I have an issue with master server. From time to time it just dies and clients are hanging up in result.
The Thanks for your help! |
Can you provide more information regarding when/how/why master dies? Do you know exactly when it happens and can provide system logs from that moment? lizardfs-master logs with syslog() so if your system is some Linux with default configuration, master's logs should be found in /var/log/syslog (or any file that syslog's logs go into). |
On Thu, 19 Mar 2015 00:26:33 Ashald wrote:
There is no surprise that XtreemFS is slow because it is written in retarded
Awesome feedback, thanks.
I have been using LizardFS extensively since December and my Master never died Unless you built your own LizardFS packages I suspect this issue may be due to Normally I would recommend to use only official Debian packages from native
and build your own packages (if you know how). Regards, Democracy is a pathetic belief in the collective wisdom of individual |
On Thu, 19 Mar 2015 00:26:33 Ashald wrote:
No, For quotas see mfsrepquota(1). |
Read whitepaper from lizardfs.com I do not understand until the end of the LFS is a fault-tolerant disk. |
On Thu, 19 Mar 2015 04:24:32 andypl78 wrote:
It is quite hard to understand what is your question. Each chunk server is a place for one replica. All HDDs will be (more-or-less) |
@AmokHuginnsson I checked syslog and don't see anything unusual... Is there is a way to increase logging verbosity for master? |
Hm, master just went down one more time. It's the only 2 lines from syslog that it wrote during shutdown
|
This looks like a HTTP request sent to matoml (9419) port. Such a request would typically begin with:
which is:
LizardFS packets begin with two 32-bit big endian values: the first one is packet's type, the second one is packet's length. Your length is 790644820 which would be encoded as |
@marcinsulikowski Thanks for quick reply. I guess it's my fault that I haven't setup authentication and firewall rules. Will do it now and let's see what will happen. As I said, it's the only log messages by master that I see except messages logged on startup. Is there is a way to change logging level to something like debug? Or I need to run master in foreground in order to see more detailed logs? Just in case, I'm running Debian 7 and installing LizardFS from the provided repo (in Downloads section)
and
|
Btw, I see mfspassword option in mfsmount.cfg but I don't such such option in master config. Can I use this in order to setup authentication within master server? Also is it possible to protect CGI stats server with a password? Or I need to use something like nginx/apache in order to do that? And I just tried 'lizardfs-admin -h' (from the quickstart guide) and it doesn't work.
|
|
It turned out that there was a mistake in my configuration and some traffic was going outside of VPN. After proper configuration iperf estimated network performance in 145 mbit/s (unfortunately I cannot change a lot of settings imposed by VPS provider) and LizardFS r/w performance is 10-12 mb/s which I consider as a good (taking into account how limited resources are). Though I think IPSec could give a better performance than VPN. After proper VPN setup I isolated my LizardFS from outside world and from that strange host that was crashing my master. Now master is solid stable. In general I'm happy with LizardFS but I wish there was better documentation. The only issue that I have with LizardFS is that it seems I need to mount meta in order to be able to clear trash (I expected some tool from lizardfs-adm). Also I expected that upon running off empty space trash will be auto-cleaned (starting from older records) but instead I got a segmentation fault on client. Though I wasn't able to reproduce that. Thanks for your help! |
Ah sorry, one more question. Assuming that I have some user and corresponding group with the same uid/gid on all machines that have LizardFS mounter (and on master) is it possible to change owner of all files in LizardFS to this user/group? I'm using this user with its group in mapall option in mfs export but I also want to change ownership of all files in LizardFS to this user/group. |
You can change owner of LizardFS files just like in any other file system, just run
|
Also, this issue is oversized, so I'm going to close it. If you have futher questions, please start a new one. |
Most of MooseFS Documentation is a good fit! |
I'm using tinc to provide an encrypted VPN but I think you're talking about
|
Very nice but it would be even better to have free documentation with license and source for .PDF file. Why not wiki format? Also I'm concerned about installation instructions for Debian -- they are in conflict with the official Debian packages that are available from "testing" and "jessie-backports" suites. Once package is in Debian, the official Debian repositories becomes authoritative hence there is no need for vendor-provided packages any more (or duplication of effort and deviations are inevitable). Please coordinate with Debian package maintainer (yours truly, right here at your service). ;-) |
Erm... Proof me if I'm wrong, but I looked everywhere - official site, github repo and github wiki, google and still wasn't able to find any documentation and/or user guide for LizardFS. Is it hidden somewhere? Or there are no docs at all? How people are getting familiar with it? o_O
The text was updated successfully, but these errors were encountered: