New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compaction #14
Compaction #14
Conversation
The master version of this PR deals with the unified asok/tell interface. Nautilus- are separated. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 9dc07d8)
Rationale can be found in [1]. Point is that EC pools incur a significant performance penalty when dealing with small files and xattr updates. This is because _every_ inode has a corresponding data pool object with backtrace information stored in its xattr. [1] doc/cephfs/createfs.rst Fixes: https://tracker.ceph.com/issues/42450 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 3e0aee5)
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit bf0cf8e) Conflicts: qa/tasks/cephfs/filesystem.py
In the future, we should add the EC data pool as a supplementary data pool but that requires a mount to setup which is awkward in the code here. When cephfs-shell is more widely available, this will be easier. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 6e448f9)
Connection pointer is not helpful. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 9b71bbe) Conflicts: src/mgr/DaemonServer.cc: actually do print the Connection*, Connection& cannot be dumped in Nautilus.
If the mgr is waiting on daemon metadata from the mons, it has no DaemonState associated with the daemon yet. If we try to process this MgrOpen, the metadata sent by the daemon (like its config) will not be recorded. Fixes: https://tracker.ceph.com/issues/43037 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 16a1deb)
Added 'telemetry show-device' command to print a preview of telemetry device report. Added a message at the bottom of 'telemetry show' about 'telemetry show-device' new command. Signed-off-by: Yaarit Hatuka <yaarit@redhat.com> (cherry picked from commit cae87cc) Conflicts: - path: src/pybind/mgr/telemetry/module.py comment: nautilus version of json.dumps() don't have sort_keys arg.
smartctl JSON output contains the device's serial number in two different keys ('serial_number' & 'output'). Serial is now obfuscated in both. Fixes: https://tracker.ceph.com/issues/43939 Signed-off-by: Yaarit Hatuka <yaarit@redhat.com> (cherry picked from commit be1257f)
get_metada() returns 'None' when requesting a missing service, hence trying to access its content fails. Added a check for osd and mgr get_metadata() calls. Fixes: https://tracker.ceph.com/issues/43642 Signed-off-by: Yaarit Hatuka <yaarit@redhat.com> (cherry picked from commit 9e7a0cb)
nautilus: mgr/dashboard: Using wrong identifiers in RGW user/bucket datatables Reviewed-by: Laura Paduano <lpaduano@suse.com> Reviewed-by: Volker Theile <vtheile@suse.com>
nautilus: mgr/dashboard: iSCSI targets not available if any gateway is down (and more...) Reviewed-by: Jason Dillaman <dillaman@redhat.com> Reviewed-by: Laura Paduano <lpaduano@suse.com> Reviewed-by: Tiago Melo <tmelo@suse.com> Reviewed-by: Volker Theile <vtheile@suse.com>
nautilus: mgr/dashboard: add debug mode, and accept expected exception when SSL handshaking Reviewed-by: Kefu Chai <kchai@redhat.com> Reviewed-by: Laura Paduano <lpaduano@suse.com> Reviewed-by: Stephan Müller <smueller@suse.com> Reviewed-by: Tatjana Dehler <tdehler@suse.com>
nautilus: mgr/dashboard: Dashboard can't handle self-signed cert on Grafana API Reviewed-by: Ernesto Puerta <epuertat@redhat.com> Reviewed-by: Laura Paduano <lpaduano@suse.com> Reviewed-by: Volker Theile <vtheile@suse.com>
nautilus: mgr/dashboard: check embedded Grafana dashboard references Reviewed-by: Laura Paduano <lpaduano@suse.com>
nautilus: mgr/dashboard: check if user has config-opt permissions Reviewed-by: Ernesto Puerta <epuertat@redhat.com> Reviewed-by: Laura Paduano <lpaduano@suse.com>
nautilus: mgr/dashboard: disable 'Add Capability' button in rgw user edit Reviewed-by: Laura Paduano <lpaduano@suse.com>
Nothing inherits from PQ. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 761cc0e) Conflicts: src/mds/PurgeQueue.h
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 096a5ca)
This makes the corresponding test not racy. Fixes: https://tracker.ceph.com/issues/16881 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> Conflicts: src/mds/PurgeQueue.cc src/mds/PurgeQueue.h
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 98e3b7e) Note: removed mgr blacklist test which applies to Octopus.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
The squelched error prevented us from knowing connection cleanup doesn't work on py3. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit b45c08b)
Otherwise this raises an exception. Fixes: https://tracker.ceph.com/issues/43113 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 03f8080)
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com> (cherry picked from commit 6714364)
nautilus: mgr/MgrClient: fix open condition Reviewed-by: Josh Durgin <jdurgin@redhat.com>
nautilus: selinux: Allow ceph to read udev db Reviewed-by: Kefu Chai <kchai@redhat.com> Reviewed-by: Boris Ranto <branto@redhat.com> Reviewed-by: Neha Ojha <nojha@redhat.com>
nautilus: kv: fix shutdown vs async compaction Reviewed-by: Neha Ojha <nojha@redhat.com> Reviewed-by: Sage Weil <sage@redhat.com>
nautilus: osd: Diagnostic logging for upmap cleaning Reviewed-by: David Zafman <dzafman@redhat.com>
nautilus: ceph-volume: don't remove vg twice when zapping filestore
nautilus: mgr/DaemonServer: fix 'osd ok-to-stop' for EC pools Reviewed-by: Sage Weil <sage@redhat.com> Reviewed-by: David Zafman <dzafman@redhat.com>
nautilus: qa/suites/rados/multimon/tasks/mon_clock_with_skews: disable ntpd etc Reviewed-by: Kefu Chai <kchai@redhat.com>
nautilus: common/bl: fix the dangling last_p issue. Reviewed-by: Kefu Chai <kchai@redhat.com>
nautilus: ceph-monstore-tool: correct the key for storing mgr_command_descs Reviewed-by: Kefu Chai <kchai@redhat.com>
Add the min_sample lower-bound argument too Signed-off-by: Sage Weil <sage@redhat.com> (cherry picked from commit 7be5c13) Conflicts: had to be backported to enable backporting of ceph#32903 Backport tracker: https://tracker.ceph.com/issues/43873
… hours Telemetry module fetches device metrics which were scraped in the last "telemetry interval"*2 (=48 hours by default) by calling _get_device_metrics() with min_sample. _get_device_metrics() fetches the metrics from omap and breaks on the first one that is older than min_sample. But because it fetched in ascending order (from oldest to newest) it was breaking on the first one it received, if it was older than the interval above. We need to pass min_sample to get_omap_vals() so it will start fetching from that value. Fixes: https://tracker.ceph.com/issues/43837 Signed-off-by: Yaarit Hatuka <yaarit@redhat.com> (cherry picked from commit 5f7e4a9)
Upgrade to 2.8.1 and stable-4.0 respectively Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
nautilus: qa/ceph-ansible: ansible-version and ceph_ansible Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
This was done for octopus in 8283ea9, but not for nautilus Signed-off-by: Neha Ojha <nojha@redhat.com>
nautilus: qa/suites/upgrade/mimic-x/stress-split: fix msgr2 vs nautilus ordering
Caused by backport commit cb48be5 which did not account for the explicit drop of the message reference, only in Nautilus-. Fixes: https://tracker.ceph.com/issues/44245 Fixes: cb48be5 Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
* refs/pull/33498/head: mgr: drop reference to msg on return Reviewed-by: Venky Shankar <vshankar@redhat.com> Reviewed-by: Sage Weil <sage@redhat.com>
Saw a deadlock when deleting lot of subvolumes -- purge threads were stuck in accessing global lock for volume access. This can happen when there is a concurrent remove (which renames and signals the purge threads) and a purge thread is just about to scan the trash directory for entries. For the fix, purge threads fetches entries by accessing the volume in lockless mode. This is safe from functionality point-of-view as the rename and directory scan is correctly handled by the filesystem. Worst case the purge thread would pick up the trash entry on next scan, never leaving a stale trash entry. Signed-off-by: Venky Shankar <vshankar@redhat.com> (cherry picked from commit 808a1ce)
Signed-off-by: Venky Shankar <vshankar@redhat.com> (cherry picked from commit 5ec09a2)
Fixes: http://tracker.ceph.com/issues/44282 Signed-off-by: Venky Shankar <vshankar@redhat.com> (cherry picked from commit 92b2008)
* refs/pull/33526/head: test: verify purge queue w/ large number of subvolumes test: pass timeout argument to mount::wait_for_dir_empty() mgr/volumes: access volume in lockless mode when fetching async job Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
nautilus: mgr/devicehealth: fix telemetry stops sending device reports after 48 hours Reviewed-by: Sage Weil <sage@redhat.com>
If the async threads hit a temporary exception the job is never unregistered and therefore gets skipped by the async threads on subsequent scans. Patrick hit this in nautilus when one of the purge threads hit an exception when trying to log a message. The trash entry was never picked up again by the purge threads. Fixes: http://tracker.ceph.com/issues/44315 Signed-off-by: Venky Shankar <vshankar@redhat.com> (cherry picked from commit 46476ef)
* refs/pull/33569/head: mgr/volumes: unregister job upon async threads exception Reviewed-by: Ramana Raja <rraja@redhat.com> Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Thanks for your pull request. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). The following commits have not yet signed CLA. 3f7f0c7 | mds: skip tell command scrub on multimds The master version of this PR deals with the unified asok/tell Signed-off-by: Patrick Donnelly pdonnell@redhat.com Signed-off-by: Patrick Donnelly pdonnell@redhat.com Rationale can be found in [1]. Point is that EC pools incur a [1] doc/cephfs/createfs.rst Fixes: https://tracker.ceph.com/issues/42450 Signed-off-by: Patrick Donnelly pdonnell@redhat.com Conflicts: In the future, we should add the EC data pool as a supplementary data Signed-off-by: Patrick Donnelly pdonnell@redhat.com Connection pointer is not helpful. Signed-off-by: Patrick Donnelly pdonnell@redhat.com Conflicts: If the mgr is waiting on daemon metadata from the mons, it has no Fixes: https://tracker.ceph.com/issues/43037 Added 'telemetry show-device' command to print a preview of telemetry device report. Signed-off-by: Yaarit Hatuka yaarit@redhat.com Conflicts:
smartctl JSON output contains the device's serial number in two Fixes: https://tracker.ceph.com/issues/43939 get_metada() returns 'None' when requesting a missing service, hence Fixes: https://tracker.ceph.com/issues/43642 nautilus: mgr/dashboard: Using wrong identifiers in RGW user/bucket datatables Reviewed-by: Laura Paduano lpaduano@suse.com nautilus: mgr/dashboard: iSCSI targets not available if any gateway is down (and more...) Reviewed-by: Jason Dillaman dillaman@redhat.com nautilus: mgr/dashboard: add debug mode, and accept expected exception when SSL handshaking Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mgr/dashboard: Dashboard can't handle self-signed cert on Grafana API Reviewed-by: Ernesto Puerta epuertat@redhat.com nautilus: mgr/dashboard: check embedded Grafana dashboard references Reviewed-by: Laura Paduano lpaduano@suse.com nautilus: mgr/dashboard: check if user has config-opt permissions Reviewed-by: Ernesto Puerta epuertat@redhat.com nautilus: mgr/dashboard: disable 'Add Capability' button in rgw user edit Reviewed-by: Laura Paduano lpaduano@suse.com Nothing inherits from PQ. Signed-off-by: Patrick Donnelly pdonnell@redhat.com Conflicts: Signed-off-by: Patrick Donnelly pdonnell@redhat.com This makes the corresponding test not racy. Fixes: https://tracker.ceph.com/issues/16881 Conflicts: Signed-off-by: Patrick Donnelly pdonnell@redhat.com Signed-off-by: Patrick Donnelly pdonnell@redhat.com The squelched error prevented us from knowing connection cleanup doesn't Signed-off-by: Patrick Donnelly pdonnell@redhat.com Otherwise this raises an exception. Fixes: https://tracker.ceph.com/issues/43113 Signed-off-by: Patrick Donnelly pdonnell@redhat.com nautilus: mgr/MgrClient: fix open condition Reviewed-by: Josh Durgin jdurgin@redhat.com nautilus: selinux: Allow ceph to read udev db Reviewed-by: Kefu Chai kchai@redhat.com nautilus: kv: fix shutdown vs async compaction Reviewed-by: Neha Ojha nojha@redhat.com nautilus: osd: Diagnostic logging for upmap cleaning Reviewed-by: David Zafman dzafman@redhat.com nautilus: osd: Use physical ratio for nearfull (doesn't include backfill resserve) Reviewed-by: Neha Ojha nojha@redhat.com nautilus: osd/OSD: enhance osd numa affinity compatibility Reviewed-by: Kefu Chai kchai@redhat.com nautilus: osd/PeeringState.cc: skip peer_purged when discovering all missing Reviewed-by: Sage Weil sage@redhat.com nautilus: qa/suites/rados/thrash: force normal pg log length with cache tiering Reviewed-by: Sage Weil sage@redhat.com nautilus: osd/PeeringState.cc: don't let num_objects become negative Reviewed-by: Neha Ojha nojha@redhat.com nautilus: core: osd/OSDMap: health alert for non-power-of-two pg_num Reviewed-by: Neha Ojha nojha@redhat.com Fixes: https://tracker.ceph.com/issues/36728 Fallback to predefined paths for backward compatibility. Signed-off-by: Shyukri Shyukriev shshyukriev@suse.com Users complained[1] the error message isn't clear, and they thought [1] rook/rook#4547 Fixes: https://tracker.ceph.com/issues/43442 Fixes: bb4de1a Signed-off-by: Jan Fajerski jfajerski@suse.com Conflicts: Signed-off-by: dongdong tao dongdong.tao@canonical.com Actual data size depending on osds_per_device needs to be calculated here. Otherwise, if osds_per_device is greater than 1, ceph-volume will allocate 100% of the device to the first osd and then fail to create the LV for the second because the volume group is already full. Fixes: https://tracker.ceph.com/issues/39442 Conflicts: I've removed This allows for a symlink to be passed to
which makes it possible to use Fixes: https://tracker.ceph.com/issues/43497 A lot of our functionality depends on the mgr now. If there is a cluster Fixes: https://tracker.ceph.com/issues/38942 On Linux system it is possible to set 64 character length hostname when With the current code on host with 64 long hostname osd during start Signed-off-by: Michal Skalski mskalski@juniper.net Fixes: https://tracker.ceph.com/issues/42119 Signed-off-by: Richard Bai(白学余) baixueyu@inspur.com (cherry picked from commit ff2f4af) The session gets put as result of the set_session call in the next Fixes: https://tracker.ceph.com/issues/38345 Signed-off-by: Brad Hubbard bhubbard@redhat.com Introduced by 08fcf01, which activated Fixes: https://tracker.ceph.com/issues/43892 This commit adds per-pool pg states metrics Signed-off-by: Aleksei Zakharov zakharov.a.g@yandex.ru If we have all other stats by pool, it's better to have total Signed-off-by: Aleksei Zakharov zakharov.a.g@yandex.ru Update pg metrics descriptions to show that we have per Signed-off-by: Aleksei Zakharov zakharov.a.g@yandex.ru Signed-off-by: Aleksei Zakharov zaharov@selectel.ru Also, revert table formatting. Signed-off-by: Aleksei Zakharov zaharov@selectel.ru If an entity name (id.type) has more than one dot (i.e. 'type' has Fixes: https://tracker.ceph.com/issues/43313 Using raw_used_rate to calculate the pool_pg_target results in too Signed-off-by: Dan van der Ster daniel.vanderster@cern.ch Signed-off-by: dongdong tao dongdong.tao@canonical.com Conflicts:
nautilus: test: Fix wait_for_state() to wait for a PG to get into a state Reviewed-by: Neha Ojha nojha@redhat.com nautilus: os/bluestore/BlueStore.cc: set priorities for compression stats Reviewed-by: Igor Fedotov ifedotov@suse.com nautilus: common: fix deadlocky inflight op visiting in OpTracker. Reviewed-by: Kefu Chai kchai@redhat.com nautilus: common/util: use ifstream to read from /proc files Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mon/MgrMonitor.cc: add always_on_modules to the output of "ceph mgr module ls" Reviewed-by: David Zafman dzafman@redhat.com nautilus: common/config: update values when they are removed via mon Reviewed-by: Sage Weil sage@redhat.com nautilus: common/options: bluestore 4k min_alloc_size for SSD Reviewed-by: Sage Weil sage@redhat.com nautilus: ceph-volume: minor clean-up of "simple scan" subcommand help nautilus: ceph-volume/test: patch VolumeGroups Fixes: https://tracker.ceph.com/issues/42777 Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com Conflicts: When batch is called non-interactively and a user explicitly specifies, Fixes: https://tracker.ceph.com/issues/43105 Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: ceph-volume: assume msgrV1 for all branches containing mimic nautilus: ceph-volume: util: look for executable in $PATH nautilus: ceph-volume/lvm/activate.py: clarify error message: fsid refers to osd_fsid nautilus: ceph-volume: use correct extents if using db-devices and >1 osds_per_device nautilus: ceph-volume: fix the integer overflow nautilus: ceph-volume: Dereference symlink in lvm list nautilus: ceph-volume: import mock.mock instead of unittest.mock (py2) nautilus: ceph-volume: make get_devices fs location independent nautilus: ceph-volume/batch: fail on filtered devices when non-interactive This changes create_lv so one can pass the desired device and either a Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com Add option to pass raw physical devices everywhere, restructure a little Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: ceph-volume: allow raw block devices everywhere Simply calls lvchange -an to deactivate a logical volume. Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com Thsi unmounts a path if and only if it's a tmpfs mount. Signed-off-by: Jan Fajerski jfajerski@suse.com This new subcommand unmounts and OSDs tmpfs mount and closes crypt Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: mds: reject sessionless messages Reviewed-by: Ramana Raja rraja@redhat.com nautilus: MDSMonitor: warn if a new file system is being created with an EC default data pool Reviewed-by: Ramana Raja rraja@redhat.com nautilus: mds: reject forward scrubs when cluster has multiple active MDS (more than one rank) Reviewed-by: Ramana Raja rraja@redhat.com nautilus: mds: fix revoking caps after after stale->resume circle Reviewed-by: Ramana Raja rraja@redhat.com nautilus: cephfs-journal-tool: fix crash and usage Reviewed-by: Ramana Raja rraja@redhat.com nautilus: mds: note client features when rejecting client Reviewed-by: Ramana Raja rraja@redhat.com nautilus: client: disallow changing fuse_default_permissions option at runtime Reviewed-by: Ramana Raja rraja@redhat.com nautilus: ceph-volume: lvm deactivate command Remove the --all flag until its actually implemented. Fixes: https://tracker.ceph.com/issues/43330 Signed-off-by: Jan Fajerski jfajerski@suse.com Conflicts: nautilus: mon: print FSMap regardless of file system count Reviewed-by: Ramana Raja rraja@redhat.com Filters can be passed to these commands by using option '-S'. Signed-off-by: Rishabh Dave ridave@redhat.com nautilus: ceph-volume: add methods to pass filters to pvs, vgs and lvs commands nautilus: ceph-volume: lvm/deactivate: add unit tests, remove --all Signed-off-by: Yuri Weinstein yweinste@redhat.com qa/tests: added client-upgrade-nautilus suite to be used on octopus … Reviewed-by: Neha Ojha nojha@redhat.com The Size class can now parse strings and has support for arithmetic Signed-off-by: Jan Fajerski jfajerski@suse.com This adds options to size to-be-created LVs in the prepare and create Fixes: https://tracker.ceph.com/issues/43299 Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com This was introduced in ceph#32242 Signed-off-by: Jan Fajerski jfajerski@suse.com Fixes: https://tracker.ceph.com/issues/43844 Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: pybind/mgr/volumes: idle connection drop is not working Reviewed-by: Ramana Raja rraja@redhat.com Add a new Python binding equivalent to lstat so that information about Signed-off-by: Rishabh Dave ridave@redhat.com Fixes: http://tracker.ceph.com/issues/42646
Signed-off-by: Jos Collin jcollin@redhat.com
Signed-off-by: Jos Collin jcollin@redhat.com
Signed-off-by: Jos Collin jcollin@redhat.com Fixes: https://tracker.ceph.com/issues/42872 Signed-off-by: Jos Collin jcollin@redhat.com There are only 2 cases which needs cleanup:
In either case, we could do a 'volume rm'. Signed-off-by: Jos Collin jcollin@redhat.com ... which was not fully implemented anyway, so just remove the Signed-off-by: Venky Shankar vshankar@redhat.com introduce with statement in rmtree. This change Signed-off-by: Jos Collin jcollin@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Joshua Schmid jschmid@suse.de Instead of checking if the --yes-i-really-mean-it Fixes: https://tracker.ceph.com/issues/42931
Signed-off-by: Jos Collin jcollin@redhat.com This is fixed already. Signed-off-by: Jos Collin jcollin@redhat.com helpers for various filesystem querying routines, utils Signed-off-by: Venky Shankar vshankar@redhat.com unlike existing subvolume specification, this is just a Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com subvolume base class implements common routines/helpers and typically, a subvolume would be implemented by subclassing the with this commit, current subvolume topology is maintained. but Signed-off-by: Venky Shankar vshankar@redhat.com create_subvolume() creates a subvolume with the max version known Signed-off-by: Venky Shankar vshankar@redhat.com apart from the new way of provisioning subvolumes, this makes heavy this change classifies volumes, groups and subvolumes to be treated
this way, lot of redundant checks such as verifying if a volume or Signed-off-by: Venky Shankar vshankar@redhat.com Fixes: https://tracker.ceph.com/issues/43349 this was lying around post versioning changes. Signed-off-by: Venky Shankar vshankar@redhat.com Fixes: http://tracker.ceph.com/issues/43645 ... and fetch creation state from state machine table. Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com This will be required when creating a clone as the clone would Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com This also makes Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com This fix is only needed in nautilus, and the issue File "/usr/share/ceph/mgr/volumes/fs/async_cloner.py", line 114, in cptree The issue wasn't observed in master/octopus teuthology Signed-off-by: Ramana Raja rraja@redhat.com Fix the following issue seen while upstream teuthology testing, This issue is not observed in master/octopus, as it only supports Signed-off-by: Ramana Raja rraja@redhat.com nautilus: ceph-volume: add sizing arguments to prepare nautilus: ceph-volume: batch bluestore fix create_lvs call get_pvs, get_vgs and get_lvs must accept tags and filter volumes based Signed-off-by: Rishabh Dave ridave@redhat.com These convenience methods shortens following phrase to lvs = get_lvs() These methods do the same things as above phrase internall. Rewrite Signed-off-by: Rishabh Dave ridave@redhat.com The method determines whether given LV is managed by Ceph or not. Signed-off-by: Rishabh Dave ridave@redhat.com Get rid of duplicate and redundant code and use get_lvs, get_vgs and Signed-off-by: Rishabh Dave ridave@redhat.com Signed-off-by: Rishabh Dave ridave@redhat.com lisitng.py doesn't call api.Volumes anymore. Therefore, this test is Signed-off-by: Rishabh Dave ridave@redhat.com 17957d9 introduced a regression in When passing a vg/lv path for generating a single report, it fails Fixes: https://tracker.ceph.com/issues/43969 Signed-off-by: Guillaume Abrioux gabrioux@redhat.com Also drop the sep argument from get_lvs and siblings, unused. Signed-off-by: Jan Fajerski jfajerski@suse.com A single report on a non-lvm device now works. Fixes: https://tracker.ceph.com/issues/44009 Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: ceph-volume: refactor listing.py + fixes When using vg/lv, this function throws an error like following:
using Fixes: https://tracker.ceph.com/issues/43970 Signed-off-by: Guillaume Abrioux gabrioux@redhat.com We don't want to generate this log when a call to Signed-off-by: Guillaume Abrioux gabrioux@redhat.com This adds two properties available_[lvm,raw] to device (and thus inventory). Fixes: https://tracker.ceph.com/issues/43400 When rerunning ceph-volume lvm create on a device already prepared and This is a regression introduced by bb4de1a Fixes: https://tracker.ceph.com/issues/43981 Signed-off-by: Guillaume Abrioux gabrioux@redhat.com This commit adds a new unit test Signed-off-by: Guillaume Abrioux gabrioux@redhat.com We need to pay attention to account for CRUSH_ITEM_NONE entries in the Fixes: https://tracker.ceph.com/issues/43151 Conflicts:
Signed-off-by: Sage Weil sage@redhat.com Make sure PGs peer (simply flushing state to mon isn't enough). Fixes: https://tracker.ceph.com/issues/43721 mgr/volumes: misc fix and feature enhancements Reviewed-by: Venky Shankar vshankar@redhat.com nautilus: mount.ceph: give a hint message when no mds is up or cluster is laggy Reviewed-by: Jeff Layton jlayton@redhat.com nautilus: mount.ceph: remove arbitrary limit on size of name= option Reviewed-by: Ilya Dryomov idryomov@redhat.com nautilus: cephfs: client: Add is_dir() check before changing directory Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: mgr: "mds metadata" to setup new DaemonState races with fsmap Reviewed-by: Ramana Raja rraja@redhat.com nautilus: mds: fix assert(omap_num_objs <= MAX_OBJECTS) of OpenFileTable Reviewed-by: Ramana Raja rraja@redhat.com nautilus: cephfs: qa: ignore slow ops for ffsb workunit Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: cephfs: qa: save MDS epoch barrier Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: mds/OpenFileTable: match MAX_ITEMS_PER_OBJ to osd_deep_scrub_large_omap_object_key_threshold Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: RuntimeError: Files in flight high water is unexpectedly low (0 / 6) Reviewed-by: Ramana Raja rraja@redhat.com nautilus: rgw: update the hash source for multipart entries during resharding Reviewed-by: J. Eric Ivancich ivancich@redhat.com nautilus: cephfs: qa: ignore trimmed cache items for dead cache drop Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: ceph-volume: skip osd creation when already done This is a regression introduced by 634a709 The lvm batch command fails to prepare the OSDs on the created LV. $ lvs -o lv_tags --noheadings Since we call is_ceph_device which returns True if the ceph.osd_id LVM When the tag value is set to 'null' then we can consider that the device Closes: https://tracker.ceph.com/issues/44069 Signed-off-by: Dimitri Savineau dsavinea@redhat.com Signed-off-by: Jan Fajerski jfajerski@suse.com Fixes: https://tracker.ceph.com/issues/44099 Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: ceph-volume: fix has_bluestore_label() function nautilus: ceph-volume: finer grained availability notion in inventory. nautilus: ceph-volume: fix is_ceph_device for lvm batch nautilus: ceph-volume: use get_device_vgs in has_common_vg Fixes: https://tracker.ceph.com/issues/43889 The skewed clock makes some mons miss elections. Signed-off-by: Sage Weil sage@redhat.com Fixes: https://tracker.ceph.com/issues/43646 Conflicts: when ceph-mon starts, it checks to see if it's listed in the monmap, if
then bails out. normally, the monitor will try to rename its name in in this change, user is allowed to specify the mon-ids in command line Signed-off-by: Kefu Chai kchai@redhat.com Fixes: https://tracker.ceph.com/issues/43582 to note that we also need to add mgr's key to monitor's keyring Signed-off-by: Kefu Chai kchai@redhat.com nautilus: mon/ConfigMonitor: fix handling of NO_MON_UPDATE settings Reviewed-by: Nathan Cutler ncutler@suse.com nautilus: crush/CrushWrapper: behave with empty weight vector Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mgr/pg_autoscaler: default to pg_num[_min] = 32 Reviewed-by: Neha Ojha nojha@redhat.com nautilus: mon: elector: return after triggering a new election Reviewed-by: Josh Durgin jdurgin@redhat.com nautilus: mgr/telemetry: fix device serial number anonymization Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mgr/telemetry: anonymizing smartctl report itself Reviewed-by: Sage Weil sage@redhat.com nautilus: mon/MgrMonitor.cc: warn about missing mgr in a cluster with osds Reviewed-by: Kefu Chai kchai@redhat.com nautilus: osd: Allow 64-char hostname to be added as the "host" in CRUSH Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mon/ConfigMonitor: only propose if leader Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mgr/prometheus: report per-pool pg states Reviewed-by: Jan Fajerski jfajerski@suse.com nautilus: mgr/telemetry: split entity_name only once (handle ids with dots) Reviewed-by: Sage Weil sage@redhat.com nautilus: mgr/pg_autoscaler: calculate pool_pg_target using pool size Reviewed-by: Kefu Chai kchai@redhat.com nautilus: mon/Session: only index osd ids >= 0 nautilus: mgr/telemetry: check get_metadata return val Reviewed-by: David Zafman dzafman@redhat.com nautilus: mon: Don't put session during feature change Reviewed-by: David Zafman dzafman@redhat.com nautilus: rgw_file: avoid string::front() on empty path Reviewed-by: Casey Bodley cbodley@redhat.com nautilus: rgw: maybe coredump when reload operator happened Reviewed-by: Casey Bodley cbodley@redhat.com nautilus: rgw: fix one part of the bulk delete(RGWDeleteMultiObj_ObjStore_S3) fails but no error messages Reviewed-by: Casey Bodley cbodley@redhat.com Fixes: https://tracker.ceph.com/issues/44125 Signed-off-by: Jan Fajerski jfajerski@suse.com Fixes: https://tracker.ceph.com/issues/43844 Signed-off-by: Jan Fajerski jfajerski@suse.com nautilus: ceph-volume: avoid calling zap_lv with a LV-less VG nautilus: ceph-volume: batch bluestore fix create_lvs call Fixes: https://tracker.ceph.com/issues/44148 Signed-off-by: Jan Fajerski jfajerski@suse.com Signed-off-by: Jan Fajerski jfajerski@suse.com Fixes: https://tracker.ceph.com/issues/44149 nautilus: ceph-volume: pass journal_size as Size not string nautilus: ceph-volume: don't remove vg twice when zapping filestore nautilus: mgr/DaemonServer: fix 'osd ok-to-stop' for EC pools Reviewed-by: Sage Weil sage@redhat.com nautilus: qa/suites/rados/multimon/tasks/mon_clock_with_skews: disable ntpd etc Reviewed-by: Kefu Chai kchai@redhat.com nautilus: common/bl: fix the dangling last_p issue. Reviewed-by: Kefu Chai kchai@redhat.com nautilus: ceph-monstore-tool: correct the key for storing mgr_command_descs Reviewed-by: Kefu Chai kchai@redhat.com Add the min_sample lower-bound argument too Signed-off-by: Sage Weil sage@redhat.com Telemetry module fetches device metrics which were scraped in the last Fixes: https://tracker.ceph.com/issues/43837 Upgrade to 2.8.1 and stable-4.0 respectively Signed-off-by: Brad Hubbard bhubbard@redhat.com nautilus: qa/ceph-ansible: ansible-version and ceph_ansible Reviewed-by: Yuri Weinstein yweinste@redhat.com This was done for octopus in 8283ea9, Signed-off-by: Neha Ojha nojha@redhat.com nautilus: qa/suites/upgrade/mimic-x/stress-split: fix msgr2 vs nautilus ordering Caused by backport commit cb48be5 which Fixes: https://tracker.ceph.com/issues/44245
Reviewed-by: Venky Shankar vshankar@redhat.com Saw a deadlock when deleting lot of subvolumes -- purge threads were For the fix, purge threads fetches entries by accessing the volume Signed-off-by: Venky Shankar vshankar@redhat.com Signed-off-by: Venky Shankar vshankar@redhat.com Fixes: http://tracker.ceph.com/issues/44282
Reviewed-by: Patrick Donnelly pdonnell@redhat.com nautilus: mgr/devicehealth: fix telemetry stops sending device reports after 48 hours Reviewed-by: Sage Weil sage@redhat.com If the async threads hit a temporary exception the job is Patrick hit this in nautilus when one of the purge threads Fixes: http://tracker.ceph.com/issues/44315
Reviewed-by: Ramana Raja rraja@redhat.com 📝 Please access here to sign the CLA. It may take a couple minutes for the CLA signature to be fully registered; after that, please reply here with a new comment: /check-cla to verify. Thanks.
|
Checklist
Show available Jenkins commands
jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox