The Ceph Blog

Featured Post

v0.84 released

The next Ceph development release is here! This release contains several meaty items, including some MDS improvements for journaling, the ability to remove the CephFS file system (and name it), several mon cleanups with tiered pools, several OSD performance branches, a new “read forward” RADOS caching mode, a prototype Kinetic OSD backend, and various radosgw improvements (especially with the new standalone civetweb frontend). And there are a zillion OSD bug fixes. Things are looking pretty good for the Giant release that is coming up in the next month.

UPGRADING

  • The *_kb perf counters on the monitor have been removed. These are replaced with a new set of *_bytes counters (e.g., cluster_osd_kb is replaced by cluster_osd_bytes).
  • The rd_kb and wr_kb fields in the JSON dumps for pool stats (accessed via the ‘ceph df detail -f json-pretty’ and related commands) have been replaced with corresponding *_bytes fields. Similarly, the ‘total_space’, ‘total_used’, and ‘total_avail’ fields are replaced with ‘total_bytes’, ‘total_used_bytes’, and ‘total_avail_bytes’ fields.
  • The ‘rados df –format=json’ output ‘read_bytes’ and ‘write_bytes’ fields were incorrectly reporting ops; this is now fixed.
  • The ‘rados df –format=json’ output previously included ‘read_kb’ and ‘write_kb’ fields; these have been removed. Please use ‘read_bytes’ and ‘write_bytes’ instead (and divide by 1024 if appropriate).

NOTABLE CHANGES

  • ceph-conf: flush log on exit (Sage Weil)
  • ceph-dencoder: refactor build a bit to limit dependencies (Sage Weil, Dan Mick)
  • ceph.spec: split out ceph-common package, other fixes (Sandon Van Ness)
  • ceph_test_librbd_fsx: fix RNG, make deterministic (Ilya Dryomov)
  • cephtool: refactor and improve CLI tests (Joao Eduardo Luis)
  • client: improved MDS session dumps (John Spray)
  • common: fix dup log messages (#9080, Sage Weil)
  • crush: include new tunables in dump (Sage Weil)
  • crush: only require rule features if the rule is used (#8963, Sage Weil)
  • crushtool: send output to stdout, not stderr (Wido den Hollander)
  • fix i386 builds (Sage Weil)
  • fix struct vs class inconsistencies (Thorsten Behrens)
  • hadoop: update hadoop tests for Hadoop 2.0 (Haumin Chen)
  • librbd, ceph-fuse: reduce cache flush overhead (Haomai Wang)
  • librbd: fix error path when opening image (#8912, Josh Durgin)
  • mds: add file system name, enabled flag (John Spray)
  • mds: boot refactor, cleanup (John Spray)
  • mds: fix journal conversion with standby-replay (John Spray)
  • mds: separate inode recovery queue (John Spray)
  • mds: session ls, evict commands (John Spray)
  • mds: submit log events in async thread (Yan, Zheng)
  • mds: use client-provided timestamp for user-visible file metadata (Yan, Zheng)
  • mds: validate journal header on load and save (John Spray)
  • misc build fixes for OS X (John Spray)
  • misc integer size cleanups (Kevin Cox)
  • mon: add get-quota commands (Joao Eduardo Luis)
  • mon: do not create file system by default (John Spray)
  • mon: fix ‘ceph df’ output for available space (Xiaoxi Chen)
  • mon: fix bug when no auth keys are present (#8851, Joao Eduardo Luis)
  • mon: fix compat version for MForward (Joao Eduardo Luis)
  • mon: restrict some pool properties to tiered pools (Joao Eduardo Luis)
  • msgr: misc locking fixes for fast dispatch (#8891, Sage Weil)
  • osd: add ‘dump_reservations’ admin socket command (Sage Weil)
  • osd: add READFORWARD caching mode (Luis Pabon)
  • osd: add header cache for KeyValueStore (Haomai Wang)
  • osd: add prototype KineticStore based on Seagate Kinetic (Josh Durgin)
  • osd: allow map cache size to be adjusted at runtime (Sage Weil)
  • osd: avoid refcounting overhead by passing a few things by ref (Somnath Roy)
  • osd: avoid sharing PG info that is not durable (Samuel Just)
  • osd: clear slow request latency info on osd up/down (Sage Weil)
  • osd: fix PG object listing/ordering bug (Guang Yang)
  • osd: fix PG stat errors with tiering (#9082, Sage Weil)
  • osd: fix bug with long object names and rename (#8701, Sage Weil)
  • osd: fix cache full -> not full requeueing (#8931, Sage Weil)
  • osd: fix gating of messages from old OSD instances (Greg Farnum)
  • osd: fix memstore bugs with collection_move_rename, lock ordering (Sage Weil)
  • osd: improve locking for KeyValueStore (Haomai Wang)
  • osd: make tiering behave if hit_sets aren’t enabled (Sage Weil)
  • osd: mark pools with incomplete clones (Sage Weil)
  • osd: misc locking fixes for fast dispatch (Samuel Just, Ma Jianpeng)
  • osd: prevent old rados clients from using tiered pools (#8714, Sage Weil)
  • osd: reduce OpTracker overhead (Somnath Roy)
  • osd: set configurable hard limits on object and xattr names (Sage Weil, Haomai Wang)
  • osd: trim old EC objects quickly; verify on scrub (Samuel Just)
  • osd: work around GCC 4.8 bug in journal code (Matt Benjamin)
  • rados bench: fix arg order (Kevin Dalley)
  • rados: fix {read,write}_ops values for df output (Sage Weil)
  • rbd: add rbdmap pre- and post post- hooks, fix misc bugs (Dmitry Smirnov)
  • rbd: improve option default behavior (Josh Durgin)
  • rgw: automatically align writes to EC pool (#8442, Yehuda Sadeh)
  • rgw: fix crash on swift CORS preflight request (#8586, Yehuda Sadeh)
  • rgw: fix memory leaks (Andrey Kuznetsov)
  • rgw: fix multipart upload (#8846, Silvain Munaut, Yehuda Sadeh)
  • rgw: improve -h (Abhishek Lekshmanan)
  • rgw: improve delimited listing of bucket, misc fixes (Yehuda Sadeh)
  • rgw: misc civetweb fixes (Yehuda Sadeh)
  • rgw: powerdns backend for global namespaces (Wido den Hollander)
  • systemd: initial systemd config files (Federico Simoncelli)

GETTING CEPH

Earlier Posts

v0.67.10 Dumpling released

This stable update release for Dumpling includes primarily fixes for RGW, including several issues with bucket listings and a potential data corruption problem when multiple multi-part uploads race. There is also some throttling capability added in the OSD for scrub that can mitigate the performance impact on production clusters.

We recommend that all Dumpling users upgrade at their convenience.

NOTABLE CHANGES

  • ceph-disk: partprobe befoere settle, fixing dm-crypt (#6966, Eric Eastman)
  • librbd: add invalidate cache interface (Josh Durgin)
  • librbd: close image if remove_child fails (Ilya Dryomov)
  • librbd: fix potential null pointer dereference (Danny Al-Gaaf)
  • librbd: improve writeback checks, performance (Haomai Wang)
  • librbd: skip zeroes when copying image (#6257, Josh Durgin)
  • mon: fix rule(set) check on ‘ceph pool set … crush_ruleset …’ (#8599, John Spray)
  • mon: shut down if mon is removed from cluster (#6789, Joao Eduardo Luis)
  • osd: fix filestore perf reports to mon (Sage Weil)
  • osd: force any new or updated xattr into leveldb if E2BIG from XFS (#7779, Sage Weil)
  • osd: lock snapdir object during write to fix race with backfill (Samuel Just)
  • osd: option sleep during scrub (Sage Weil)
  • osd: set io priority on scrub and snap trim threads (Sage Weil)
  • osd: ‘status’ admin socket command (Sage Weil)
  • rbd: tolerate missing NULL terminator on block_name_prefix (#7577, Dan Mick)
  • rgw: calculate user manifest (#8169, Yehuda Sadeh)
  • rgw: fix abort on chunk read error, avoid using extra memory (#8289, Yehuda Sadeh)
  • rgw: fix buffer overflow on bucket instance id (#8608, Yehuda Sadeh)
  • rgw: fix crash in swift CORS preflight request (#8586, Yehuda Sadeh)
  • rgw: fix implicit removal of old objects on object creation (#8972, Patrycja Szablowska, Yehuda Sadeh)
  • rgw: fix MaxKeys in bucket listing (Yehuda Sadeh)
  • rgw: fix race with multiple updates to a single multipart object (#8269, Yehuda Sadeh)
  • rgw: improve bucket listing with delimiter (Yehuda Sadeh)
  • rgw: include NextMarker in bucket listing (#8858, Yehuda Sadeh)
  • rgw: return error early on non-existent bucket (#7064, Yehuda Sadeh)
  • rgw: set truncation flag correctly in bucket listing (Yehuda Sadeh)
  • sysvinit: continue starting daemons after pre-mount error (#8554, Sage Weil)

For more detailed information, see the complete changelog.

Twitter || Facebook || Google+ || Lists/IRC


openstack-logo512

Voting for submissions is well underway for the next OpenStack summit, and this one is shaping up to be another great place to talk about Ceph. Almost fifty talks are currently available for voting on the OpenStack site! Ceph has been steadily gaining popularity in the OpenStack world, especially if you take a look at recent user survey results.

Start Voting!

The even better part is there are only three Ceph talks that were submitted by the former Inktank crew, which means there are a ton of organic submissions available. If you are interested in taking a peek at the Inktank crew specifically, they can be found at the following URLs:

read more…

v0.83 released

Another Ceph development release! This has been a longer cycle, so there has been quite a bit of bug fixing and stabilization in this round. There is also a bunch of packaging fixes for RPM distros (RHEL/CentOS, Fedora, and SUSE) and for systemd. We’ve also added a new librados-striper library from Sebastien Ponce that provides a generic striping API for applications to code to.

UPGRADING

  • The experimental keyvaluestore-dev OSD backend had an on-disk format change that prevents existing OSD data from being upgraded. This affects developers and testers only.
  • mon-specific and osd-specific leveldb options have been removed. From this point onward users should use ‘leveldb_‘ generic options and add the options in the appropriate sections of their configuration files. Monitors will still maintain the following monitor-specific defaults:

    leveldb_write_buffer_size = 32*1024*1024 = 33554432 // 32MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = “”

    OSDs will still maintain the following osd-specific defaults:

    leveldb_log = “”

read more…

v0.80.5 Firefly released

This release fixes a few important bugs in the radosgw and fixes several packaging and environment issues, including OSD log rotation, systemd environments, and daemon restarts on upgrade.

We recommend that all v0.80.x Firefly users upgrade, particularly if they are using upstart, systemd, or radosgw.

NOTABLE CHANGES

  • ceph-dencoder: do not needlessly link to librgw, librados, etc. (Sage Weil)
  • do not needlessly link binaries to leveldb (Sage Weil)
  • mon: fix mon crash when no auth keys are present (#8851, Joao Eduaro Luis)
  • osd: fix cleanup (and avoid occasional crash) during shutdown (#7981, Sage Weil)
  • osd: fix log rotation under upstart (Sage Weil)
  • rgw: fix multipart upload when object has irregular size (#8846, Yehuda Sadeh, Sylvain Munaut)
  • rgw: improve bucket listing S3 compatibility (#8858, Yehuda Sadeh)
  • rgw: improve delimited bucket listing (Yehuda Sadeh)
  • rpm: do not restart daemons on upgrade (#8849, Alfredo Deza)

For more detailed information, see the complete changelog.

GETTING CEPH

Lots Going on with Ceph

While we knew that after the acquisition of Inktank life would accelerate again, it seems like the Ceph community is quickly approaching ludicrous speed, and it shows no sign of slowing down. We have had some amazing participation in the various endeavors, but it would be completely understandable if you had missed something amidst the avalanche of Ceph-related news.

Just in case something flew by you, I wanted to take a few minutes to recap some of the highlights of recent history. If you would like to keep a closer eye on what has been going on feel free to follow one (or all!) of our informational feeds:

Twitter || Facebook || Google+ || Lists/IRC

ludicrous speed

read more…

OSCON has arrived (although if you came in for the Ceph tutorial session that’s old news to you)! As a part of our participation in OSCON, and as a way to celebrate the fact that Ceph turned 10 years old this year, we have decided to have our party be a distributed one.

We would love to have our users send us pictures of whatever they might be doing to celebrate the 10th anniversary of ceph. Are you busy racking in 3 petabytes of storage to add to your Ceph cluster? Did you create a culinary masterpiece in the form of a squid cake? Are you sitting alone in the middle of the OSCON show floor with a party hat and a cupcake? We want to see! As thanks for sharing your birthday celebration efforts with the community we’ll be picking one lucky winner to receive a desktop Ceph test cluster built by our very own Mark Nelson (Ceph performance guru extraordinaire!).

While the cluster wont break any speed records, and only a madman would use it for anything even remotely production ready, it will give you a Ceph cluster to play with and can sit on your desk to invoke feelings of envy in your coworkers. For more details check out the (new) contest page on the Ceph wiki. If you have any questions please contact me or just tweet @Ceph. Thanks, and happy birthday to Ceph!

scuttlemonkey out

Ceph is coming back to OSCON next week (July 20-24 in Portland, OR). The difference however, is that this year we need two digits to tell people how old we are. Stop by for some mild festivities at the Ceph booth (P2) as we share cupcakes, and t-shirts that salute the hard work of all our committers since day one.

Photo credit: picphotos.net

Originally we had much bigger plans for a guerrilla show floor birthday spectacle, but summoning Cthulhu just seemed like far too much work and could have been potentially disruptive to our fellow attendees. So instead we’re just going to enjoy hanging out with our community and sharing memories and calories.

read more…

v0.80.4 Firefly released

This Firefly point release fixes an potential data corruption problem when ceph-osd daemons run on top of XFS and service Firefly librbd clients. A recently added allocation hint that RBD utilizes triggers an XFS bug on some kernels (Linux 3.2, and likely others) that leads to data corruption and deep-scrub errors (and inconsistent PGs). This release avoids the situation by disabling the allocation hint until we can validate which kernels are affected and/or are known to be safe to use the hint on.

We recommend that all v0.80.x Firefly users urgently upgrade, especially if they are using RBD.

NOTABLE CHANGES

  • osd: disable XFS extsize hint by default (#8830, Samuel Just)
  • rgw: fix extra data pool default name (Yehuda Sadeh)

For more detailed information, see the complete changelog.

GETTING CEPH

 

v0.80.3 Firefly released

V0.80.3 FIREFLY

This is the third Firefly point release. It includes a single fix for a radosgw regression that was discovered in v0.80.2 right after it was released.

We recommand that all v0.80.x Firefly users upgrade.

NOTABLE CHANGES

  • radosgw: fix regression in manifest decoding (#8804, Sage Weil)

For more detailed information, see the complete changelog.

V0.80.2 FIREFLY

This is the second Firefly point release. It contains a range of important fixes, including several bugs in the OSD cache tiering, some compatibility checks that affect upgrade situations, several radosgw bugs, and an irritating and unnecessary feature bit check that prevents older clients from communicating with a cluster with any erasure coded pools.

One someone large change in this point release is that the ceph RPM package is separated into a ceph and ceph-common package, similar to Debian. The ceph-common package contains just the client libraries without any of the server-side daemons.

We recommend that all v0.80.x Firefly users skip this release and use v0.80.3.

NOTABLE CHANGES

  • ceph-disk: better debug logging (Alfredo Deza)
  • ceph-disk: fix preparation of OSDs with dmcrypt (#6700, Stephen F Taylor)
  • ceph-disk: partprobe on prepare to fix dm-crypt (#6966, Eric Eastman)
  • do not require ERASURE_CODE feature from clients (#8556, Sage Weil)
  • libcephfs-java: build with older JNI headers (Greg Farnum)
  • libcephfs-java: fix build with gcj-jdk (Dmitry Smirnov)
  • librados: fix osd op tid for redirected ops (#7588, Samuel Just)
  • librados: fix rados_pool_list buffer bounds checks (#8447, Sage Weil)
  • librados: resend ops when pool overlay changes (#8305, Sage Weil)
  • librbd, ceph-fuse: reduce CPU overhead for clean object check in cache (Haomai Wang)
  • mon: allow deletion of cephfs pools (John Spray)
  • mon: fix default pool ruleset choice (#8373, John Spray)
  • mon: fix health summary for mon low disk warning (Sage Weil)
  • mon: fix ‘osd pool set <pool> cache_target_full_ratio’ (Geoffrey Hartz)
  • mon: fix quorum feature check (Greg Farnum)
  • mon: fix request forwarding in mixed firefly+dumpling clusters 9#8727, Joao Eduardo Luis)
  • mon: fix rule vs ruleset check in ‘osd pool set … crush_ruleset’ command (John Spray)
  • mon: make osd ‘down’ count accurate (Sage Weil)
  • mon: set ‘next commit’ in primary-affinity reply (Ilya Dryomov)
  • mon: verify CRUSH features are supported by all mons (#8738, Greg Farnum)
  • msgr: fix sequence negotiation during connection reset (Guang Yang)
  • osd: block scrub on blocked objects (#8011, Samuel Just)
  • osd: call XFS hint ioctl less often (#8241, Ilya Dryomov)
  • osd: copy xattr spill out marker on clone (Haomai Wang)
  • osd: fix flush of snapped objects (#8334, Samuel Just)
  • osd: fix hashindex restart of merge operation (#8332, Samuel Just)
  • osd: fix osdmap subscription bug causing startup hang (Greg Farnum)
  • osd: fix potential null deref (#8328, Sage Weil)
  • osd: fix shutdown race (#8319, Sage Weil)
  • osd: handle ‘none’ in CRUSH results properly during peering (#8507, Samuel Just)
  • osd: set no spill out marker on new objects (Greg Farnum)
  • osd: skip op ordering debug checks on tiered pools (#8380, Sage Weil)
  • rados: enforce ‘put’ alignment (Lluis Pamies-Juarez)
  • rest-api: fix for ‘rx’ commands (Ailing Zhang)
  • rgw: calc user manifest etag and fix check (#8169, #8436, Yehuda Sadeh)
  • rgw: fetch attrs on multipart completion (#8452, Yehuda Sadeh, Sylvain Munaut)
  • rgw: fix buffer overflow for long instance ids (#8608, Yehuda Sadeh)
  • rgw: fix entity permission check on metadata put (#8428, Yehuda Sadeh)
  • rgw: fix multipart retry race (#8269, Yehuda Sadeh)
  • rpm: split ceph into ceph and ceph-common RPMs (Sandon Van Ness, Dan Mick)
  • sysvinit: continue startin daemons after failure doing mount (#8554, Sage Weil)

For more detailed information, see the complete changelog.

GETTING CEPH

Page 1 of 1312345...10...Last »
© 2013, Inktank Storage, Inc.. All rights reserved.