<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?action=history&amp;feed=atom&amp;title=VScaler%3A_Debugging_kolla_ceph_OSD_issues</id>
	<title>VScaler: Debugging kolla ceph OSD issues - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?action=history&amp;feed=atom&amp;title=VScaler%3A_Debugging_kolla_ceph_OSD_issues"/>
	<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=VScaler:_Debugging_kolla_ceph_OSD_issues&amp;action=history"/>
	<updated>2026-05-05T00:03:04Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.35.0</generator>
	<entry>
		<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=VScaler:_Debugging_kolla_ceph_OSD_issues&amp;diff=25703&amp;oldid=prev</id>
		<title>Martin t: Added a link to VScaler: Adding and removing kolla ceph OSDs</title>
		<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=VScaler:_Debugging_kolla_ceph_OSD_issues&amp;diff=25703&amp;oldid=prev"/>
		<updated>2018-02-19T11:28:38Z</updated>

		<summary type="html">&lt;p&gt;Added a link to VScaler: Adding and removing kolla ceph OSDs&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left diff-editfont-monospace&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 11:28, 19 February 2018&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l59&quot; &gt;Line 59:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 59:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;A lof of the errors above occured. Looks like we have a faulty disk. depending on the issue, wipe and readd to ceph or replace and readd.  &lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;A lof of the errors above occured. Looks like we have a faulty disk. depending on the issue, wipe and readd to ceph or replace and readd.  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;MArtin please add link here to replacing an OSD in &lt;/del&gt;kolla&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;. &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;* [[VScaler: Adding and removing &lt;/ins&gt;kolla &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;ceph OSDs]]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Sidenote: we can also run smart tests on the disk&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Sidenote: we can also run smart tests on the disk&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Martin t</name></author>
	</entry>
	<entry>
		<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=VScaler:_Debugging_kolla_ceph_OSD_issues&amp;diff=25684&amp;oldid=prev</id>
		<title>David: Created page with &quot;== Check the OSD status == &lt;syntaxhighlight&gt; [root@node02-enp175s0f0 ~]# docker exec ceph_mon ceph osd tree ID WEIGHT   TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY  -1 16.0000...&quot;</title>
		<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=VScaler:_Debugging_kolla_ceph_OSD_issues&amp;diff=25684&amp;oldid=prev"/>
		<updated>2018-02-19T11:10:45Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== Check the OSD status == &amp;lt;syntaxhighlight&amp;gt; [root@node02-enp175s0f0 ~]# docker exec ceph_mon ceph osd tree ID WEIGHT   TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY  -1 16.0000...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== Check the OSD status ==&lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@node02-enp175s0f0 ~]# docker exec ceph_mon ceph osd tree&lt;br /&gt;
ID WEIGHT   TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY &lt;br /&gt;
-1 16.00000 root default                                           &lt;br /&gt;
-2  4.00000     host 10.10.20.12                                   &lt;br /&gt;
 5  1.00000         osd.5           down        0          1.00000 &lt;br /&gt;
 8  1.00000         osd.8             up  1.00000          1.00000 &lt;br /&gt;
13  1.00000         osd.13            up  1.00000          1.00000 &lt;br /&gt;
16  1.00000         osd.16            up  1.00000          1.00000 &lt;br /&gt;
-3  4.00000     host 10.10.20.13                                   &lt;br /&gt;
 2  1.00000         osd.2             up  1.00000          1.00000 &lt;br /&gt;
 4  1.00000         osd.4             up  1.00000          1.00000 &lt;br /&gt;
 9  1.00000         osd.9             up  1.00000          1.00000 &lt;br /&gt;
12  1.00000         osd.12            up  1.00000          1.00000 &lt;br /&gt;
-4  4.00000     host 10.10.20.11                                   &lt;br /&gt;
 1  1.00000         osd.1             up  1.00000          1.00000 &lt;br /&gt;
 6  1.00000         osd.6             up  1.00000          1.00000 &lt;br /&gt;
10  1.00000         osd.10            up  1.00000          1.00000 &lt;br /&gt;
14  1.00000         osd.14            up  1.00000          1.00000 &lt;br /&gt;
-5  4.00000     host 10.10.20.10                                   &lt;br /&gt;
 3  1.00000         osd.3             up  1.00000          1.00000 &lt;br /&gt;
 7  1.00000         osd.7             up  1.00000          1.00000 &lt;br /&gt;
11  1.00000         osd.11            up  1.00000          1.00000 &lt;br /&gt;
15  1.00000         osd.15            up  1.00000          1.00000 &lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
So we can see OSD 5 is down above. Lets look at docker  to see whats happening (we see the container is stuck restarting - with an unexpected error (fairly useless error!) &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@node02-enp175s0f0 ~]# docker ps | grep -i osd_5 &lt;br /&gt;
5bce4c98e95a        registry.vscaler.com:5000/kolla/centos-binary-ceph-osd:4.0.3                    &amp;quot;kolla_start&amp;quot;       9 weeks ago         Restarting (134) 13 hours ago                       ceph_osd_5&lt;br /&gt;
&lt;br /&gt;
[root@node02-enp175s0f0 ~]# docker logs ceph_osd_5 | grep -i fail | tail -n 3&lt;br /&gt;
os/filestore/FileStore.cc: 2920: FAILED assert(0 == &amp;quot;unexpected error&amp;quot;)&lt;br /&gt;
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x85) [0x561f203c7ad5]&lt;br /&gt;
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x267) [0x561f203c7cb7]&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Ok lets drill down some more. What physical device is backing OSD 5&lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@node02-enp175s0f0 ~]# docker inspect ceph_osd_5 | grep -i &amp;#039;/var/lib/ceph/osd&amp;#039; &lt;br /&gt;
                &amp;quot;/var/lib/ceph/osd/1d29b3f9-e8c6-406d-b881-9eb6ca878d28:/var/lib/ceph/osd/ceph-5:rw&amp;quot;,&lt;br /&gt;
                &amp;quot;Source&amp;quot;: &amp;quot;/var/lib/ceph/osd/1d29b3f9-e8c6-406d-b881-9eb6ca878d28&amp;quot;,&lt;br /&gt;
                &amp;quot;Destination&amp;quot;: &amp;quot;/var/lib/ceph/osd/ceph-5&amp;quot;,&lt;br /&gt;
                &amp;quot;/var/lib/ceph/osd/ceph-5&amp;quot;: {},&lt;br /&gt;
[root@node02-enp175s0f0 ~]# df | grep /var/lib/ceph/osd/1d29b3f9-e8c6-406d-b881-9eb6ca878d28&lt;br /&gt;
/dev/sde2      1869578324  50644684 1818933640   3% /var/lib/ceph/osd/1d29b3f9-e8c6-406d-b881-9eb6ca878d28&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
So know we know OSD5 is sde on the host system. Lets check dmesg for IO errors. &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[849146.277876] sd 10:0:0:0: [sde] FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE&lt;br /&gt;
[849146.285800] sd 10:0:0:0: [sde] Sense Key : Medium Error [current] [descriptor] &lt;br /&gt;
[849146.293209] sd 10:0:0:0: [sde] Add. Sense: Unrecovered read error - auto reallocate failed&lt;br /&gt;
[849146.301580] sd 10:0:0:0: [sde] CDB: Read(10) 28 00 70 88 68 b0 00 00 08 00&lt;br /&gt;
[849146.308550] blk_update_request: I/O error, dev sde, sector 1887987888&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
A lof of the errors above occured. Looks like we have a faulty disk. depending on the issue, wipe and readd to ceph or replace and readd. &lt;br /&gt;
&lt;br /&gt;
MArtin please add link here to replacing an OSD in kolla. &lt;br /&gt;
&lt;br /&gt;
Sidenote: we can also run smart tests on the disk&lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
smartctl -t short -a /dev/sde&lt;br /&gt;
smartctl -t long -a /dev/sde&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;/div&gt;</summary>
		<author><name>David</name></author>
	</entry>
</feed>