<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?action=history&amp;feed=atom&amp;title=Lustre%3A_Problems_with_df_on_lustre_clients</id>
	<title>Lustre: Problems with df on lustre clients - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?action=history&amp;feed=atom&amp;title=Lustre%3A_Problems_with_df_on_lustre_clients"/>
	<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=Lustre:_Problems_with_df_on_lustre_clients&amp;action=history"/>
	<updated>2026-05-04T22:58:49Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.35.0</generator>
	<entry>
		<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=Lustre:_Problems_with_df_on_lustre_clients&amp;diff=9321&amp;oldid=prev</id>
		<title>Chenhui: /* ptlrpcd_rcv loop CPU 100% */</title>
		<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=Lustre:_Problems_with_df_on_lustre_clients&amp;diff=9321&amp;oldid=prev"/>
		<updated>2015-07-31T11:12:39Z</updated>

		<summary type="html">&lt;p&gt;&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;ptlrpcd_rcv loop CPU 100%&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left diff-editfont-monospace&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 11:12, 31 July 2015&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l70&quot; &gt;Line 70:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 70:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Drop caches to resolve; *NOTE* Only do this when no users are running jobs - something wierd happened when we did this on a compute node with a user job. Clear jobs first then drop caches&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Drop caches to resolve; *NOTE* Only do this when no users are running jobs - something wierd happened when we did this on a compute node with a user job. Clear jobs first then drop caches&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;syntaxhighlight&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;syntaxhighlight&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del class=&quot;diffchange diffchange-inline&quot;&gt; &lt;/del&gt;echo 1 &amp;gt; /proc/sys/vm/drop_caches&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;    lctl set_param ldlm.namespaces.*.lru_size=clear&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt; &lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;    or&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot;&gt; &lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;    &lt;/ins&gt;echo 1 &amp;gt; /proc/sys/vm/drop_caches&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;/syntaxhighlight&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;/syntaxhighlight&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Chenhui</name></author>
	</entry>
	<entry>
		<id>http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=Lustre:_Problems_with_df_on_lustre_clients&amp;diff=8281&amp;oldid=prev</id>
		<title>David: Created page with &quot;== df hangs because OST is not accessible == * This will occur when OSTs are offline or inactive (in this instance they were added through IML and then removed, but not remove...&quot;</title>
		<link rel="alternate" type="text/html" href="http://wiki.define-technology.com/mediawiki-1.35.0/index.php?title=Lustre:_Problems_with_df_on_lustre_clients&amp;diff=8281&amp;oldid=prev"/>
		<updated>2015-07-01T11:00:52Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;== df hangs because OST is not accessible == * This will occur when OSTs are offline or inactive (in this instance they were added through IML and then removed, but not remove...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;== df hangs because OST is not accessible ==&lt;br /&gt;
* This will occur when OSTs are offline or inactive (in this instance they were added through IML and then removed, but not removed fully. &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@hyalite ~]# lfs df -h&lt;br /&gt;
UUID                       bytes        Used   Available Use% Mounted on&lt;br /&gt;
lustrefs-MDT0000_UUID        1.2T       14.6G        1.1T   1% /mnt/lustrefs[MDT:0]&lt;br /&gt;
lustrefs-OST0000_UUID       36.4T        7.5T       27.0T  22% /mnt/lustrefs[OST:0]&lt;br /&gt;
lustrefs-OST0001_UUID       36.4T        8.2T       26.4T  24% /mnt/lustrefs[OST:1]&lt;br /&gt;
lustrefs-OST0002_UUID       36.4T        7.2T       27.3T  21% /mnt/lustrefs[OST:2]&lt;br /&gt;
lustrefs-OST0003_UUID       36.4T        8.0T       26.5T  23% /mnt/lustrefs[OST:3]&lt;br /&gt;
lustrefs-OST0004_UUID       36.4T        6.8T       27.8T  20% /mnt/lustrefs[OST:4]&lt;br /&gt;
lustrefs-OST0005_UUID       36.4T        6.6T       28.0T  19% /mnt/lustrefs[OST:5]&lt;br /&gt;
lustrefs-OST0006_UUID       36.4T        5.1T       29.4T  15% /mnt/lustrefs[OST:6]&lt;br /&gt;
lustrefs-OST0007_UUID       36.4T        5.8T       28.7T  17% /mnt/lustrefs[OST:7]&lt;br /&gt;
lustrefs-OST0008_UUID       54.6T      168.5G       51.7T   0% /mnt/lustrefs[OST:8]&lt;br /&gt;
lustrefs-OST0009_UUID       54.6T      146.9G       51.7T   0% /mnt/lustrefs[OST:9]&lt;br /&gt;
OST000a             : inactive device&lt;br /&gt;
OST000b             : inactive device&lt;br /&gt;
&lt;br /&gt;
filesystem summary:       400.1T       55.6T      324.4T  15% /mnt/lustrefs&lt;br /&gt;
&lt;br /&gt;
* It’ll be because of the inactive devices, to correct this; &lt;br /&gt;
* Run; &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
 lctl set_param osc.lustrefs-OST000a-*.active=0&lt;br /&gt;
 lctl set_param osc.lustrefs-OST000b-*.active=0&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* And its worked ok again &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@hyalite ~]#  lctl set_param osc.lustrefs-OST000a-*.active=0&lt;br /&gt;
osc.lustrefs-OST000a-osc-ffff881070ee7000.active=0&lt;br /&gt;
[root@hyalite ~]#  lctl set_param osc.lustrefs-OST000b-*.active=0&lt;br /&gt;
osc.lustrefs-OST000b-osc-ffff881070ee7000.active=0&lt;br /&gt;
[root@hyalite ~]# df -h&lt;br /&gt;
Filesystem            Size  Used Avail Use% Mounted on&lt;br /&gt;
/dev/md126            867G  365G  458G  45% /&lt;br /&gt;
tmpfs                  32G   76K   32G   1% /dev/shm&lt;br /&gt;
/dev/md127            496M   27M  444M   6% /boot&lt;br /&gt;
/dev/md125            7.9G  152M  7.4G   2% /tmp&lt;br /&gt;
/dev/md123             16G  3.1G   12G  21% /var&lt;br /&gt;
/dev/md122            9.9G  501M  8.9G   6% /var/lib/mysql/cmdaemon_mon&lt;br /&gt;
172.23.19.42@tcp1:172.23.19.41@tcp1:/lustrefs&lt;br /&gt;
                      401T   56T  325T  15% /mnt/lustrefs&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another command to check out the devices within lustre is: &lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
[root@hyalite ~]# lctl dl &lt;br /&gt;
  0 UP mgc MGC172.23.19.42@tcp1 0d48eca7-fb5f-d53f-3bee-e6b1a6745dcc 5&lt;br /&gt;
  1 UP lov lustrefs-clilov-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 4&lt;br /&gt;
  2 UP lmv lustrefs-clilmv-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 4&lt;br /&gt;
  3 UP mdc lustrefs-MDT0000-mdc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  4 UP osc lustrefs-OST0000-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  5 UP osc lustrefs-OST0002-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  6 UP osc lustrefs-OST0003-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  7 UP osc lustrefs-OST0001-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  8 UP osc lustrefs-OST0005-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
  9 UP osc lustrefs-OST0007-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 10 UP osc lustrefs-OST0004-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 11 UP osc lustrefs-OST0006-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 12 UP osc lustrefs-OST0009-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 13 UP osc lustrefs-OST0008-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 14 UP osc lustrefs-OST000a-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
 15 UP osc lustrefs-OST000b-osc-ffff881070ee7000 32bcb3c7-3977-99f9-f3f4-0c1914ccec79 5&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== ptlrpcd_rcv loop CPU 100% ==&lt;br /&gt;
* Seems related to: https://jira.hpdd.intel.com/browse/LU-5787&lt;br /&gt;
* Drop caches to resolve; *NOTE* Only do this when no users are running jobs - something wierd happened when we did this on a compute node with a user job. Clear jobs first then drop caches&lt;br /&gt;
&amp;lt;syntaxhighlight&amp;gt;&lt;br /&gt;
 echo 1 &amp;gt; /proc/sys/vm/drop_caches&lt;br /&gt;
&amp;lt;/syntaxhighlight&amp;gt;&lt;/div&gt;</summary>
		<author><name>David</name></author>
	</entry>
</feed>