Difference between revisions of "VScaler: vGPU Configuration"

From Define Wiki
Jump to navigation Jump to search
(Created page with " 1. Yum install kernel kernel-devel -y 2. reboot 3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/dow...")
 
Line 1: Line 1:
 
1. Yum install kernel kernel-devel -y
 
1. Yum install kernel kernel-devel -y
 +
 
2. reboot
 
2. reboot
 +
 
3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877
 
3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877
 +
 
4. Yum install gcc glibc -y
 
4. Yum install gcc glibc -y
 +
 
5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm
 
5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm
 +
 
6. Reboot
 
6. Reboot
 +
 
7. Check with:
 
7. Check with:
 
a. lsmod | grep vfio
 
a. lsmod | grep vfio
 
b. Nvidia-smi
 
b. Nvidia-smi
 +
 
8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/
 
8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/
 +
 
9. Systemctl start nvidia-vgpu-mgr.service 
 
9. Systemctl start nvidia-vgpu-mgr.service 
 +
 
10. Systemctl enable nvidia-vgpu-mgr.service 
 
10. Systemctl enable nvidia-vgpu-mgr.service 
 +
 
11. systemctl start nvidia-vgpud.service
 
11. systemctl start nvidia-vgpud.service
 +
 
12. systemctl enable nvidia-vgpud.service
 
12. systemctl enable nvidia-vgpud.service
 +
 
13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101
 
13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101
 +
 
14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU:
 
14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU:
 
a. uuidgen
 
a. uuidgen
 
b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create 
 
b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create 
 +
 
15. Disable ECC on the GPUs on the host
 
15. Disable ECC on the GPUs on the host
 +
 
16. Add the following to nova.conf of the gpu nodes:
 
16. Add the following to nova.conf of the gpu nodes:
 
[devices]
 
[devices]
 
enabled_vgpu_types = nvidia-84
 
enabled_vgpu_types = nvidia-84
 +
 
17. Add the following to nova.conf of the controller nodes:
 
17. Add the following to nova.conf of the controller nodes:
 
[scheduler]
 
[scheduler]
Line 28: Line 44:
 
available_filters = nova.scheduler.filters.all_filters
 
available_filters = nova.scheduler.filters.all_filters
 
enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter
 
enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter
 +
 
18. Do a kolla-ansible reconfigure to apply the above settings
 
18. Do a kolla-ansible reconfigure to apply the above settings
 +
 
19. Create a flavor with the following property:
 
19. Create a flavor with the following property:
 
a. --property "resources:VGPU=1
 
a. --property "resources:VGPU=1

Revision as of 17:20, 8 January 2019

1. Yum install kernel kernel-devel -y

2. reboot

3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877

4. Yum install gcc glibc -y

5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm

6. Reboot

7. Check with: a. lsmod | grep vfio b. Nvidia-smi

8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/

9. Systemctl start nvidia-vgpu-mgr.service 

10. Systemctl enable nvidia-vgpu-mgr.service 

11. systemctl start nvidia-vgpud.service

12. systemctl enable nvidia-vgpud.service

13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101

14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU: a. uuidgen b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create 

15. Disable ECC on the GPUs on the host

16. Add the following to nova.conf of the gpu nodes: [devices] enabled_vgpu_types = nvidia-84

17. Add the following to nova.conf of the controller nodes: [scheduler] driver = filter_scheduler

[filter_scheduler] available_filters = nova.scheduler.filters.all_filters enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter

18. Do a kolla-ansible reconfigure to apply the above settings

19. Create a flavor with the following property: a. --property "resources:VGPU=1