Difference between revisions of "VScaler: vGPU Configuration"
(Created page with " 1. Yum install kernel kernel-devel -y 2. reboot 3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/dow...") |
|||
| Line 1: | Line 1: | ||
1. Yum install kernel kernel-devel -y | 1. Yum install kernel kernel-devel -y | ||
| + | |||
2. reboot | 2. reboot | ||
| + | |||
3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877 | 3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877 | ||
| + | |||
4. Yum install gcc glibc -y | 4. Yum install gcc glibc -y | ||
| + | |||
5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm | 5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm | ||
| + | |||
6. Reboot | 6. Reboot | ||
| + | |||
7. Check with: | 7. Check with: | ||
a. lsmod | grep vfio | a. lsmod | grep vfio | ||
b. Nvidia-smi | b. Nvidia-smi | ||
| + | |||
8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/ | 8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/ | ||
| + | |||
9. Systemctl start nvidia-vgpu-mgr.service | 9. Systemctl start nvidia-vgpu-mgr.service | ||
| + | |||
10. Systemctl enable nvidia-vgpu-mgr.service | 10. Systemctl enable nvidia-vgpu-mgr.service | ||
| + | |||
11. systemctl start nvidia-vgpud.service | 11. systemctl start nvidia-vgpud.service | ||
| + | |||
12. systemctl enable nvidia-vgpud.service | 12. systemctl enable nvidia-vgpud.service | ||
| + | |||
13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101 | 13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101 | ||
| + | |||
14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU: | 14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU: | ||
a. uuidgen | a. uuidgen | ||
b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create | b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create | ||
| + | |||
15. Disable ECC on the GPUs on the host | 15. Disable ECC on the GPUs on the host | ||
| + | |||
16. Add the following to nova.conf of the gpu nodes: | 16. Add the following to nova.conf of the gpu nodes: | ||
[devices] | [devices] | ||
enabled_vgpu_types = nvidia-84 | enabled_vgpu_types = nvidia-84 | ||
| + | |||
17. Add the following to nova.conf of the controller nodes: | 17. Add the following to nova.conf of the controller nodes: | ||
[scheduler] | [scheduler] | ||
| Line 28: | Line 44: | ||
available_filters = nova.scheduler.filters.all_filters | available_filters = nova.scheduler.filters.all_filters | ||
enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter | enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter | ||
| + | |||
18. Do a kolla-ansible reconfigure to apply the above settings | 18. Do a kolla-ansible reconfigure to apply the above settings | ||
| + | |||
19. Create a flavor with the following property: | 19. Create a flavor with the following property: | ||
a. --property "resources:VGPU=1 | a. --property "resources:VGPU=1 | ||
Revision as of 17:20, 8 January 2019
1. Yum install kernel kernel-devel -y
2. reboot
3. Download latest NVIDIA GRID software/drivers for RHEL KVM from here https://nvidia.flexnetoperations.com/control/nvda/download?agree=Accept&element=10189877
4. Yum install gcc glibc -y
5. Rpm -iv NVIDIA-vGPU-rhel-7.5-390.72.x86_64.rpm
6. Reboot
7. Check with: a. lsmod | grep vfio b. Nvidia-smi
8. cp /usr/lib/nvidia/systemd/nvidia-vgpu* /usr/lib/systemd/system/
9. Systemctl start nvidia-vgpu-mgr.service
10. Systemctl enable nvidia-vgpu-mgr.service
11. systemctl start nvidia-vgpud.service
12. systemctl enable nvidia-vgpud.service
13. Check the /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/ directories and select one of the supported devices, eg nvidia-101
14. Create uuids and vgpu devices with them FOR EACH PHYSICAL GPU: a. uuidgen b. echo "af88fbf2-0110-4669-ab84-d747e9a9c19c" > /sys/class/mdev_bus/0000\:05\:00.0/mdev_supported_types/nvidia-101/create
15. Disable ECC on the GPUs on the host
16. Add the following to nova.conf of the gpu nodes: [devices] enabled_vgpu_types = nvidia-84
17. Add the following to nova.conf of the controller nodes: [scheduler] driver = filter_scheduler
[filter_scheduler] available_filters = nova.scheduler.filters.all_filters enabled_filters = AvailabilityZoneFilter, ComputeFilter, ComputeCapabilitiesFilter, ImagePropertiesFilter, ServerGroupAntiAffinityFilter, ServerGroupAffinityFilter
18. Do a kolla-ansible reconfigure to apply the above settings
19. Create a flavor with the following property: a. --property "resources:VGPU=1