NCP-AI Online Practice Questions

Home / Nutanix / NCP-AI

Latest NCP-AI Exam Practice Questions

The practice questions for NCP-AI exam was last updated on 2025-12-14 .

Viewing page 1 out of 5 pages.

Viewing questions 1 out of 27 questions.

Question#1

An administrator notices increased model inference latency and frequent timeout errors in a Nutanix AI deployment during peak usage.
What is the most effective action to troubleshoot and resolve the performance issue?

A. Reduce the number of API keys to limit external access to the model.
B. Restart the Nutanix Kubernetes cluster to clear cached memory and reset service states.
C. Disable logging and monitoring tools to free up system resources.
D. Analyze resource metrics and scale out the model service to handle increased load.

Question#2

Here is the text converted from the uploaded image.
What does hibernating an endpoint do?

A. It deletes all the resources from the endpoint without affecting the LL
B. It pauses the endpoint and releases compute resources without deleting the endpoint.
C. It freezes the activity of the endpoint to make edits to the endpoint.
D. It optimizes resource usage and deletes the endpoint for new endpoints to be created.

Question#3

An administrator is managing a Nutanix AI cluster used for NLP (Natural Language Processing) training. A data scientist reports that training jobs intermittently stall and fail to complete within the expected time window. The administrator reviews the performance data for the VM hosting the job and finds:
The VM is configured with passthrough access to a dedicated GPU
Memory ballooning is active, and swap usage is increasing
CPU utilization is moderate (~60%)
GPU utilization is stable and high (~85%)
The VM has 8 vCPUs and 24 GB of RAM assigned
NCC shows no hardware or driver issues
What is the most appropriate optimization to improve workload stability and performance?

A. Reduce the number of vCPUs allocated to lower the CPU scheduling overhead.
B. Add additional vGPUs to the VM to reduce processing time.
C. Increase the VM's RAM to eliminate memory ballooning and swap usage.
D. Disable GPU passthrough and use a shared vGPU profile instead.

Question#4

Here is the text converted from the uploaded image:
An Accounting Department is thrilled with the RAG Application that the Application & Data Science Teams recently rolled out. However, they provided some feedback that sometimes (approximately 20% of the time), the documents retrieved are not relevant to their prompts or are too generic.
During development, there was extensive testing between models to make sure the best possible model was selected. The Accounting Department emphasizes that when the responses use the right documents, the results are very good and they are pleased with the completeness, accuracy, and coherence of those responses.
What would be a way to address the irrelevant RAG results without having to rebuild the entire workflow?

A. Replace the embedding model with a larger, more general-purpose language model to improve document retrieval.
B. Fine-tune the Large Language Model on a broader dataset to enable it to generate more relevant responses.
C. Implement a rerank model as a post retrieval step to re-order initially retrieved documents based on query-document relevance.
D. Significantly expand the document knowledge base by ingesting a much larger volume of financial reports.

Question#5

When licensing Nutanix Enterprise AI, which license is required to create an inference endpoint?

A. GPU GB
B. vCPU
C. NUS Pro
D. NKP Ultimate

Exam Code: NCP-AIQ & A: 75 Q&AsUpdated:  2025-12-14

 Get All NCP-AI Q&As