"AllerEgo" is a product that uses sell-learning to predict the behavior of a pilot under combat situation for a variety of terrains and enemy aircraft formations. Post training the model was exposed to the real-
world data and the model was found to be behaving poorly. A lot of data quality tests had been performed on the data to bring it into a shape fit for training and testing.
Which ONE of the following options is least likely to describes the possible reason for the fall in the performance, especially when considering the self-learning nature of the Al system?
SELECT ONE OPTION
The difficulty of defining criteria for improvement before the model can be accepted.
The fast pace of change did not allow sufficient time for testing.
The unknown nature and insufficient specification of the operating environment might have caused the poor performance.
Which ONE of the following tests is MOST likely to describe a useful test to help detect different kinds of biases in ML pipeline?
SELECT ONE OPTION
Which of the following is an example of a clustering problem that can be resolved by unsupervised learning?
Upon testing a model used to detect rotten tomatoes, the following data was observed by the test engineer, based on certain number of tomato images.
For this confusion matrix which combinations of values of accuracy, recall, and specificity respectively is CORRECT?
SELECT ONE OPTION
Which ONE of the following statements correctly describes the importance of flexibility for Al systems?
SELECT ONE OPTION
"Splendid Healthcare" has started developing a cancer detection system based on ML. The type of cancer they plan on detecting has 2% prevalence rate in the population of a particular geography. It is required that the model performs well for both normal and cancer patients.
Which ONE of the following combinations requires MAXIMIZATION?
SELECT ONE OPTION
Which of the following is one of the reasons for data mislabelling?
A startup company has implemented a new facial recognition system for a banking application for mobile devices. The application is intended to learn at run-time on the device to determine if the user should be granted access. It also sends feedback over the Internet to the application developers. The application deployment resulted in continuous restarts of the mobile devices.
Which of the following is the most likely cause of the failure?
Which ONE of the following hardware is MOST suitable for implementing Al when using ML?
SELECT ONE OPTION
Which ONE of the following tests is LEAST likely to be performed during the ML model testing phase?
SELECT ONE OPTION
ln the near future, technology will have evolved, and Al will be able to learn multiple tasks by itself without needing to be retrained, allowing it to operate even in new environments. The cognitive abilities of Al are similar to a child of 1-2 years.’
In the above quote, which ONE of the following options is the correct name of this type of Al?
SELECT ONE OPTION
Which of the following is a technique used in machine learning?
Which ONE of the following options BEST DESCRIBES clustering?
SELECT ONE OPTION
Which of the following is an example of an input change where it would be expected that the AI system should be able to adapt?
Which of the following are the three activities in the data acquisition activities for data preparation?
Which of the following is correct regarding the layers of a deep neural network?
A software component uses machine learning to recognize the digits from a scan of handwritten numbers. In the scenario above, which type of Machine Learning (ML) is this an example of?
SELECT ONE OPTION
Which of the following is a dataset issue that can be resolved using pre-processing?
Consider an AI-system in which the complex internal structure has been generated by another software system. Why would the tester choose to do black-box testing on this particular system?
Which ONE of the following types of coverage SHOULD be used if test cases need to cause each neuron to achieve both positive and negative activation values?
SELECT ONE OPTION
A ML engineer is trying to determine the correctness of the new open-source implementation *X", of a supervised regression algorithm implementation. R-Square is one of the functional performance metrics used to determine the quality of the model.
Which ONE of the following would be an APPROPRIATE strategy to achieve this goal?
SELECT ONE OPTION
There is a growing backlog of unresolved defects for your project. You know the developers have an ML model that they have created which has learned which developers work on which type of software and the speed with which they resolve issues. How could you use this model to help reduce the backlog and implement more efficient defect resolution?
You are testing an autonomous vehicle which uses AI to determine proper driving actions and responses. You have evaluated the parameters and combinations to be tested and have determined that there are too many to test in the time allowed. It has been suggested that you use pairwise testing to limit the parameters. Given the complexity of the software under test, what is likely the outcome from using pairwise testing?
Which of the following approaches would help overcome testing challenges associated with probabilistic and non-deterministic AI-based systems?