The attention mechanism in foundation model architectures allows the model to focus on specific parts of the input data. Which of the following steps are key components of a standard attention mechanism?
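For reference, the steps usually associated with scaled dot-product attention (score the query against each key, scale, normalize with softmax, then take a weighted sum of the values) can be sketched as follows. This is a minimal, illustrative sketch in pure Python; the function names and the list-of-lists data layout are assumptions made for the example, not part of the question.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Step 1: similarity score between the query and every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        # Step 2: normalize the scores into attention weights.
        weights = softmax(scores)
        # Step 3: weighted sum of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights
```

When all keys are identical, every value receives the same weight, so the output is simply the mean of the value vectors.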
If OpenCV is used to read an image and save it to variable "img" during image preprocessing, (h, w) = img.shape[:2] can be used to obtain the image size.
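The slicing in the statement above can be illustrated without an image file. The sketch below uses a NumPy array as a stand-in for what cv2.imread returns for a color image (rows, columns, channels); the 480 x 640 size is an arbitrary assumption for the example.

```python
import numpy as np

# Stand-in for the array returned by cv2.imread for a color image
# (assumed layout: height x width x 3 channels, dtype uint8).
img = np.zeros((480, 640, 3), dtype=np.uint8)

# shape is (height, width, channels); the first two entries are the image size.
(h, w) = img.shape[:2]
print(h, w)
```

Note that the height comes first and the width second. With a real file, cv2.imread returns None when the image cannot be read, so the result should be checked before accessing shape.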
Which of the following statements about the multi-head attention mechanism of the Transformer are true?
Maximum likelihood estimation (MLE) can be used for parameter estimation in a Gaussian mixture model (GMM).
Which of the following statements about the standard normal distribution are true?
Maximum likelihood estimation (MLE) requires knowledge of the sample data's distribution type.
Which of the following statements about the functions of layer normalization and residual connection in the Transformer is true?
Overfitting is a condition where a model is overly simple and excessive generalization errors occur.
In image recognition algorithms, the structural design of the convolutional layer has a great impact on performance. Which of the following statements are true about the structure and mechanism of the convolutional layer? (Transposed convolution is not considered.)
In the HSV color space, H is for hue, S is for saturation, and V is for value. Which of the following statements about the HSV color space are true?
In 2017, the Google machine translation team proposed the Transformer in their paper "Attention Is All You Need". In a Transformer model, there is a customized LSTM with CNN layers.
Which of the following statements about the levels of natural language understanding are true?
In natural language processing tasks, word vector evaluation is an important aspect for measuring the performance of a word embedding model. Which of the following statements about word vector evaluation are true?
Among image preprocessing techniques, gamma correction is a common non-linear brightness adjustment method. Which of the following statements are true about the application and features of gamma correction?
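As a point of reference, gamma correction on an 8-bit pixel can be sketched as a power-law mapping of the normalized intensity. The helper name gamma_correct and the convention output = input ** gamma are assumptions for this example; some tools apply the exponent 1 / gamma instead, which reverses the effect described below.

```python
def gamma_correct(value, gamma):
    """Apply power-law (gamma) correction to one 8-bit intensity value.

    Assumed convention: output = 255 * (value / 255) ** gamma.
    """
    if not 0 <= value <= 255:
        raise ValueError("value must be in the range 0..255")
    normalized = value / 255.0           # map to [0, 1]
    corrected = normalized ** gamma      # non-linear adjustment
    return round(255.0 * corrected)      # map back to 0..255
```

Under this convention, a gamma below 1 brightens mid-tones, a gamma above 1 darkens them, and the endpoints 0 and 255 are left unchanged.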
The development of large models should comply with ethical principles to ensure the legal, fair, and transparent use of data.
Which of the following methods are useful when tackling overfitting?
A text classification task has only one final output, while a sequence labeling task has an output in each input position.
The basic operations of morphological processing include dilation and erosion. These operations can be combined to achieve practical algorithms such as opening and closing operations.
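The combination described above can be sketched in pure Python for binary images stored as lists of 0/1 rows. This is an illustrative sketch only: the function names, the square k x k structuring element, and the choice to ignore pixels outside the image border are assumptions made for the example.

```python
def _morph(img, k, combine):
    """Apply `combine` (min or max) over a k x k window around each pixel.

    Pixels outside the image are ignored (an assumption of this sketch).
    """
    rows, cols = len(img), len(img[0])
    r = k // 2
    out = [[0] * cols for _ in range(rows)]
    for y in range(rows):
        for x in range(cols):
            window = [img[j][i]
                      for j in range(max(0, y - r), min(rows, y + r + 1))
                      for i in range(max(0, x - r), min(cols, x + r + 1))]
            out[y][x] = combine(window)
    return out


def erode(img, k=3):
    """Erosion: a pixel stays 1 only if every pixel in its window is 1."""
    return _morph(img, k, min)


def dilate(img, k=3):
    """Dilation: a pixel becomes 1 if any pixel in its window is 1."""
    return _morph(img, k, max)


def opening(img, k=3):
    """Opening: erosion followed by dilation; removes small bright specks."""
    return dilate(erode(img, k), k)


def closing(img, k=3):
    """Closing: dilation followed by erosion; fills small dark holes."""
    return erode(dilate(img, k), k)
```

In this sketch, opening removes an isolated foreground pixel while preserving a 3 x 3 block, and closing fills a one-pixel hole inside such a block.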