Free Azure Data Scientist Associate (DP-100) certification test. Practice them if you Want to make a tech career as a data scientist
Welcome to your Microsoft Azure Test-11
You need to normalize values to produce an output column into bins to predict a target column.Solution: Apply a Quantiles normalization with a QuantileIndex normalization.Does the solution meet the goal?Select 1 option(s):
You need to configure the DLVM to support CUDA.What should you implement?Select 1 option(s):
You need to determine whether the feature values achieve the conditions to build a Poisson regression model.Which two conditions must the feature set contain?Select 2 option(s):
What should you do?Select 1 option(s):
How should you configure the Clean Missing Data module? To answer, select the appropriate options in the answer area.”Select 1 option(s):
How should you configure the module properties? To answer, select the appropriate options in the dialog box in the answer area.Select 1 option(s):
You need to evaluate the model results for imbalance.Which evaluation metric should you use?Select 1 option(s):
You need to normalize values to produce an output column into bins to predict a target column.Solution: Apply an Equal Width with Custom Start and Stop binning mode.Does the solution meet the goal?Select 1 option(s):
Which framework-specific estimator should you use to run the script as an experiment?Select 1 option(s):
Developers need to access data in the database using an API.You need to determine which API to use for the database model and type.Which two APIs should you use?Select 2 option(s):
You need to use the Clean Missing Data module in Azure Machine Learning Studio to identify and resolve the null and missing data in the dataset.Which parameter should you use?Select 1 option(s):
Which kind of task should you specify for automated machine learning?Select 1 option(s):
Student devices are not configured for Python development. Students do not have administrator access to install software on their devices. Azure subscriptions are not available for students.You need to ensure that students can run Python-based data visualization code.Which Azure tool should you use?Select 1 option(s):
Select 1 option(s):
You want to run a script as an experiment that loads the data files and trains a model.What should you do?Select 1 option(s):
You want to use the dataset in an experiment script that is run using an estimator.What should you do?Select 1 option(s):
You have the following requirements: Models must be built using Caffe2 or Chainer frameworks. Data scientists must be able to use a data science environment to build the machine learning pipelines and train models on their personal devices in both connected and disconnected network environments.Personal devices must support updating machine learning pipelines when connected to a network.You need to select a data science environment.Select 1 option(s):
Step 1 preprocesses some data, and step 2 uses the preprocessed data to train a model.What type of object should you use to pass data from step 1 to step 2 and create a dependency between these steps?Select 1 option(s):
You must evaluate your model on a limited data sample by using k-fold cross validation. You start by configuring a k parameter as the number of splits.You need to configure the k parameter for the cross-validation.Which value should you use?Select 1 option(s):
You have the following data available for model building: Video recordings of sporting events Transcripts of radio commentary about events Logs from related social media feeds captured during sporting eventsYou need to select an environment for creating the model.Which environment should you use?Select 1 option(s):
You need to ensure that workshop attendees can install Docker on their devices.Which two prerequisite components should attendees install on the devices?Select 2 option(s):
What functions must the entry script for the service include?Select 1 option(s):
You plan to use the Schedule.create method to create the schedule.What kind of object must you create first to configure how frequently the pipeline runs?Select 1 option(s):
Which three Azure Machine Learning Studio modules should you use? Each correct answer presents part of the solution.”Select 3 option(s):
You have the following requirements: Models must be built using Caffe2 or Chainer frameworks. Data scientists must be able to use a data science environment to build the machine learning pipelines and train models on their personal devices in bothconnected and disconnected network environments.Personal devices must support updating machine learning pipelines when connected to a network.You need to select a data science environment.Which environment should you use?Select 1 option(s):
One class has a much smaller number of observations than the other classes in the training set.You need to select an appropriate data sampling strategy to compensate for the class imbalance.Solution: You use the Scale and Reduce sampling mode.Does the solution meet the goal?Select 1 option(s):
You need to build and train the machine learning model to learn the sequence of the textual content.Which type of neural network should you use?Select 1 option(s):
The training loss, validation loss, training accuracy, and validation accuracy of each training epoch has been provided.You need to identify whether the classification model is overfitted.Which of the following is correct?Select 1 option(s):
You must tune hyperparameters by performing a parameter sweep of the model. The parameter sweep must meet the following requirements: iterate all possible combinations of hyperparameters minimize computing resources required to perform the sweepYou need to perform a parameter sweep of the model.Which parameter sweep mode should you use?Select 1 option(s):
You need to normalize values to produce a feature column grouped into bins.Solution: Apply an Entropy Minimum Description Length (MDL) binning mode.Does the solution meet the goal?Select 1 option(s):
When the run completes, which method of the run object should you use to retrieve the best model?Select 1 option(s):
Which method should you use?Select 1 option(s):
One class has a much smaller number of observations than the other classes in the training set.You need to select an appropriate data sampling strategy to compensate for the class imbalance.Solution: You use the Synthetic Minority Oversampling Technique (SMOTE) sampling mode.Does the solution meet the goal?Select 1 option(s):
Time's up