Data Science & AI Applications

By Isie (Community Contributor)
Attempts: 1,078 | Questions: 40
1. Which of the following packages is used for tidy data?

Explanation

The tidyr package is used for tidy data. Tidy data is a structured format where each variable has its own column and each observation has its own row. The tidyr package provides functions to easily manipulate and transform data to conform to this tidy format.
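
As a rough illustration of the tidy format described above, here is a minimal Python sketch (not tidyr itself). The function name to_tidy and the sample city/temperature data are made up for this example.

```python
def to_tidy(wide_rows, id_key, var_name="variable", value_name="value"):
    """Reshape wide rows (one column per measurement) into tidy rows.

    Each output row holds exactly one observation: the identifier,
    the name of the measured variable, and its value.
    """
    tidy = []
    for row in wide_rows:
        for key, value in row.items():
            if key == id_key:
                continue  # the identifier is repeated on every tidy row
            tidy.append({id_key: row[id_key], var_name: key, value_name: value})
    return tidy


# Hypothetical wide data: one column per year.
wide = [
    {"city": "Oslo", "2019": 5.1, "2020": 5.4},
    {"city": "Cairo", "2019": 22.3, "2020": 22.9},
]

tidy = to_tidy(wide, "city", var_name="year", value_name="temp_c")
for row in tidy:
    print(row)
```

After reshaping, city, year, and temp_c each have their own column, and every row is a single observation.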

About This Quiz

Explore the fundamentals of data handling in Data Science & AI through this quiz. Assess your understanding of raw and processed data, and learn how data is categorized and utilized in real-world applications.

2. What is the intersection of a column and a row on an Excel worksheet called?

Explanation

The intersection of a column and a row on an Excel worksheet is called a cell. A cell is the smallest unit of a worksheet and is identified by a unique address, which is a combination of the column letter and the row number. It is where data can be entered, edited, and displayed.
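
The address scheme described above (column letter plus row number) can be sketched in a few lines of Python. The helper name cell_address is an assumption for this example and is not part of Excel or any library.

```python
def cell_address(row, column):
    """Return the A1-style address for 1-based row and column numbers."""
    if row < 1 or column < 1:
        raise ValueError("row and column must be 1 or greater")
    letters = ""
    while column > 0:
        # Column letters behave like a base-26 number without a zero digit.
        column, remainder = divmod(column - 1, 26)
        letters = chr(ord("A") + remainder) + letters
    return f"{letters}{row}"


print(cell_address(1, 1))   # first row, first column
print(cell_address(10, 3))  # tenth row, third column
print(cell_address(5, 27))  # the column after Z is AA
```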

3. Which of the following is an example of raw data?

Explanation

All of the mentioned options are examples of raw data because they are all original and unprocessed data that have been directly collected from a source. The original swath files generated from a sonar system, the initial time-series file of temperature values, and the real-time GPS-encoded navigation file are all examples of raw data that can be further analyzed and processed to extract meaningful information. Therefore, the correct answer is "all of the mentioned."

4. Which of the following are examples of software development tools?

Explanation

The correct answer is "all of the mentioned". Debuggers, editors, assemblers, compilers, and interpreters are all examples of software development tools. Debuggers are used to identify and fix errors in code, and editors are used to write and modify code. Assemblers translate assembly language into machine code, compilers translate high-level source code into machine code or another lower-level form, and interpreters execute source code directly without a separate compilation step. All of these tools are commonly used in software development.

5. Which of the following is another name for raw data?

Explanation

The term "eggy data" is not commonly used or recognized in the context of raw data, and it is likely a made-up or incorrect term. More widely recognized synonyms for raw data are "source data" and "primary data". Raw data is unprocessed, unorganized data collected directly from a source; it has not undergone any manipulation or analysis.

6. Which of the following functions gives information about top-level data?

Explanation

The function "head" gives information about the top level data. It returns the first few rows of a dataset, allowing users to quickly view the structure and content of the data. This function is commonly used in data analysis and exploration tasks to get a glimpse of the dataset before performing further operations.

7. Who is a data scientist?

Explanation

A data scientist is someone who possesses a combination of skills in programming, statistics, and mathematics. They are proficient in coding and software development, allowing them to manipulate and analyze large datasets. They also have a deep understanding of statistical concepts and techniques, enabling them to extract meaningful insights from data. Additionally, their mathematical skills help them build models and algorithms to solve complex problems. Therefore, the correct answer is "All of the mentioned" as a data scientist requires expertise in software programming, statistics, and mathematics.

8. What type of chart is useful for comparing parts of a whole?

Explanation

A pie chart is useful for comparing parts of a whole because it visually represents the proportion of each category or component in relation to the whole. The chart is divided into slices, with each slice representing a specific category or component. The size of each slice is proportional to its percentage or share of the whole, making it easy to compare and understand the relative importance or distribution of each part. This type of chart is commonly used to show market share, budget allocation, or demographic distribution.

9. The __________ feature of MS Excel quickly completes a series of data.

Explanation

The Auto fill feature of MS Excel quickly completes a series of data by automatically filling in the subsequent values based on the pattern established by the initial selection. This saves time and effort by eliminating the need to manually enter each value in the series.

10. How are new states generated in a genetic algorithm?

Explanation

In genetic algorithms, new states are generated through both mutation and cross-over. Mutation involves randomly altering certain genes in an individual's genetic code, introducing new variations. On the other hand, cross-over involves combining genetic material from two parent individuals to create offspring with a mix of their traits. By utilizing both mutation and cross-over, genetic algorithms can explore a wide range of possibilities and potentially find optimal solutions by gradually improving and evolving the population over successive generations.
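
The two operators described above can be sketched in Python as follows. This is a minimal illustration that assumes individuals are lists of bits, with single-point crossover and bit-flip mutation. The function names and the 0.1 mutation rate are choices made for this example; other encodings and operators are common.

```python
import random


def crossover(parent_a, parent_b, rng):
    """Single-point crossover: a prefix of one parent joined to the suffix of the other."""
    point = rng.randint(1, len(parent_a) - 1)  # cut strictly inside the list
    return parent_a[:point] + parent_b[point:]


def mutate(individual, rate, rng):
    """Flip each bit independently with probability `rate`."""
    return [1 - gene if rng.random() < rate else gene for gene in individual]


rng = random.Random(0)  # seeded so the run is repeatable
parent_a = [0] * 8
parent_b = [1] * 8

offspring = crossover(parent_a, parent_b, rng)
child = mutate(offspring, rate=0.1, rng=rng)
print(offspring)
print(child)
```

Crossover recombines material that already exists in the population, while mutation is what introduces values that neither parent carries.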

11. Data that summarize all observations in a category are called __________ data.

Explanation

Summarized data refers to data that has been condensed or aggregated to provide an overview or summary of the observations within a category. This type of data is useful for providing a high-level understanding of the information and can help to identify patterns or trends. In contrast, raw data refers to the original, unprocessed data that has not been summarized or analyzed. Therefore, the correct answer is "summarized".
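
To make the contrast between raw and summarized data concrete, here is a small Python sketch that condenses raw records into one summary per category. The summarize function and the sales records are invented for this illustration.

```python
from collections import defaultdict


def summarize(records, category_key, value_key):
    """Aggregate raw records into a count, total, and mean per category."""
    groups = defaultdict(list)
    for record in records:
        groups[record[category_key]].append(record[value_key])
    return {
        category: {
            "count": len(values),
            "total": sum(values),
            "mean": sum(values) / len(values),
        }
        for category, values in groups.items()
    }


# Hypothetical raw data: one record per sale.
sales = [
    {"region": "north", "amount": 100},
    {"region": "south", "amount": 50},
    {"region": "north", "amount": 300},
]

summary = summarize(sales, "region", "amount")
print(summary)
```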

12. Which of the following is used to extract data from the HTML code of websites?

Explanation

Web scraping is the process of extracting data from the HTML code of websites. It involves using automated tools or scripts to navigate through the website's structure and retrieve specific information. "Webdredging" and "webcleaning" are not established terms for this process. Therefore, the correct answer is "Webscraping".
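
A minimal sketch of the extraction step, using only Python's standard html.parser module, is shown below. The HTML snippet and the LinkExtractor class are made up for this example; a real scraper would first download the page and should respect the site's terms of use.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, link text) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current_href = None
        self._text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._current_href = dict(attrs).get("href")
            self._text_parts = []

    def handle_data(self, data):
        # Only keep text that appears inside an open <a href=...> tag.
        if self._current_href is not None:
            self._text_parts.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._current_href is not None:
            text = "".join(self._text_parts).strip()
            self.links.append((self._current_href, text))
            self._current_href = None


# Hypothetical page fragment standing in for downloaded HTML.
html = """
<ul>
  <li><a href="/quiz/1">Data basics</a></li>
  <li><a href="/quiz/2">AI <b>search</b></a></li>
</ul>
"""

parser = LinkExtractor()
parser.feed(html)
print(parser.links)
```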

13. Which of the following contains output segments of AI programming?

Explanation

All of the mentioned options contain output segments of AI programming. Printed language and synthesized output can be used to communicate the results or information generated by AI systems. The manipulation of physical objects can also be an output of AI programming, where AI systems can control robots or machines to perform specific tasks. Locomotion, or the movement of objects or robots, can also be an output of AI programming where AI systems can control the motion of autonomous vehicles or robots. Therefore, all of the mentioned options contain output segments of AI programming.

14. Point out the correct statement.

Explanation

The correct statement is that nearly 80% of data analysis time is spent on cleaning and preparing data. This often-cited estimate reflects how much of an analysis goes into organizing and formatting the data so that it is suitable for analysis. The process involves tasks such as removing duplicates, handling missing values, and transforming the data into a usable format. Time spent on data cleaning and preparation helps ensure that the data is accurate and reliable for further analysis and decision-making.
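
The cleaning tasks named above (removing duplicates, handling missing values, transforming values into a usable format) might look like this in a small Python sketch. The field names and the rule of dropping rows with a missing age are assumptions made for the example; real projects choose their own rules.

```python
def clean(rows):
    """Normalize names, drop rows with a missing age, and remove duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        name = row["name"].strip().title()  # transform: consistent formatting
        age_text = row["age"].strip()
        if not age_text:                    # missing value: skip the row
            continue
        record = (name, int(age_text))      # transform: text to integer
        if record in seen:                  # duplicate after normalization
            continue
        seen.add(record)
        cleaned.append({"name": name, "age": record[1]})
    return cleaned


# Hypothetical messy input, as it might arrive from a form or export.
raw = [
    {"name": " ada ", "age": "36"},
    {"name": "Ada", "age": "36"},
    {"name": "Grace", "age": ""},
    {"name": "alan", "age": " 41"},
]

cleaned_rows = clean(raw)
print(cleaned_rows)
```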

15. Which of the following is a complex system of structured messages?

Explanation

Languages are the complex systems of structured messages. They consist of a set of rules and symbols that allow individuals to communicate and convey meaning. Languages are composed of words, which are the basic units of meaning, and they can be expressed through signs or speech. Through languages, individuals can convey complex ideas, express emotions, and engage in social interactions. Therefore, languages are the most suitable option as the complex system of structured message.

16. Point out the correct statement.

Explanation

Primary data refers to the data that is collected firsthand from the original source. This means that primary data is the original source of data as it is collected directly by the researcher or organization for a specific purpose. Secondary data, on the other hand, is data that has been collected by someone else for their own purpose and is used by others for their research or analysis. Therefore, the correct statement is that primary data is the original source of data.

17. What is the name for information sent from robot sensors to robot controllers?

Explanation

Feedback is the correct answer because it refers to the information that is sent from robot sensors to robot controllers. Feedback is crucial for the robot controllers to make necessary adjustments and decisions based on the input received from the sensors. It helps in monitoring and controlling the robot's actions and ensuring its proper functioning. Temperature, pressure, and signal are not specifically related to the information sent from sensors to controllers, making them incorrect options.

18. Which of the following processes involves structuring datasets to facilitate analysis?

Explanation

Data tidying involves structuring datasets to facilitate analysis. This process includes cleaning and organizing the data, dealing with missing values, standardizing variables, and transforming the data into a format that is suitable for analysis. By tidying the data, it becomes easier to perform various analytical tasks such as data exploration, visualization, and modeling.

19. Which of the following will set the character that represents missing values?

Explanation

na.strings is an argument of R's read.table() family of functions (including read.csv()) that sets the character strings to be interpreted as missing values. Any string listed in na.strings is read in as NA. Therefore, the correct answer is na.strings.
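
The idea behind na.strings can be imitated in Python with the standard csv module. This is an analogy, not R's implementation: the function read_with_na and the sentinel value -999 are assumptions chosen for the example, and None plays the role that NA plays in R.

```python
import csv
import io


def read_with_na(text, na_strings=("NA",)):
    """Parse CSV text, replacing any field listed in na_strings with None."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {key: (None if value in na_strings else value) for key, value in row.items()}
        for row in reader
    ]


# Hypothetical file contents where both "NA" and "-999" mark missing readings.
data = "id,temp\n1,21.5\n2,NA\n3,-999\n"

rows = read_with_na(data, na_strings=("NA", "-999"))
print(rows)
```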

20. Which of the following functions is used for casting data frames?

Explanation

The function "dcast", from the reshape2 package, is used for casting data frames. It reshapes a data frame from long to wide format, giving a more organized and structured representation of the data. It takes a long-format data frame and a formula that specifies which variables identify the rows and which identify the columns, along with the variable whose values populate the resulting wide table. Therefore, "dcast" is the correct answer for this question.
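
The long-to-wide reshaping described above can be sketched in plain Python. This is an illustration of the idea only, not dcast: the function cast_wide is hypothetical, it leaves missing combinations out instead of filling them with NA, and it overwrites duplicate combinations instead of aggregating them.

```python
def cast_wide(long_rows, row_key, column_key, value_key):
    """Reshape long rows into one wide row per distinct row_key value."""
    wide = {}
    for row in long_rows:
        # Start a new wide record the first time an identifier is seen.
        record = wide.setdefault(row[row_key], {row_key: row[row_key]})
        record[str(row[column_key])] = row[value_key]
    return list(wide.values())


# Hypothetical long-format data: one row per city and year.
long_rows = [
    {"city": "Oslo", "year": 2019, "temp_c": 5.1},
    {"city": "Oslo", "year": 2020, "temp_c": 5.4},
    {"city": "Cairo", "year": 2019, "temp_c": 22.3},
]

wide_rows = cast_wide(long_rows, "city", "year", "temp_c")
print(wide_rows)
```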

21. Point out the correct statement.

Explanation

The correct statement is that raw data is the original source of data. Raw data refers to the unprocessed and unorganized data that is collected directly from the source. It has not undergone any manipulation or analysis. Preprocessed data, on the other hand, refers to the data that has been cleaned, transformed, and organized for further analysis. Therefore, raw data is the initial and untouched form of data that serves as the foundation for any data processing or analysis.

22. Which is the first AI programming language?

Explanation

LISP is considered the first AI programming language because it was specifically designed for the development of artificial intelligence applications. It was developed in the late 1950s and early 1960s by John McCarthy and his team at MIT. LISP stands for "LISt Processing" and it introduced many concepts that are still used in AI programming today, such as symbolic processing, recursion, and dynamic typing. Its focus on symbolic manipulation and its ability to represent and manipulate knowledge made it a popular choice for AI research and development.

23. Raw data in the real world is tidy and properly formatted.

Explanation

Raw data in the real world is often messy and unstructured, with inconsistencies, missing values, and formatting issues. It requires cleaning and organizing to make it tidy and properly formatted. Therefore, the statement that raw data in the real world is tidy and properly formatted is false.

24. Which of the following algorithms is an online search algorithm?

Explanation

Hill-climbing search is an online search algorithm because it keeps only the current state in memory and chooses its next move by evaluating the neighboring states, typically with a heuristic. It does not require the entire search space to be known in advance, which makes it suitable for real-time or dynamic environments where new information keeps arriving. In contrast, breadth-first and depth-first search, as usually formulated, are offline algorithms: they work out a complete path over a known state space before any action is taken.
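
A minimal Python sketch of hill climbing follows. It assumes a toy problem (integer states, a score that peaks at 7, neighbors one step left or right); the function names and the problem itself are invented for the illustration. Note that only the current state is kept between steps.

```python
def hill_climb(score, neighbors, start, max_steps=1000):
    """Move to the best-scoring neighbor until no neighbor improves the score."""
    current = start
    for _ in range(max_steps):
        best = max(neighbors(current), key=score, default=current)
        if score(best) <= score(current):
            return current  # local maximum: no neighbor is better
        current = best
    return current


def score(x):
    """Toy objective with a single peak at x = 7."""
    return -(x - 7) ** 2


def neighbors(x):
    """States reachable in one step from x."""
    return [x - 1, x + 1]


result = hill_climb(score, neighbors, start=0)
print(result)
```

On an objective with several peaks, the same procedure can stop at a local maximum that is not the best overall, which is the usual caveat with hill climbing.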

25. Which of the following terms refers to the rotational motion of a robot arm?

Explanation

Roll refers to the rotational motion of a robot arm. In robotics, roll is the movement of the arm around its longitudinal axis, similar to how a rolling pin moves. This motion allows the robot arm to rotate and reposition itself in a circular or rolling manner. Swivel refers to a horizontal rotation, axle refers to a shaft on which a wheel rotates, and retrograde refers to a backward or reverse motion.

26. Which of the following packages is used for reading Excel data?

Explanation

The correct answer is "xlsx". The xlsx package reads Excel data into R through functions such as read.xlsx().

27. What type of chart will you use to compare the performance of two employees in the year 2016?

Explanation

A line chart is the most suitable type of chart to compare the performance of two employees in the year 2016. This type of chart is effective for showing trends and changes over time. By plotting the performance data for each employee on a line chart, it becomes easy to compare their progress and identify any patterns or differences in their performance throughout the year. The line chart allows for a clear visualization of how each employee's performance fluctuates over time, making it an ideal choice for this comparison.

28. Which of the following is an important parameter of the read.table function?

Explanation

The correct answer is "all of the mentioned." All three parameters (file, header, and sep) are important in the read.table function. The "file" parameter specifies the file to be read, the "header" parameter indicates whether the first line contains the column names, and the "sep" parameter specifies the field separator used in the file. Only "file" is strictly required; "header" defaults to FALSE and "sep" defaults to whitespace, so both usually need to be set explicitly for files such as CSVs.
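
To show how the three parameters interact, here is a simplified Python stand-in for read.table. It is a sketch of the idea and not the R function: it returns a list of dictionaries instead of a data frame, does no type conversion or quote handling, and the V1, V2, ... names imitate R's default column names when there is no header.

```python
import io


def read_table(file, header=False, sep=None):
    """Read delimited text from a file-like object into a list of row dicts.

    sep=None splits on runs of whitespace, similar to read.table's default.
    """
    rows = [line.split(sep) for line in file.read().splitlines() if line.strip()]
    if not rows:
        return []
    if header:
        names, rows = rows[0], rows[1:]  # first line supplies the column names
    else:
        names = [f"V{i + 1}" for i in range(len(rows[0]))]
    return [dict(zip(names, row)) for row in rows]


# Hypothetical inputs: a CSV with a header, and whitespace-separated data without one.
csv_rows = read_table(io.StringIO("id,temp\n1,21.5\n2,19.0\n"), header=True, sep=",")
plain_rows = read_table(io.StringIO("1 21.5\n2 19.0\n"))
print(csv_rows)
print(plain_rows)
```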

29. Which of the following functions is used to compute quantiles of quantitative values?

Explanation

The correct answer is "quantile". The quantile function computes the quantiles of quantitative values: the cut points below which a given fraction of the data falls, such as the median (0.5) or the quartiles (0.25, 0.5, 0.75). By default, R's quantile() returns the minimum, the three quartiles, and the maximum. It is commonly used in data analysis and statistics to understand the distribution and spread of data.
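
For readers who want to experiment outside R, Python's standard statistics module offers a comparable function. The sketch below uses made-up scores and the "inclusive" method, which treats the data as containing its own minimum and maximum; results from other methods or tools can differ slightly.

```python
import statistics

# Hypothetical quantitative values.
scores = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# n=4 asks for quartiles: the three cut points that split the data into four parts.
quartiles = statistics.quantiles(scores, n=4, method="inclusive")
median = statistics.median(scores)

print(quartiles)
print(median)
```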

30. Which of the following functions is used for loading flat files?

Explanation

The function "read.table" is used for loading flat files. It is commonly used in R to read data from a tabular text file into a data frame. It does not detect the file layout on its own: the separator and header are set through arguments such as sep and header, and read.csv() is a wrapper around read.table with defaults suited to CSV files. "read.data" and "read.sheet" are not base R functions, and "none of the mentioned" is incorrect because "read.table" is indeed used for loading flat files.

31. Which of the following data is put into a formula to produce commonly accepted results?

Explanation

Processed data refers to information that has been manipulated, organized, or transformed in some way to produce meaningful results. This data has undergone a series of steps or operations to extract valuable insights or to make it suitable for analysis. By using processed data in a formula, we can generate commonly accepted results that are reliable and accurate. Therefore, processed data is the correct choice as it is the type of data that is typically used in formulas to produce desired outcomes.

32. The mutate function is used for casting as multi-dimensional arrays.

Explanation

The statement is false. The mutate function, found in R packages such as plyr and dplyr, adds new variables to a data frame or modifies existing ones; it does not cast data. Casting to multi-dimensional arrays is done with acast from the reshape2 package.

33. Which type of data is generated by a POS terminal in a busy supermarket each day?

Explanation

The correct answer is "Source" because a POS (Point of Sale) terminal in a busy supermarket generates raw data or the original data at the point of sale. This data includes information about the products sold, quantities, prices, and payment methods used. It is the starting point or the source of data that can be further processed, synchronized, or used for various analytical purposes.

34. Which of the following can be used to view all the tables in memory?

Explanation

The correct answer is "tables". In R, the tables() function from the data.table package lists all the data.table objects currently held in memory, along with details such as their row counts and columns, so the user can see which tables are available for use or manipulation.

35. Point out the correct statement.

Explanation

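The correct statement is "none of the mentioned." Neither the head function nor the tail function works on the characters of a string. In R, both return the first or last elements of an object such as a vector or data frame, so applied to a single string they simply return that string.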

36. What type of chart will you use to compare the sales performance of two products?

Explanation

A column chart is the most suitable type of chart to compare the performance of sales of two products. This chart displays the data in vertical columns, with each column representing a different product. The height of each column represents the sales performance, allowing for easy visual comparison between the two products. A line chart, on the other hand, is more suitable for showing trends over time, while a pie chart is used to represent proportions or percentages of a whole. Therefore, the correct answer is a column chart.

37. Which of the following functions is used to read data from web pages?

Explanation

The correct answer is "read.Lines", which corresponds to R's readLines() function (spelled without a dot). readLines() reads text line by line from a connection, such as one opened with url(), which makes it suitable for pulling the HTML source of a web page into R. The other options, "read.web" and "read.Line", are not valid R functions.

38. Strong Artificial Intelligence is __________

Explanation

Strong Artificial Intelligence refers to the concept of creating a computer system or program that possesses the same intellectual capabilities as a human being. It involves the development of technology that can understand, learn, and reason like a human, and can perform tasks that would typically require human intelligence. This includes abilities such as problem-solving, decision-making, natural language processing, and even emotions. Strong AI aims to replicate and embody human intellectual capabilities within a computer system, enabling it to exhibit intelligent behavior and perform tasks that would be considered intelligent if done by a human.

39. When a top-level function is entered, what does the LISP processor do?

Explanation

When a top-level function is entered, the LISP processor prints the result returned by the function. This means that when the function is executed, the output or result of the function is displayed or printed by the LISP processor. This allows the user to see the outcome of the function and understand the result of their code.

40. How do you check if “m” is a matrix data object in R?

Explanation

In R, the function is.matrix() checks whether an object is a matrix: is.matrix(m) returns TRUE if "m" is a matrix and FALSE otherwise. The quiz lists "And.matrix()" as the correct answer, but no such function exists in base R; the option is most likely a misprint of "is.matrix()".


Quiz Review Timeline (Updated): Oct 7, 2024

Our quizzes are rigorously reviewed, monitored and continuously updated by our expert board to maintain accuracy, relevance, and timeliness.

  • Current Version
  • Oct 07, 2024
    Quiz Edited by
    ProProfs Editorial Team
  • Feb 07, 2020
    Quiz Created by
    Isie