Introduction to Big Data Techniques

28 questions
Question 1 of 28

Which statement about fintech is least likely accurate in this module?

Question 2 of 28

Assertion (A): Fintech is directly relevant to quantitative investment analysis partly because massive alternative datasets can be integrated into the investment decision-making process.
Reason (R): Non-traditional sources such as social media and sensor networks can help generate alpha and reduce losses when analyzed alongside other data.

Question 3 of 28

Consider the following:
I. Volume refers to datasets containing many millions or even billions of data points.
II. Variety refers to the use of structured, semistructured, and unstructured formats.
III. Velocity refers to the credibility and reliability of data sources when predictions are made.
How many of the above statements are most accurate?

Question 4 of 28

Which of the following is least likely a database or language pairing described in the module?

Question 5 of 28

NLP is most likely a useful application in investment management because it can:

Question 6 of 28

Which of the following is least likely to describe how fintech changed financial services according to the module?

Question 7 of 28

Consider the following:
I. In supervised learning, inputs and outputs are labeled for the algorithm.
II. In unsupervised learning, the algorithm is given data and seeks to describe the data and their structure.
III. Deep learning can use supervised or unsupervised machine learning approaches.
How many of the above statements are most accurate?

Question 8 of 28

Assertion (A): When Big Data is used for inference or prediction, veracity becomes an important consideration.
Reason (R): Veracity refers to the speed and frequency with which data are recorded and transmitted.

Question 9 of 28

Which dataset is least likely to be classified as unstructured in the module?

Question 10 of 28

An analyst uses AI to sort through company filings, annual reports, and earnings calls to determine which information is most important. This use of fintech is most likely justified in the module because AI may:

Question 11 of 28

Massive alternative datasets from sources such as social media and sensor networks are most likely relevant to a portfolio manager because they can:

Question 12 of 28

Assertion (A): NLP can detect shifts in analyst sentiment before a recommendation change.
Reason (R): NLP includes tasks such as translation, speech recognition, text mining, sentiment analysis, and topic analysis.

Question 13 of 28

Assertion (A): In this module, fintech excludes innovation in the design and delivery of financial services and refers only to the business sector of firms that build financial technology.
Reason (R): In common usage, fintech can also refer to companies involved in developing new technologies and to the business sector that includes such companies.

Question 14 of 28

Consider the following:
I. Early forms of fintech included data processing and the automation of routine tasks.
II. Systems executing decisions according to specified rules and instructions followed those early forms.
III. In this module, fintech refers only to companies that develop new technologies and not to the innovation itself.
How many of the above statements are most accurate?

Question 15 of 28

Consider the following:
I. Text analytics includes automated information retrieval from different, unrelated sources to aid decision making.
II. NLP can detect shifts in analyst sentiment before a recommendation change by evaluating nuanced commentary.
III. NLP can analyze policy maker communications to identify trending or waning topics of interest.
How many of the above statements are most accurate?

Question 16 of 28

Assertion (A): Deep learning can use supervised or unsupervised machine learning approaches.
Reason (R): Neural networks have existed since 1958 and have been used in applications such as forecasting and pattern recognition.

Question 17 of 28

Assertion (A): Low-latency systems are essential for automated trading applications that make decisions based on real-time prices and market events.
Reason (R): Low-latency systems operate on networks that communicate high volumes of data with minimal delay.

Question 18 of 28

Assertion (A): NLP can be used in compliance functions to review employee voice and electronic communications for adherence to company or regulatory policy.
Reason (R): NLP lies outside text analytics and focuses only on structured numerical data.

Question 19 of 28

For automated trading applications that act on real-time prices and market events, the most likely data-capture requirement is:

Question 20 of 28

Which statement about visualization techniques is most likely accurate?

Question 21 of 28

Which of the following is most likely an area of fintech development that the module identifies as directly relevant to quantitative investment analysis?

Question 22 of 28

Which of the following is most likely an unsupervised learning application described in the module?

Question 23 of 28

When Big Data is used for inference or prediction, investment professionals are most likely concerned with veracity because it addresses:

Question 24 of 28

Which of the following is least likely part of data curation?

Question 25 of 28

Consider the following:
I. Traditional data include trade prices and volumes generated in financial markets.
II. Alternative data arise only from governments and companies.
III. The Internet of Things refers to corporate exhaust produced by supply chains.
How many of the above statements are most accurate?

Question 26 of 28

An ML model fits the training data extremely precisely, treats noise as true parameters, and predicts poorly on a new dataset. The model is most likely:

Question 27 of 28

Consider the following:
I. Low-latency systems are essential for automated trading applications that act on real-time prices and market events.
II. Search refers to how data move from the source or storage location to the analytical tool.
III. Data visualization for non-traditional unstructured data can include heat maps, tree diagrams, and network graphs.
How many of the above statements are most accurate?

Question 28 of 28

Consider the following:
I. Growing amounts of traditional and alternative data can be integrated into a portfolio manager's investment decision-making process.
II. AI-based analytical tools may be better suited than traditional methods to identify complex, non-linear relationships in extremely large datasets.
III. The module identifies analysis of large datasets and advanced analytical tools as areas of fintech development directly relevant to quantitative analysis.
How many of the above statements are most accurate?