Examples of using Big data sources in English and their translations into Tamil
{-}
-
Ecclesiastic
-
Colloquial
-
Computer
Big data sources tend to have ten characteristics;
Measurement is much less likely to change behavior in big data sources.
Big data sources can be loaded with junk and spam.
The most widely discussed feature of big data sources is that they are BIG. .
Big data sources tend to have a number of characteristics in common;
Table 2.1: Studies of unexpected events using always-on big data sources.
Measurement in big data sources is much less likely to change behavior.
Table 2.1: Studies of unexpected events using always-on big data sources.
In conclusion, the big data sources of today(and tomorrow) generally have ten characteristics.
Figure 3.12: Two ways to combine big data sources and survey data. .
Rosenbaum(2015) and Hernán and Robins(2016) offer other advice for discovering useful comparisons within big data sources.
Two approaches that especially benefit from big data sources are natural experiments and matching.
For machine learning approaches thatattempt to automatically discover natural experiments inside of big data sources, see Jensen et al.
Another way in which researchers can use big data sources in survey research is as a sampling frame for people with specific characteristics.
Population drift, usage drift,and system drift make it hard to use big data sources to study long-term trends.
Rather than thinking of big data sources as observing people in a natural setting, a more apt metaphor is observing people in a casino.
Finally, I will describe tworesearch templates for linking survey data to big data sources(section 3.6).
The remainder of the chapter begins by arguing that big data sources will not replace surveys and that the abundance of data increases- not decreases- the value of surveys(Section 3.2).
For machine learning approaches thatattempt to automatically discover natural experiments inside of big data sources, see Jensen et al.
Third, when survey data collection is combined with big data sources- something that I think will become increasingly common, as I will argue later in this chapter- additional ethical issues can arise.
Social scientists call this match construct validity andit is a major challenge with using big data sources for social research(Lazer 2015).
This chapter has three parts. First, in section 2.2,I describe big data sources in more detail and clarify a fundamental difference between them and the data that have typically been used for social research in the past. Then, in section 2.3, I describe ten common characteristics of big data sources. .
See Westen and Rosenthal(2003), and for more on construct validity in big data sources, Lazer(2015) and Chapter 2 of this book.
(Note that this same activity also appears in chapter 6.) This activity will giveyou practice in data wrangling and thinking about natural experiments in big data sources.
Based on the ideas in this chapter,I think that there are three main ways that big data sources will be most valuable for social research:.
These four examples all show that a powerful strategy in the future will be to enrich big data sources, which are not collected for research, with additional information that makes them more suitable for research(Groves 2011).
For more on construct validity, see Westen and Rosenthal(2003),and for more on construct validity in big data sources, Lazer(2015) and Chapter 2 of this book.
As I'm describing thesecharacteristics you will notice that they often arise because big data sources were not created for the purpose of research.
In enriched asking, survey data builds context around a big data source that contains some important measurements but lack others.
Population drift, usage drift,and system drift make it hard to use big data source to study long-term trends.
