Примери коришћења Catalist на Енглеском и њихови преводи на Српски
{-}
-
Colloquial
-
Ecclesiastic
-
Computer
-
Latin
-
Cyrillic
Of the 40 companies that competed, Catalist came in second place.
In addition to this matching competition, Ansolabehere andHersh created their own matching challenge for Catalist.
To create the master datafile, Catalist combines and harmonizes information from many different sources.
For these four demographic variables,the researchers found much higher levels of agreement between survey report and data in the Catalist master file than for voting.
First, Catalist participated in a matching competition that was run by an independent, third-party: the MITRE Corporation.
Rather, the researchers were in a situation where the Catalist data file had some unknown, and perhaps unknowable, amount of error.
But, Catalist had to do the linkage using imperfect identifiers, in this case name, gender, birth year, and home address.
Obtained official listing on SESDAQ(now known as Catalist) on 27 September 1999 after the successful initial public offering of 16 million shares.
In fact, they were particularly interested in comparing reported voting behavior in surveys with validated voting behavior(i.e., the information in the Catalist database).
Thus, the Catalist master data file appears to have high quality information for traits other than voting, suggesting that it is not of poor overall quality.
It is important to note that although in this case researchers were encouraged by the quality of data from Catalist, other evaluations of commercial vendors have been less enthusiastic.
Fortunately, Catalist's reports were close to the withheld values, indicating that Catalist could match partial voter records onto their master data file.
They provided some of these records with some of their fields redacted to Catalist and then compared Catalist's reports of these fields to their actual values.
Although Catalist was willing to discuss its data processing and provide some of its raw data, it was simply impossible for researchers to review the entire Catalist data pipeline.
The second general lesson is that though aggregated, commercial data sources,such as the data from Catalist, should not be considered“ground truth,” in some cases, they can be useful.
In addition to this information, Ansolabehere and Hersh were particularly interested in comparing reported voting behavior to validated voting behavior(i.e., the information in the Catalist database).
Like many of the digital trace sources in Chapter 2, the Catalist master file did not include much of the demographic, attitudinal, and behavioral information that Ansolabehere and Hersh needed.
It is important to note that although Ansolabehere andHersh were encouraged by the quality of data from Catalist, other evaluations of commercial vendors have been less enthusiastic. Pasek et al.
Then they gave their data to Catalist, and Catalist gave them back a merged data file that included validated voting behavior(from Catalist), the self-reported voting behavior(from CCES) and the demographics and attitudes of respondents(from CCES)(figure 3.13).
These two challenges, one by a third-party and one by Ansolabehere and Hersh,give us more confidence in the Catalist matching algorithms, even though we cannot review their exact implementation ourselves.
Next, the researchers gave this data to Catalist, and Catalist gave the researchers back a merged data file that included validated voting behavior(from Catalist), the self-reported voting behavior(from CCES) and the demographics and attitudes of respondents(from CCES).
Like many of the big data sources in chapter 2, the Catalist master file did not include much of the demographic, attitudinal, and behavioral information that Ansolabehere and Hersh needed.
Second, in part using data from Catalist, Ansolabehere and Hersh developed three different measures of quality of county voting records, and they found that the estimated rate of over-reporting of voting was essentially unrelated to any of these data quality measures, a finding that suggest that the high rates of over-reporting are not being driven by counties with unusually low data quality.
However, both data processing and matching are critical to the continued existence of Catalist as a company so it can invest resources in solving these problems, often at a scale that no individual academic researcher or group of researchers can match.
First, in addition to comparing self-reported voting to voting in the Catalist master file, the researchers also compared self-reported party, race, voter registration status(e.g., registered or not registered) and voting method(e.g., in person, absentee ballot, etc.) to those values found in the Catalist databases.
However, both data processing andmatching are critical to the continued existence of Catalist as a company so it can invest resources in solving these problems, often at a scale that no individual academic researcher or group of researchers can match.