
Dataset Name: gwi_data
Group: altdata
Vendor: GWI
Data Starts at: 2012-07-31 00:00:00
Asset Class: Equity
Data Update Time(s): 12:05 AM EST
Data Update Frequency: quarter
GlobalWebIndex (GWI) Internet Behaviours Data. GlobalWebIndex (GWI) Tickerized Core Labels
GlobalWebIndex (GWI) Tickerized Core Labels is an enhanced version of the GWI Core Labels files that links a brand or company name included in the GWI survey to its related security identifier. GWI Core (Global)
GWI Core is a harmonized, global survey on how internet behaviors are evolving.
Starting with an annual study in the first year, it became bi-annual throughout 2010, 2011 and 2012 before being run each quarter from the start of 2013 onwards.
At the end of 2021, we will have run 44 separate waves of research. Over time, our country coverage has increased. Beginning with 16 markets in 2009, we were running our harmonized study in 36 countries by the end of 2016. We incorporated 6 new markets in 2017 and 5 more between 2018 and 2020, bringing our total to 47.
In 2017, we launched a mobile version of our Core study containing a selection of questions from the full Core study in a format optimised for mobile. This has helped us expand our Core study globally, especially in Africa, where most internet users are mobile-first (and where significant minorities might be mobile-only). The mobile survey is run alongside the full version on other devices, with the results of both being merged together and published as one dataset. In Ghana, Kenya, Morocco and Nigeria, we only run the mobile version of the survey.
We work with a number of industry-leading panel providers in each of 47 countries to interview a diverse range of high-quality respondents
We represent 16-64 year-olds We don't interview anyone aged 15 or under for our Core study because parental consent would be needed. In addition, questions would need to be asked in a way which was suitable for younger respondents. However, we do run a separate study, GWI Kids, which looks specifically at internet users aged 8-15.
We don't interview people aged 65+ because it's very challenging to find enough people in this age bracket in some markets - particularly emerging markets where internet penetration rates are low and online populations are therefore dominated by younger age groups. As we want our Core study to be representative and harmonized across markets, we therefore set an upper age limit.
We represent online populations Internet penetration rates vary significantly around the world, from highs of 90%+ in parts of Europe and North America to lows of around 40% in parts of Africa and Asia. Because of this, the demographic composition of the online population may look very different from one market to the next; this will be reflected in our sample structure.
To ensure that our Core sample is representative of each country's online population aged 16-64, we set quotas on age, gender and education. To set these, we conduct thorough research across a range of international and national sources.
The Matching Process
Brands or companies used in the matching process are identified in either the Question or Answers (sub answers are currently not covered). Private brands/companies are not linked to an identifier.
Equities are matched to GWI data using a proprietary methodology that includes both automated and human processes. Matches are also manually reviewed for quality.
Entity, security, and brand information used in the process is provided by FactSet Research Systems.
All reference data is updated regularly to incorporate changes in security or entity information prior to the mapping process.
Tickers are linked to questions on a Point-in-Time basis. The observation (valid) date used in the PIT process is the last day of the survey period e.g. the Q4 survey will include securities that were active on December 31st.
Tickerized GWI data currently only includes the security's exchange symbol (ticker) and the MIC code of the exchange.
Symbols are only provided for an entity's primary issuance and listing as determined by FactSet.
Data Contained in this Dataset
Column | Type | Description |
---|---|---|
_seq | uint | Internal sequence number used to keep data rows in order |
timestamp | string | Timestamp of the Data - America/New York Time. |
muts | uint64 | Microseconds Unix Timestamp. An integer representation of a timestamp with microsecond precision that can be compared directly to other timestamps. |
symbol | string | Trading Symbol or Ticker |
Hash | string | Hash |
Response | int | Response |