
Dataset Name: reddit_economy_posts
Group: social_media
Vendor: Reddit powered by CloudQuant
Data Starts at: 2021-01-01 00:00:00
Symbol Set: Meme Securities
Asset Class: Equity
Data Update Time(s): live
Data Update Frequency: intraday
The posts from Reddit's r/economy with multiple NLP sentiment scores and mapped to cash tags and stock trading symbols. This dataset is free of charge to licensed CloudQuant users. r/Economy is a mostly unmoderated forum for economy, business, politics, stocks, bonds, product releases, IPOs, advice, news, investment, videos, predictions, government, money, politics, debate, capitalism, current trends, and more.
Data Contained in this Dataset
Column | Type | Description |
---|---|---|
_seq | uint | Internal sequence number used to keep data rows in order |
timestamp | string | Timestamp of the Data - America/New York Time. |
muts | uint64 | Microseconds Unix Timestamp. An integer representation of a timestamp with microsecond precision that can be compared directly to other timestamps. |
symbol | string | Trading Symbol or Ticker |
approved_at_utc | string | timestamp of approval. null if nobody or you are not a mod |
subreddit | string | subreddit the post/comment belongs to |
selftext | string | Submission Text |
author_fullname | string | The comment author’s ID prepended with t2_. |
saved | bool | true if this post is saved by the logged in user |
mod_reason_title | string | The mod reason’s title if applicable. |
gilded | int | the number of times this comment received reddit gold |
clicked | bool | Bool - Clicked |
title | string | Title |
link_flair_richtext | string | String - Link flair Rich Text |
subreddit_name_prefixed | string | subreddit_name_prefixed The name of the subreddit the submission was posted on, prefixed with “r/”. |
hidden | bool | Hidden (bool) |
pwls | int | Reddit PWLS code |
link_flair_css_class | string | Flair applied to the reddit link |
downs | int | the number of downvotes. (includes own) |
thumbnail_height | int | Height of the thumbnail |
top_awarded_type | string | post/comment highest (most expensive) reward |
hide_score | bool | Bool - Hide Score |
name | string | Full name of the submission. |
quarantine | bool | Whether the submission was posted in a quarantined subreddit |
link_flair_text_color | string | The submission’s flair text color if applicable. |
upvote_ratio | double | Reddit Up Vote Ratio |
author_flair_background_color | string | The submission/comment author’s flair background color. |
subreddit_type | string | the subreddit's type - one of public, private, restricted, or in very special cases gold_restricted or archived |
ups | int | the number of upvotes. (includes own) |
total_awards_received | int | The number of awards on the submission |
media_embed | string | Media Embedded in Post or Comment |
thumbnail_width | int | The width of the submission’s thumbnail if applicable |
author_flair_template_id | string | The comment author’s flair template ID if applicable. |
is_original_content | bool | Whether the submission has been marked as original content |
user_reports | string | A list of the user reports on the submission |
secure_media | string | Secure media flag |
is_reddit_media_domain | bool | Whether the media has been uploaded to Reddit. |
is_meta | bool | Whether the submission is a meta post. |
category | string | The submission’s category. |
secure_media_embed | string | Secure media embedded in post or comment |
link_flair_text | string | The submission’s flair text |
can_mod_post | string | Whether the logged-in user can modify the post. |
score | int | the net-score of the comment |
approved_by | string | who approved this comment. null if nobody or you are not a mod |
author_premium | bool | bool - Author Premium |
thumbnail | string | A URL to the submission’s thumbnail if applicable |
edited | string | Whether or not the submission has been edited |
author_flair_css_class | string | the CSS class of the author's flair. subreddit specific |
author_flair_richtext | string | The comment author’s flair text if applicable |
gildings | string | The gild awards the submission has received. |
content_categories | string | The content categories assigned to the submission |
is_self | bool | Whether the submission is a self post. |
mod_note | string | Moderator notes added to the submission. |
created | string | the time of creation in local epoch-second format. ex: 1331042771.0 |
link_flair_type | string | The type of flair applied to the submission |
wls | int | Reddit WLS Code |
removed_by_category | string | removed by category |
banned_by | string | moderator that banned this post/comment |
author_flair_type | string | The type of flair used by the submission’s author. |
domain | string | The domain of the submission. |
allow_live_comments | bool | Whether live comments have been enabled on this submission. |
selftext_html | string | The submission text as HTML |
likes | string | how the logged-in user has voted on the link - True = upvoted, False = downvoted, null = no vote |
suggested_sort | string | The suggested sort method for comments |
banned_at_utc | string | The UTC timestamp at which the author was banned. |
view_count | string | The number of views on the submission. |
archived | bool | Whether the submission has been archived by Reddit. |
no_follow | bool | Bool - No Follow indicator |
is_crosspostable | bool | Whether the submission can be cross-posted to other subreddits |
pinned | bool | Whether the submission has been pinned on the subreddit. |
over_18 | bool | Whether the submission/comment has been marked NSFW. |
all_awardings | string | A list of awards added to the submission/comment |
awarders | string | List of users who gave this post/comment an award |
media_only | bool | Whether the submission only consists of media |
link_flair_template_id | string | The submission’s flair template ID if applicable |
can_gild | bool | Whether the logged-in user can gild the submission |
spoiler | bool | Whether the submission contains a spoiler. |
locked | bool | whether the link is locked (closed to new comments) or not. |
author_flair_text | string | the text of the author's flair. subreddit specific |
treatment_tags | string | Community content tags are tags that moderators add to their communities to let redditors know what kind of mature content is in that community. In the past, Reddit used a Not Safe for Work (NSFW) tag to distinguish communities and content most people wou |
visited | bool | Whether the logged-in user has visited the submission previously. |
removed_by | string | reddit id of who removed the content |
num_reports | string | how many times this comment has been reported, null if not a mod |
distinguished | string | to allow determining whether they have been distinguished by moderators/admins. null = not distinguished. moderator = the green [M]. admin = the red [A]. special = various other special distinguishes |
subreddit_id | string | subreddit_id: The subreddit’s ID prepended with t5_. |
mod_reason_by | string | The moderator who added the removal reason if applicable. |
removal_reason | string | A removal reason set by moderators if applicable. |
link_flair_background_color | string | The submission’s flair background color |
id | string | unique id for post/comment |
is_robot_indexable | bool | Whether the submission can be indexed by robots |
report_reasons | string | A list of report reasons on the submission. |
author | string | User information who wrote the post/comment |
discussion_type | string | reddit discussion type indicators |
num_comments | int | The number of comments on the submission. |
send_replies | bool | Whether the author of the submission will receive reply notifications |
whitelist_status | string | Submission whitelist status |
contest_mode | bool | Whether the moderators of the subreddit have enabled contest mode on the submission |
mod_reports | string | A list of moderator reports on the submission/comment |
author_patreon_flair | bool | The comment author’s Patreon flair if applicable. |
author_flair_text_color | string | The submission/comment author flair text color if applicable. |
permalink | string | The collection’s permalink (to view on the web). |
parent_whitelist_status | string | Parent Whitelist Status (string) |
stickied | bool | true if the post is set as the sticky in its subreddit. |
url | string | The full URL of the submission |
subreddit_subscribers | int | The number of subscribers to the submission’s subreddit |
created_utc | string | the time of creation in UTC epoch-second format. Note that neither of these ever have a non-zero fraction. |
num_crossposts | int | The number of times the submission has been cross-posted |
media | string | Post/Comment media |
is_video | bool | Whether the submission is a video post. |
comment_limit | int | comment limit |
comment_sort | string | comment sort |
symbols | string | Symbols found in title and body and parent of a post or a comment which includes cashtags |
cashtags | string | Cashtags found in title and body of a post or a comment as well as a comment's parents |
symbol_src | string | where the symbol was found. Either: title, body, or parent |
vader_body_sentiment_neg | double | Percentage (%) of the body text that is negative |
vader_body_sentiment_neu | double | Percentage (%) of the body text that is neutral |
vader_body_sentiment_pos | double | Percentage (%) of the body text that is positive |
vader_body_sentiment_compound | double | sum of all the body text sentiment ratings |
vader_title_sentiment_neg | double | Percentage (%) of the title text that is negative (always 0 for a comment) |
vader_title_sentiment_neu | double | Percentage (%) of the title text that is neutral (always 0 for a comment) |
vader_title_sentiment_pos | double | Percentage (%) of the body text that is positive (always 0 for a comment) |
vader_title_sentiment_compound | double | sum of all the body text sentiment ratings |
textblob_body_sentiment_polarity | double | The polarity score is a float within the range [-1.0, 1.0]. The subjectivity is a float within the range [0.0, 1.0] where 0.0 is very objective and 1.0 is very subjective. |
textblob_body_sentiment_subjectivity | double | The subjectivity is a float within the range [0.0, 1.0] where 0.0 is very objective and 1.0 is very subjective. |
textblob_title_sentiment_polarity | double | The polarity score is a float within the range [-1.0, 1.0]. The subjectivity is a float within the range [0.0, 1.0] where 0.0 is very objective and 1.0 is very subjective. |
textblob_title_sentiment_subjectivity | double | The subjectivity is a float within the range [0.0, 1.0] where 0.0 is very objective and 1.0 is very subjective. |