Skip to main content

Early Findings from the Distroid Database for Issues 28 - 38

Analysis on news content in the Distroid Database

Published onJul 15, 2023
Early Findings from the Distroid Database for Issues 28 - 38
·

Overall

Early work on analyzing content to find some insights in the Distroid Database.

Content Types

The most frequent content type was news, with 282 entries. The least frequent was analytics dashboard with 3 entries.

You can find more information in the table and graphic below.

Content Type

Count

News

282

Analytics Dashboard

3

Glossary

4

Research

92

Tools

45

Events

36

Books

19

Count of content types in Distroid Database for Issues 28 - 38

Idea Machines

For Idea Machines (IM), Web3 was the most common IM. You can find more information in the table and graphic below.

Idea Machine

Count

Web3

22

Platform Cooperativism, Web3

13

Tools for Thought

11

Open Science

3

Web3,Open Science

3

Tools for Thought, Web3

2

Platform Cooperativism, Web3, Responsible Technology

1

Open Science, Platform Cooperativism

1

Climate Change

1

Tools for Thought, Open Science

1

Tools for Thought, Web3, Open Science

1

Effective Altruism

1

Platform Cooperativism

1

Count of idea machine categorizations in Distroid Database for Issues 28 - 38

For News Articles

I focused on news articles to start the analysis of the database for issues 28-38.

In total, there were 282 news items in the database. I reduced the news items to 274 after removing N/A or NaN values from the title column.

News Entity Counts

Entity

Count

Authors

117

Publications

70

Articles

274

At most, an author or publication appeared twice in the dataset.

Sentiment Analysis

I then used spaCy with NLTK to conduct sentiment analysis on titles and text. The sentiment was usually positive for an article’s text.

This might be reflective of the high-spirited, optimistic, and forward-looking nature of content written about the frontier.

The sentiment for an article’s title was usually neutral.

Pie chart of sentiment for text of news articles

Pie chart of sentiment for titles of news articles

Visualization of Authors, Articles, and Publications

I created a graph visualization of the relationship between authors, publications, and news articles.

I still need to work on my visualization skills (especially with graphs). So stay tuned for better visualizations in the next update.

Visualization of relations between authors, news articles, and publications

I also created an interactive graph visualization with pyvis.

The color scheme for the interactive pyvis visualization is:

  • red = Author,

  • purple = Publication, and

  • blue = Article.

You can find the visualization on GitHub as an HTML file. Load the HTML file on your browser and you can interact with the graph visualization.

Wordcloud Visualizations

I also made wordcloud visualizations for news titles, text, keywords, and summaries.

Wordcloud of news article text

Wordcloud of news article titles

Wordcloud of news article summaries.

Comments
0
comment
No comments here
Why not start the discussion?