wikiidmon

Indonesian Wikipedia Monthly User Activity Dashboard

How to Use

Click the username to get the details


How It Works

Open the WmCloud Superset instance and connect to the idwiki_p schema on the Wikimedia S2 database. Then execute this query:

select actor_name, group_concat(distinct rc_title separator ';') as nya
from recentchanges join actor on rc_actor = actor_id
where rc_timestamp >= 20240101000000 and rc_timestamp <= 20240201000000
group by actor_name

P.S.: Adjust the two rc_timestamp bounds to change the time window.
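Since the time window is just a pair of MediaWiki-style YYYYMMDDHHMMSS timestamps, the bounds for any month can be computed instead of typed by hand. This is a minimal sketch; month_window is a hypothetical helper name, not part of this repository:

```python
from datetime import date


def month_window(year, month):
    """Return (start, end) timestamps in YYYYMMDDHHMMSS form for one month.

    `end` is midnight on the first day of the following month, matching
    the upper bound used in the query above.
    """
    start = date(year, month, 1)
    # Roll over to January of the next year after December.
    end = date(year + 1, 1, 1) if month == 12 else date(year, month + 1, 1)
    fmt = "%Y%m%d000000"
    return start.strftime(fmt), end.strftime(fmt)


start, end = month_window(2024, 1)
print(start, end)
```

For January 2024 this gives 20240101000000 and 20240201000000, the same values used in the query.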

Once the result is generated, click "copy to clipboard" and paste it into a file (all_act.txt). Then convert this file to JSON with this Python script:

import sys
import json

# Force UTF-8 output so non-ASCII usernames and titles print correctly.
sys.stdout = open(sys.stdout.fileno(), mode='w', encoding='utf8')
d = dict()

with open('all_act.txt', 'r', encoding='utf-8') as file:
    for line in file:
        # Each row is "<actor_name>\t<title1;title2;...>".
        # Strip the newline so it does not end up in the last title.
        nya = line.rstrip("\n").split("\t")
        if len(nya) < 2:
            continue  # skip blank or malformed rows
        d[nya[0]] = {"a": nya[1], "l": len(nya[1].split(";"))}

# Most active users first.
sorted_dict = dict(sorted(d.items(), key=lambda x: x[1]["l"], reverse=True))
json_data = json.dumps(sorted_dict)
print(json_data)

Finally, put this JSON into a.js.
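To make the shape of the resulting JSON concrete, here is a small sketch that runs the same transformation on two made-up rows instead of all_act.txt. The usernames and titles are invented for illustration, and build_activity is a hypothetical name:

```python
import json


def build_activity(lines):
    """Map each username to its edited titles ("a") and their count ("l")."""
    d = {}
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) < 2:
            continue  # skip blank or malformed rows
        name, titles = parts[0], parts[1]
        d[name] = {"a": titles, "l": len(titles.split(";"))}
    # Most active users first, as in the script above.
    return dict(sorted(d.items(), key=lambda x: x[1]["l"], reverse=True))


# Invented sample rows in the "username<TAB>title;title;..." layout.
sample = ["ExampleUserA\tJakarta\n", "ExampleUserB\tBandung;Surabaya;Medan\n"]
print(json.dumps(build_activity(sample), indent=2))
```

ExampleUserB is listed first because it touched three titles, while ExampleUserA touched one.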

Optional : User Collab Stat

Use this script to generate user collaboration statistics: for each page, the users who edited it and how many of them there are.

import sys
import json

# Force UTF-8 output so non-ASCII usernames and titles print correctly.
sys.stdout = open(sys.stdout.fileno(), mode='w', encoding='utf8')
dd = dict()

with open('all_act.txt', 'r', encoding='utf-8') as file:
    for line in file:
        # Strip the newline so it does not end up in the last title.
        nya = line.rstrip("\n").split("\t")
        if len(nya) < 2:
            continue  # skip blank or malformed rows
        for i in nya[1].split(";"):
            if i not in dd:
                dd[i] = {"a": set(), "l": 0}
            dd[i]["a"].add(nya[0])
            dd[i]["l"] = len(dd[i]["a"])

# Pages with the most distinct editors first.
sorted_dict = dict(sorted(dd.items(), key=lambda x: x[1]["l"], reverse=True))

# Sets are not JSON-serializable, so convert them to lists.
for nya in sorted_dict:
    sorted_dict[nya]["a"] = list(sorted_dict[nya]["a"])

json_data = json.dumps(sorted_dict)
print(json_data)

Put this JSON into b.js, then open pco.html to view it.
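The collaboration statistic is essentially an inverted index: instead of mapping a user to titles, it maps each title to the set of users who edited it. Below is a minimal sketch on invented rows; build_collab and the sample data are hypothetical, and the editor lists are sorted here only to make the output deterministic:

```python
def build_collab(lines):
    """Map each title to the users who edited it ("a") and their count ("l")."""
    editors = {}
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) < 2:
            continue  # skip blank or malformed rows
        name, titles = parts[0], parts[1]
        for title in titles.split(";"):
            editors.setdefault(title, set()).add(name)
    # Titles with the most distinct editors first.
    ranked = sorted(editors.items(), key=lambda x: len(x[1]), reverse=True)
    return {t: {"a": sorted(u), "l": len(u)} for t, u in ranked}


# Invented sample rows in the "username<TAB>title;title;..." layout.
sample = ["ExampleUserA\tJakarta;Bandung\n", "ExampleUserB\tJakarta\n"]
collab = build_collab(sample)
print(collab)
```

In this sample, Jakarta ranks first because two different users edited it.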

Optional : Monthly Pageview Statistics

Get the top 1000 most visited articles on id.wikipedia across all days of the specified month as JSON data:

https://wikimedia.org/api/rest_v1/metrics/pageviews/top/id.wikipedia/all-access/2024/01/all-days
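The URL above is made of the project, the access type, the year, the month, and all-days for a whole month. Here is a small sketch for building it for other months; top_pageviews_url is a hypothetical helper name, and its defaults simply mirror the URL above:

```python
def top_pageviews_url(year, month, project="id.wikipedia", access="all-access"):
    """Build the monthly top-articles URL for the given year and month."""
    base = "https://wikimedia.org/api/rest_v1/metrics/pageviews/top"
    # The month is zero-padded to two digits, as in the URL above.
    return f"{base}/{project}/{access}/{year}/{month:02d}/all-days"


print(top_pageviews_url(2024, 1))
```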

Copy the JSON data, paste it into this Python script in place of PASTE_THE_JSON_DATA_HERE, then run the script.

import sys

# Force UTF-8 output so non-ASCII titles print correctly.
sys.stdout = open(sys.stdout.fileno(), mode='w', encoding='utf8')
nya = PASTE_THE_JSON_DATA_HERE
rank = 1
print("== Pageview ==")
print("{| class='wikitable'")
print("!rank!!rc_title!!pageview")
print("|-")
for i in nya:
    # nya[i] is the list under the top-level key; its first element holds the articles.
    for j in nya[i][0]["articles"]:
        strs = "|" + str(rank) + "||" + "[[" + str(j["article"]) + "]]||" + str(j["views"])
        print(strs)
        print("|-")
        rank = rank + 1
print("|}")
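As an alternative to pasting the JSON into the script, the response can be parsed with json.loads (for example after saving it to a file). The sketch below assumes the articles sit under a top-level "items" list whose first element holds an "articles" list, which is the shape the script above relies on; to_wikitable and the sample payload are made up for illustration:

```python
import json


def to_wikitable(payload):
    """Render the top-articles payload as wikitable markup, ranked in order."""
    lines = [
        "== Pageview ==",
        "{| class='wikitable'",
        "!rank!!rc_title!!pageview",
        "|-",
    ]
    # Assumption: the articles live under payload["items"][0]["articles"].
    for rank, entry in enumerate(payload["items"][0]["articles"], start=1):
        lines.append(f"|{rank}||[[{entry['article']}]]||{entry['views']}")
        lines.append("|-")
    lines.append("|}")
    return "\n".join(lines)


# Invented sample with the same fields the script above reads.
sample = json.loads(
    '{"items": [{"articles": [{"article": "Halaman_Utama", "views": 100}]}]}'
)
print(to_wikitable(sample))
```

Parsing with json.loads also handles JSON literals such as true, false, and null, which are not valid when pasted directly into Python source.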

Compiled statistics

January 2024
- User
- User collab
- Pages
February 2024
- User
- User collab
- Pages
March 2024
- Pageread
- Pagewrite
- User
April 2024
- Pageread
- Pagewrite
- User
May 2024
- Pageread
- Pagewrite
- User
June 2024
- Pageread
- Pagewrite
- User
July 2024
- Pageread
- Pagewrite
- User
August 2024 : Write, Read, User

altilunium / wikiidmon
