Code Monkey home page Code Monkey logo

poetry's Introduction

Poetry

非常全的古诗词数据,收录了从先秦到现代的共计 85 万余首古诗词。

统计信息

朝代 诗词数 作者数
287114 9446
236957 4439
90089 8872
49195 2736
37375 1209
近现代 28419 790
当代 28219 177
明末清初 17700 176
元末明初 15736 79
清末民国初 15367 99
清末近现代初 12464 48
宋末元初 12058 41
南北朝 4586 434
近现代末当代初 3426 23
魏晋 3020 251
金末元初 3019 17
2741 253
民国末当代初 1948 9
1170 84
唐末宋初 1118 44
先秦 570 8
隋末唐初 472 40
363 83
宋末金初 234 9
22 7
2 2
魏晋末南北朝初 1 1
总和 853385 29377

数据说明

古诗词数据按朝代存储在多个 CSV 文件中,以避免单个文件过大。有 题目朝代作者内容 四个字段。

古诗词中有一些生僻字,属于 utf8mb4 字符,在许多设备中无法显示,使用 ? 替代。

导入数据库

为方便导入,将多个 CSV 文件合并成一个。这通过执行如下命令实现:

python scripts/merge.py

该命令将在当前目录下生成 poetry.csv 文件。

MySQL 8

创建数据库:

CREATE DATABASE poetry CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;

创建数据表:

use poetry;
CREATE TABLE `poetry` (
  `id` int(11) NOT NULL AUTO_INCREMENT,
  `title` char(200) DEFAULT NULL,
  `dynasty` char(50) DEFAULT NULL,
  `author` char(100) DEFAULT NULL,
  `content` text,
  PRIMARY KEY (`id`)
);

查看 secure_file_priv 设置:

SHOW variables like '%secure_file_priv%';

结果类似于:

+------------------+------------------------------------------------+
| Variable_name    | Value                                          |
+------------------+------------------------------------------------+
| secure_file_priv | C:\ProgramData\MySQL\MySQL Server 8.0\Uploads\ |
+------------------+------------------------------------------------+
1 row in set, 1 warning (0.0014 sec)

该目录可能因环境不同而不同。若 secure_file_privValue 为空,请自行搜索如何设置。

poetry.csv 文件复制到 secure_file_priv 目录中,Windows 用户可参考如下命令:

copy poetry.csv "C:\ProgramData\MySQL\MySQL Server 8.0\Uploads"

从 CSV 文件中导入数据:

LOAD DATA INFILE 'C:\\ProgramData\\MySQL\\MySQL Server 8.0\\Uploads\\poetry.csv'
INTO TABLE `poetry`
FIELDS TERMINATED BY ','
ENCLOSED BY '"'
LINES TERMINATED BY '\r\n' (title, dynasty, author, content);

License

MIT 许可证。

poetry's People

Contributors

werneror avatar zhaokuohaha avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

poetry's Issues

部分csv数据错误

清_2.csv
360行左右,《重赋》
466行左右,《台南盐政归台道督办而积弊甚深皆由委员少廉洁者殊可慨也》
清末民国初.csv
8023行左右,《次林十暮春瑞轩养病即事韵》
8124行左右,《补祝叔臧先生四十双寿》
(github 萌新,想共同维护这个库

请问易代作者是怎么区分的

您好,请问下您的数据中,对于易代作者是怎么区分的呢?比如元末明初、明末清初的诗人,划分依据是什么呢~

小程序调用

请问一下您https://api.nextrt.com/V1/Gushi/{Type}/{Value} 后面的Type和Value怎么填啊,我微信小程序成功调用到您的API了,就是渲染不出来具体的内容,想调用特定的古诗的api,随机的不好渲染啊,Type和Value上随便填两个数字每次刷新页面调到的都是不同的古诗。

朝代更迭的时间段

辛苦了。请问该古诗数据集中,“宋末元初”、“明末清初”等一系列朝代更迭的时间段是如何设计的呢?有没有什么相应根据?感谢!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.