Github address for the project: Poetry

Ancient Poetry database

This ancient poetry database was extracted from Gushiwen.com in 2017. Although the total amount of data is not as large as that of Gushiwen.com, the poetry data has been cleaned, sorted and formatted to facilitate research or creativity for those who need it. The ancient poetry database currently contains 73,281 ancient poems and detailed data of 3,156 poets, and has been applied to two apps: The Android client of Cijing and the wechat mini program of Cijing.

The data shows that

1. Gushiwen folder

Below this folder is the original content that crawler crawls, among which there are ancient poems in view folder, poets in author folder and some famous poems and sentences in JU folder.

2. The image folder

Below this folder is the head picture of the person, image_XXx. JPG indicates that this is the URL of the head picture of the poet whose number is XXX.

3. The data folder

This folder is the latest collated data at present, among which poetry subdirectory contains ancient poems, POET subdirectory contains poets, and AIO (All in One) files are the result of collating poet and poetry data into one file.

4. Other folders

The contents of other folders are old versions of collated data reserved for backward compatibility and can be ignored.

The data structure

1. Poetry data

Id is the index of the poem on Gushiwen.com, name is the name of the poem, content is the content of the poem, dynasty is the dynasty of the poem, star is the number of likes of the poem when the data is extracted, poet is the poet’s information. Fanyi is the annotation and interpretation of the poetry data, Shangxi is the appreciation of the poetry, about is about the other content of the poetry, such as the creation background of the poet, etc. In the ancient poetry website, all the content that is not the interpretation of the poetry and the appreciation of the poetry will be summed up in the content of the poetry.

{
  "about": "Background \n\n &EMsp;   Tang Xuanzong Tianbao early years, Li Bai XXX"."content": "Don't you see, the water of the Yellow River comes from heaven and flows into the sea and never returns. xxx"."dynasty": "Tang dynasty"."fanyi": Can't you see the waters of the Yellow River rushing in from the sky, the waves rolling straight to the East China Sea and never coming back again? xxx"."id": 7722,
  "name": "Wine will come in."."poet": {
    "desc": Li Bai (701 -- 762), courtesy name Taibai, also known by his literary name Qinglian Lay Shi, was a romantic poet of the Tang Dynasty. xxx"."id": 247,
    "image": "https://raw.githubusercontent.com/hujiaweibujidao/poetry/master/image/image_247.jpg"."name": "Li bai"."star": 0}."shangxi": "Appreciation \ n \ n    Will enter wine, before the Tang Dynasty, a topic of yuefu songs, mostly singing about drinking and singing. xxx"."star": 32615,
  "tags": [
    "乐府"."Three Hundred Tang Poems"."Poems"."Lyric"."Philosophy"."Drinking"]}Copy the code

2. Poet data

Name is the poet’s name, desc is the poet’s brief introduction, content is the detailed introduction of the poet, dynasty is the poet’s dynasty, and star is the number of likes of the poet when the data is crawled.

{
  "content": "Anecdotes \n\n origin of name \ NXXX"."desc": Li Bai (701 -- 762), courtesy name Taibai, also known by his literary name Qinglian Lay Shi, was a romantic poet of the Tang Dynasty. xxx"."dynasty": "Tang dynasty"."id": 247,
  "image": "https://raw.githubusercontent.com/hujiaweibujidao/poetry/master/image/image_247.jpg"."name": "Li bai"."star": 4895}Copy the code

LICENSE

GNU General Public License version 3

Copyright (c) 2018 Javayhu. All rights reserved.