Works of Osamu Dazai (太宰治) – Software engineer tech blog

The Aozora Bunko (青空文庫) page lists each work as a simple li element:

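校長三代 (old-form kanji, historical kana; work ID: 43313)

故郷 (new-form kanji, modern kana; work ID: 1585)

国技館 (old-form kanji, historical kana; work ID: 52461)

心の王者 (new-form kanji, modern kana; work ID: 18346)

乞食学生 (new-form kanji, modern kana; work ID: 285)

五所川原 (new-form kanji, modern kana; work ID: 45688)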

Because the markup is that simple, the Python code is simple as well.

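from bs4 import BeautifulSoup
import urllib.request as req

# Aozora Bunko index page for the author (the works of Osamu Dazai)
url = "https://www.aozora.gr.jp/index_pages/person35.html"
with req.urlopen(url) as res:
    soup = BeautifulSoup(res, "html.parser")

# Each work is an li element directly under an ol element
li_list = soup.select("ol > li")
for li in li_list:
    a = li.a
    # Skip list items that contain no link
    if a is not None:
        name = a.string
        href = a.get("href")
        print(name, ">", href)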
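The script above prints only the link text and the href. As the sample entries show, each li element also carries the orthography and the work ID in full-width parentheses. Below is a minimal sketch of pulling those out with a regular expression. The helper name parse_entry and the pattern ENTRY_RE are hypothetical, and the sketch assumes that the text of every li element (for example from li.get_text()) follows the same format as the sample entries, which has not been checked against the whole page.

```python
import re

# Pattern for the text of one entry, e.g. "故郷　（新字新仮名、作品ID：1585）".
# Assumption: every entry follows the format of the sample entries in this post.
ENTRY_RE = re.compile(
    r"^(?P<title>.+?)\s*（(?P<style>[^、（）]+)、作品ID：(?P<work_id>\d+)）\s*$"
)


def parse_entry(text):
    """Split an entry's text into title, orthography, and work ID.

    Hypothetical helper; returns None when the text does not match
    the assumed format.
    """
    m = ENTRY_RE.match(text.strip())
    if m is None:
        return None
    return {
        "title": m.group("title"),
        "style": m.group("style"),  # e.g. 新字新仮名 or 旧字旧仮名
        "work_id": int(m.group("work_id")),
    }


print(parse_entry("故郷　（新字新仮名、作品ID：1585）　"))
```

Inside the loop, something like parse_entry(li.get_text()) could then be used in place of the plain print, with entries that return None skipped or logged for inspection.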