跳到主要内容

Hacker News

Hacker News(有时缩写为 HN)是一个专注于计算机科学和创业的社交新闻网站。它由投资基金和初创企业孵化器 Y Combinator 运营。一般来说,可以提交的内容被定义为“任何能够满足人们求知欲的东西”。

本笔记本介绍了如何从 Hacker News 中提取页面数据和评论

from langchain_community.document_loaders import HNLoader
API 参考:HNLoader
loader = HNLoader("https://news.ycombinator.com/item?id=34817881")
data = loader.load()
data[0].page_content[:300]
"delta_p_delta_x 73 days ago  \n             | next [–] \n\nAstrophysical and cosmological simulations are often insightful. They're also very cross-disciplinary; besides the obvious astrophysics, there's networking and sysadmin, parallel computing and algorithm theory (so that the simulation programs a"
data[0].metadata
{'source': 'https://news.ycombinator.com/item?id=34817881',
'title': 'What Lights the Universe’s Standard Candles?'}

此页面是否对您有帮助?