Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
人
人工智能系统实战第三期
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
liyinkai
人工智能系统实战第三期
Commits
2fb82a54
Commit
2fb82a54
authored
Oct 07, 2023
by
前钰
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Upload New File
parent
3030d8ea
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
77 additions
and
0 deletions
+77
-0
question.ipynb
人工智能系统实战第三期/实战代码/AAAI会议论文聚类分析/question.ipynb
+77
-0
No files found.
人工智能系统实战第三期/实战代码/AAAI会议论文聚类分析/question.ipynb
0 → 100644
View file @
2fb82a54
{
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## 任务介绍\n",
"\n",
"每年国际上召开的大大小小学术会议不计其数,发表了非常多的论文。在计算机领域的一些大型学术会议上,一次就可以发表涉及各个方向的几百篇论文。按论文的主题、内容进行聚类,有助于人们高效地查找和获得所需要的论文。本案例数据来源于AAAI 2014上发表的约400篇文章,由[UCI](https://archive.ics.uci.edu/ml/datasets/AAAI+2014+Accepted+Papers!)公开提供,提供包括标题、作者、关键词、摘要在内的信息,希望大家能根据这些信息,合理地构造特征向量来表示这些论文,并设计实现或调用聚类算法对论文进行聚类。最后也可以对聚类结果进行观察,看每一类都是什么样的论文,是否有一些主题。\n",
" \n",
"\n",
"注:数据中的group和topic也不能完全算是标签,因为\n",
"1. 有些文章作者投稿时可能会选择某个group/topic但实际和另外group/topic也相关甚至更相关;\n",
"2. 一篇文章可能有多个group和topic,作为标签会出现有的文章同属多个类别,这里暂不考虑这样的聚类; "
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## 问题一\n",
"\n",
"将文本转化为向量,实现或调用无监督聚类算法,对论文聚类,例如10类(可使用已有工具包例如sklearn);"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## 问题二\n",
"\n",
"并将聚类结果可视化成散点图"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## 问题三\n",
"\n",
"高维向量的降维旨在去除一些高相关性的特征维度,尝试使用PCA进行降维后,再聚类"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## 问题四\n",
"\n",
"尝试使用一些评价指标,对聚类后的结果进行评价"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.4"
},
"orig_nbformat": 4
},
"nbformat": 4,
"nbformat_minor": 2
}
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment