Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

综合使用KNIME、Python绘制分子相似性矩阵热图 #2

Open
xiyanglee opened this issue Aug 27, 2019 · 0 comments
Open

综合使用KNIME、Python绘制分子相似性矩阵热图 #2

xiyanglee opened this issue Aug 27, 2019 · 0 comments

Comments

@xiyanglee
Copy link
Owner

xiyanglee commented Aug 27, 2019

综合使用KNIME、Python绘制分子相似性矩阵热图

1. 目的

  计算同类分子间或不同类分子的相似性(矩阵)并提取相似性较高的结果绘制热图。

2. 数据及分析平台准备

2.1 分析数据

  本文以DrugBank数据库的Nutraceutical分子及Illicit分子数据(*.sdf文件)为例,计算两类分子的相似性矩阵及Illicit分子间的相似性矩阵,并绘制热图。这些数据可在DrugBank数据库注册审核后下载获得。

2.2 分析平台

  KNIME:用于计算分子相似性矩阵。KNIME是一个开源数据挖掘平台,以Node依次连接成Workflow为数据处理流程,类似于图形化编程语言;本文所用的Workflow为 KNIME workflow to calculate Tanimoto similarities based upon molecular fingerprints

  Seaborn(Python包):用于绘制热图,Anaconda下的安装方法为conda install seaborn;本文编程在Jupyter Notebook平台实现;

  其他数据处理及绘图所需Python包:pandasnumpymatplotlib

3. 操作方法

3.1 计算分子相似性矩阵

  在KNIME中导入所需Workflow,

  • Note:注意此流程中的一些节点(Nodes)需要下载RDKit及CDK的包才能正常使用,打开Workflow时会有下载提示;

图1: KNIME工作流程

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant