Tree:

ref:https://github.com/microgenomics/tutorials/blob/master/pangenome.md

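Generating a phylogenetic tree with FastTree

FastTree is a fast tool for inferring approximately-maximum-likelihood trees from aligned sequences. It is used here to build the phylogenetic tree from the core gene alignment.

First, make sure FastTree is installed. On Ubuntu, for example, it can be installed with: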

sudo apt-get install fasttree

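Then run FastTree on the alignment to build the tree (-nt marks the input as nucleotide sequences, -gtr selects the GTR model). If the apt package installed the executable under the lowercase name fasttree, use that name instead: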

nohup FastTree -nt -gtr core_gene_alignment.aln > core_gene_tree.nwk 2> fasttree_error.log &

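The tree is written to core_gene_tree.nwk in Newick format. FastTree prints its progress messages to stderr, so they end up in fasttree_error.log together with any errors.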
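Because the job runs in the background under nohup, it can be worth checking that the finished tree contains the expected number of genomes. The following is a minimal sketch rather than a full Newick parser: the helper name newick_leaf_names is hypothetical, and it assumes unquoted leaf labels without spaces, commas, colons or parentheses.

```python
import re
from typing import List


def newick_leaf_names(newick: str) -> List[str]:
    """Return the leaf labels of a Newick string (unquoted labels only)."""
    # A leaf label directly follows "(" or ",". Labels that follow ")" are
    # internal-node labels (FastTree stores support values there), and
    # branch lengths follow ":", so neither is picked up by this pattern.
    return re.findall(r"[(,]\s*([^(),:;\s]+)", newick)
```

For example, len(newick_leaf_names(open("core_gene_tree.nwk").read())) should equal the number of genomes in the alignment.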

roary_plots.py

# download roary_plots.py
# <https://github.com/sanger-pathogens/Roary/tree/master/contrib/roary_plots>
scp -P 20328 -r /Users/mszeng/Desktop/work/other/泛基因组/roary_plots.py [email protected]:~/pangenome/data
chmod +x roary_plots.py
# install dependencies (this step raised an error)
conda install matplotlib pandas seaborn
## seaborn failed to install: ModuleNotFoundError: No module named '_sysconfigdata_x86_64_conda_cos7_linux_gnu'
conda install -c conda-forge --force-reinstall seaborn # force-reinstall seaborn
# locate the missing module (pattern quoted so the shell does not expand it)
sudo find ~ -name '_sysconfigdata_x86_64*'
# the pg1 environment lacks the _sysconfigdata_x86_64_conda_cos7_linux_gnu.py file named in the error, so copy one in
# copy it into the python3.7 directory that the error message points to
cp /home/zms/anaconda3/pkgs/python-3.7.16-h7a1cb2a_0/lib/python3.7/_sysconfigdata_x86_64_conda_cos7_linux_gnu.py /home/zms/anaconda3/envs/pg1/lib/python3.7/

#run
python roary_plots.py core_gene_tree.nwk gene_presence_absence.csv
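roary_plots.py appears to select columns of gene_presence_absence.csv by tree tip name, so a name mismatch between the two files is a likely cause of a KeyError at this step. The sketch below compares the two sets of names before plotting. It is an illustration, not part of Roary: the function names are hypothetical, the Newick handling covers unquoted labels only, and the count of 14 leading metadata columns is an assumption based on recent Roary output that should be checked against your own CSV header.

```python
import csv
import re

# Assumption: the first 14 columns of Roary's CSV are gene metadata
# (Gene, Annotation, ...); the remaining columns are sample names.
N_INFO_COLUMNS = 14


def leaf_names(newick):
    """Leaf labels of a Newick string (unquoted labels only)."""
    return set(re.findall(r"[(,]\s*([^(),:;\s]+)", newick))


def sample_names(csv_path, n_info=N_INFO_COLUMNS):
    """Sample column names taken from the header row of the Roary CSV."""
    with open(csv_path, newline="") as handle:
        header = next(csv.reader(handle))
    return set(header[n_info:])


def compare(tree_path, csv_path, n_info=N_INFO_COLUMNS):
    """Return (names only in the tree, names only in the CSV), both sorted."""
    with open(tree_path) as handle:
        leaves = leaf_names(handle.read())
    samples = sample_names(csv_path, n_info)
    return sorted(leaves - samples), sorted(samples - leaves)
```

Calling compare("core_gene_tree.nwk", "gene_presence_absence.csv") should return two empty lists when the names agree.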

Fixing the ModuleNotFoundError: No module named '_sysconfigdata_x86_64_conda_cos7_linux_gnu' error:

https://blog.csdn.net/baidu_37157624/article/details/121123195
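To see which module the interpreter is looking for before copying files around, a small diagnostic can help. CPython's sysconfig reads the _PYTHON_SYSCONFIGDATA_NAME environment variable to pick its build-configuration module, and conda compiler packages often set it. The sketch below only reports whether that variable is set and whether the named module can be found; the function name check_sysconfigdata is hypothetical, and it assumes it is run with the Python of the affected environment (pg1 here).

```python
import importlib.util
import os


def check_sysconfigdata(env=None):
    """Return (module name, importable?) for _PYTHON_SYSCONFIGDATA_NAME.

    Returns (None, None) when the variable is not set, in which case
    Python falls back to its built-in default name.
    """
    env = os.environ if env is None else env
    name = env.get("_PYTHON_SYSCONFIGDATA_NAME")
    if name is None:
        return None, None
    # find_spec returns None when no such top-level module is on sys.path.
    return name, importlib.util.find_spec(name) is not None
```

If the second value is False, the named file is missing from the environment's lib/python3.x directory, which matches the situation described above.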

pangenome_matrix.png

pangenome_frequency.png

pangenome_pie.png