Pipeline to create and analyze protein-protein interaction networks.
These commands will create a network based on a set of genes you should provide as “seeds.txt”.
With git:
git clone git@github.com:lelimat/PPI_network_pipeline.git
cd PPI_network_pipeline
Or directly downloading the zip file:
wget https://github.com/lelimat/PPI_network_pipeline/archive/master.zip
unzip master.zip
cd PPI_network_pipeline-master
Usage: R —slave —file=get_PPI_data_iRefIndex.R —args [min_evidence] [database_dir].
[database_dir] is the directory where the iRefIndex file will be saved.
R --slave --file=get_PPI_data_iRefIndex.R --args 2 databases
wget ftp://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/Mammalia/Homo_sapiens.gene_info.gz
gunzip Homo_sapiens.gene_info.gz
python create_net_irefindex.py --database=iRefIndex_human.txt --seeds=seeds.txt --output=. --distance=1
The distance is the number of levels of neighbors added to the network starting from the seeds. If you only need to see how the seeds connect. Choose distance=0. For seeds + first neighbors, choose distance=1, and so on.
Output:
echo -e "node_1\tnode_2" > edges_attributes.txt
cat edges.txt >> edges_attributes.txt
python network_topology.py edges_attributes.txt Homo_sapiens.gene_info
rm edges.txt
R --slave --file=get_network_info.R --args nodes_attributes.txt
grep Yes nodes_attributes.txt | cut -f1 > in_list_Yes.txt
cut -f1 nodes_attributes.txt | grep -v node > all_genes.txt
python net_cytoscape.py
The command above will generate a file “network.xgmml” to be open with Cytoscape.