Project author: zy26

Project description:
C++ demo for Mutual Rank based co-expression hub/module identification
Language: C++
Repository: git://github.com/zy26/mrct.git
Created: 2017-07-30T01:12:19Z
Project page: https://github.com/zy26/mrct


About MRCT

MRCT is a program that identifies hub structures and co-expression modules.

For more information about the method, see this link.

For the R code that reproduces the results of “MRHCA: A nonparametric statistics based method for hub and co-expression module identification in large gene co-expression network”, see this link.

How to clone the code

  1. git clone --recursive https://github.com/zy26/mrct.git
  2. cd mrct/

Alternatively, click Clone or download under the repository name, then put the code files of libmrct into the lib directory.

How to compile and run the program?

Linux

You may need g++ version >= 5.4.0.

  1. g++ -std=c++14 -fopenmp -O3 lib/*.cc src/*.cc
  2. ./a.out -i testdata/E_coli_anaerobic.txt -o E_coli_anaerobic.id.txt -e E_coli_anaerobic.mr.txt -t 0.9

Windows

Use the Visual Studio project file mrct.vcxproj included in the repository. You may need to change the Platform Toolset to the version installed on your PC (Visual Studio 2019 or later is required to allow unsigned int as the OpenMP loop counter).

The Command Arguments should be -i testdata/E_coli_anaerobic.txt -o E_coli_anaerobic.id.txt -e E_coli_anaerobic.mr.txt -t 0.9, or you can substitute your own file names.

Warning: Avoid Debug mode, as it may be very slow.
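
The unsigned loop counter mentioned in the Windows notes can be illustrated with a minimal sketch. OpenMP 2.0 only accepts signed integer loop counters, while OpenMP 3.0 and later also accept unsigned ones, which is why an older toolset may reject such a loop. The function parallel_sum below is a hypothetical helper written for this illustration and is not taken from the MRCT sources.

```cpp
#include <vector>

// Hypothetical helper (not from the MRCT sources): sums a vector in parallel.
// The loop counter is `unsigned int`, which OpenMP 3.0+ accepts but
// OpenMP 2.0 (signed counters only) rejects.
double parallel_sum(const std::vector<double>& values) {
    const unsigned int n = static_cast<unsigned int>(values.size());
    double total = 0.0;
    // If OpenMP is not enabled, the pragma is ignored and the loop runs serially.
    #pragma omp parallel for reduction(+ : total)
    for (unsigned int i = 0; i < n; ++i) {
        total += values[i];
    }
    return total;
}
```

With g++, building this with -fopenmp (as in the Linux command above) runs the loop in parallel; without that flag the function still returns the same sum.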

What does the output mean?

The content of the first output file (E_coli_anaerobic.id.txt) will look similar to the following (the leading "1.", "2.", ... are listing line numbers, not part of the file).

  1. 1 0 1639 979 b3385
  2. 1 0 1589 1232 b0421
  3. 1 0 1583 2167 b3649
  4. 1 0 1559 535 b0184
  5. 1 0 1549 3789 b4557
  6. 1 0 1547 2562 b3833
  7. 1 0 1532 2777 b0434
  8. 1 0 1528 960 b3560
  9. 1 0 1520 2837 b0637
  10. 1 0 1512 2115 b3704
  11. 1 0 1506 3861 b3999
  12. 1 0 1501 13 b3255
  13. 1 0 1496 1880 b4201
  14. 1 0 1481 698 b2610
  15. 1 0 1480 2425 b3838
  16. 1 0 1462 2112 b0183
  17. 1 0 1458 1049 b3997
  18. 1 0 1447 496 b3639
  19. 1 0 1433 2103 b0953
  20. 1 0 1426 1099 b4372
  21. 1 0 1417 1335 b3200
  22. 1 0 1414 2032 b3620
  23. 1 0 1409 2521 b2960
  24. 1 0 1406 566 b3640
  25. 1 0 1406 2113 b0610
  26. 1 0 1406 2133 b3985
  27. 1 0 1395 2489 b3035
  28. 1 0 1392 2520 b3651
  29. 1 0 1386 3769 b3611
  30. 1 0 1382 1340 b0179
  31. 1 0 1374 3770 b3614
  32. 1 0 1365 2066 b3783
  33. 1 0 1367 3727 b3499
  34. 1 0 1359 992 b3610
  35. 1 0 1344 1480 b1782
  36. 1 0 1342 2836 b0636
  37. 1 0 1340 2176 b3341
  38. 1 0 1330 1894 b1831
  39. 1 0 1325 1234 b2747
  40. 1 0 1315 2183 b3307
  41. 1 0 1315 4185 b2792
  42. 1 0 1314 2519 b2607
  43. 1 0 1314 823 b2748
  44. 1 0 1310 2126 b3984
  45. 1 0 1305 2134 b3983
  46. 1 0 1301 2181 b3342
  47. 1 0 1293 1694 b3982
  48. 1 0 1289 1305 b2828
  49. 1 0 1284 1551 b0923
  50. 1 0 1282 801 b0172
  51. 1 0 1273 1236 b2746
  52. 1 0 1265 197 b3734
  53. 1 0 1254 2336 b3641
  54. 1 0 1253 203 b3733
  55. 1 0 1245 4267 b4212
  56. 1 0 1232 204 b3735
  57. 1 0 1230 2784 b0471
  58. 1 0 1228 1336 b3201
  59. 1 0 1234 3746 b3552
  60. 1 0 1208 2539 b0170
  61. 1 0 1198 3592 b2953
  62. 1 0 1200 3104 b1447
  63. 1 0 1188 3498 b2643
  64. 1 0 1179 4245 b3199
  65. 1 0 1175 2204 b0176
  66. 1 0 1170 2146 b3318
  67. 1 0 1174 2292 b3981
  68. 1 0 1162 1828 b3821
  69. 1 0 1164 2030 b3619
  70. 1 0 1160 1762 b2320
  71. 1 0 1171 3606 b2986
  72. 1 0 1136 2535 b2745
  73. 1 0 1124 846 b3340
  74. 1 0 1139 1048 b3804
  75. 1 0 1134 2427 b4483
  76. 1 0 1119 2849 b0699
  77. 1 0 1116 2122 b3643
  78. 1 0 1121 2972 b1034
  79. 1 0 1101 4243 b3195
  80. 1 0 1087 3788 b3705
  81. 1 0 1078 1854 b4226
  82. 1 0 1077 3285 b1978
  83. 1 0 1086 2168 b2830
  84. 1 0 1070 3603 b2983
  85. 1 0 1068 2932 b0898
  86. 1 0 1052 969 b2029
  87. 1 0 1077 3231 b1803
  88. 1 0 1051 3736 b3527
  89. 1 0 1025 4242 b3194
  90. 1 0 1016 363 b0624
  91. 1 0 1030 1900 b0331
  92. 1 0 979 3328 b2122
  93. 1 0 977 4175 b2647
  94. 1 0 969 3417 b2358
  95. 1 139 0 2013 b3700
  96. 1 0 972 3416 b2357
  97. 1 0 977 2742 b0285
  98. 1 0 975 1477 b1175
  99. 1 0 937 1158 b2490
  100. 1 0 942 277 b1987
  101. 1 0 924 4176 b2650
  102. 1 0 951 1534 b2113
  103. 1 0 932 2528 b1265
  104. 1 0 896 2875 b0718
  105. 1 0 933 2034 b3842
  106. 1 0 876 2060 b3901
  107. 1 0 881 2811 b0521
  108. 1 0 853 3421 b2362
  109. 1 372 0 2233 b1012
  110. 1 0 901 1252 b3526
  111. 1 0 848 2640 b2047
  112. 1 0 840 809 b3897
  113. 1 0 836 3536 b2770
  114. 1 0 844 4094 b1585
  115. 1 0 885 1825 b2648
  116. 1 0 865 1253 b1827
  117. 1 364 0 3073 b1360
  118. 1 131 0 2612 b1914
  119. 1 0 834 3032 b1251
  120. 1 0 813 340 b2933
  121. 1 293 0 2302 b2913
  122. 1 0 816 2639 b2050
  123. 1 0 849 3835 b3878
  124. 1 0 819 1757 b3403
  125. 1 0 840 3196 b1724
  126. 1 183 0 3244 b1832
  127. 1 324 0 2747 b0291
  128. 1 135 0 3007 b1179
  129. 1 289 0 1570 b0750
  130. 1 0 801 2865 b0719
  131. 1 308 0 1295 b0075
  132. 1 0 805 3190 b1698
  133. 1 0 791 2727 b0267
  134. 1 0 799 1801 b4105
  135. 1 373 0 3535 b2769
  136. 1 291 0 1443 b3008
  137. 1 0 792 661 b1701
  138. 1 357 0 2746 b0290
  139. 1 315 0 3418 b2359
  140. 1 0 786 3078 b1407
  141. 1 0 779 4171 b2547
  142. 1 0 781 3629 b3080
  143. 1 221 0 2455 b2103
  144. 1 288 0 127 b3172
  145. 1 0 779 3415 b2356
  146. 1 0 781 2307 b1975
  147. 1 0 781 3913 b4183
  148. 1 0 773 2952 b0941
  149. 1 0 784 2467 b0002
  150. 1 374 0 1248 b3137
  151. 1 371 0 3912 b4182
  152. 1 253 0 121 b2818
  153. 1 247 0 126 b0273
  154. 1 318 0 3385 b2294
  155. 1 0 768 2859 b0682
  156. 1 368 0 122 b3959
  157. 1 0 768 2647 b2061
  158. 1 231 0 1194 b0077
  159. 1 201 0 123 b3958
  160. 1 367 0 1730 b1392
  161. 1 211 0 2463 b2104
  162. 1 214 0 1290 b0074
  163. 1 0 761 153 b0754
  164. 1 304 0 1727 b1389
  165. 1 0 738 795 b4474
  166. 1 346 0 3456 b2506
  167. 1 228 0 167 b0860
  168. 1 218 0 1193 b0078
  169. 1 325 0 7 b1338
  170. 1 177 0 1957 b1302
  171. 1 0 738 359 b2048
  172. 1 254 0 2085 b1066
  173. 1 209 0 158 b0112
  174. 1 0 736 4017 b4572
  175. 1 233 0 1293 b0071
  176. 1 207 0 2459 b3990
  177. 1 223 0 2526 b1263
  178. 1 236 0 1639 b3661
  179. 1 219 0 1444 b3829
  180. 1 237 0 1292 b0072
  181. 1 233 0 3626 b3074
  182. 1 210 0 889 b1761
  183. 1 239 0 124 b3359
  184. 1 309 0 2642 b2044
  185. 1 212 0 376 b1040
  186. 1 199 0 1441 b4013
  187. 1 202 0 1291 b0073
  188. 1 206 0 378 b1038
  189. 1 224 0 129 b4254
  190. 1 220 0 3582 b2930
  191. 1 218 0 2523 b1260
  192. 1 141 0 2939 b0906
  193. 1 136 0 665 b2097
  194. 1 131 0 2420 b1519

This means that 194 hubs were found.
The second column is the cluster bound.
The third column is an upper bound of the cluster.
The fourth column is the index of the original row in the matrix of the input file.
The last column is the row name corresponding to the fourth column.

The R package can be used to fix the third column later.
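
The column layout described above can be read back with a short C++ sketch. It assumes each line of the file holds exactly the five whitespace-separated fields, without the listing numbers shown in the excerpt. The struct HubRecord, its field names, and the function read_hubs are hypothetical names chosen for this illustration; the meaning of the first column is not described here, so it is only stored.

```cpp
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Field names are illustrative; only columns 2 to 5 are described in the text.
struct HubRecord {
    int first_column;      // column 1: meaning not described
    long cluster_bound;    // column 2: cluster bound
    long upper_bound;      // column 3: upper bound of the cluster
    long row_index;        // column 4: row index in the input matrix
    std::string row_name;  // column 5: row name for that index
};

// Reads whitespace-separated hub records, skipping lines that do not parse.
std::vector<HubRecord> read_hubs(std::istream& in) {
    std::vector<HubRecord> hubs;
    std::string line;
    while (std::getline(in, line)) {
        std::istringstream fields(line);
        HubRecord r;
        if (fields >> r.first_column >> r.cluster_bound >> r.upper_bound
                   >> r.row_index >> r.row_name) {
            hubs.push_back(r);
        }
    }
    return hubs;
}
```

Passing a std::ifstream opened on E_coli_anaerobic.id.txt to read_hubs and calling size() on the result gives the number of hubs found.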

There is a second output file. Its first column is the same as the fourth column of the file above.
The other columns are the EM values for that row.
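
A similar sketch can load the second output file. It assumes the file is whitespace-separated and that the EM values are numeric; neither detail is stated above, so adjust the parsing if the actual file differs. EmRow and read_em_rows are hypothetical names used only for this example.

```cpp
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// One row of the second output file (layout assumed, see the note above).
struct EmRow {
    long row_index;              // same index as column 4 of the first file
    std::vector<double> values;  // remaining columns: EM values for that row
};

// Reads rows of the form "<index> <value> <value> ...", skipping lines
// that do not start with an integer index.
std::vector<EmRow> read_em_rows(std::istream& in) {
    std::vector<EmRow> rows;
    std::string line;
    while (std::getline(in, line)) {
        std::istringstream fields(line);
        EmRow row;
        if (!(fields >> row.row_index)) {
            continue;  // blank or malformed line
        }
        double value;
        while (fields >> value) {
            row.values.push_back(value);
        }
        rows.push_back(row);
    }
    return rows;
}
```

The row_index of each EmRow can then be matched against the fourth column of the first file to join the two outputs.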