{"id":888,"date":"2023-08-15T11:16:00","date_gmt":"2023-08-15T18:16:00","guid":{"rendered":"https:\/\/labs.wsu.edu\/scads\/?page_id=888"},"modified":"2025-08-19T13:32:12","modified_gmt":"2025-08-19T20:32:12","slug":"bioinformatics","status":"publish","type":"page","link":"https:\/\/labs.wsu.edu\/scads\/research\/bioinformatics\/","title":{"rendered":"Bioinformatics"},"content":{"rendered":"\n<h1 class=\"wp-block-heading\">Bioinformatics<\/h1>\n\n\n\n<hr class=\"wp-block-separator has-css-opacity\" \/>\n\n\n\n<p><strong>Software Tool for Short Sequence Repeat Analysis<\/strong><\/p>\n\n\n\n<p>Patterns of short-sequence repeats (SSRs) at specific loci are widely used to distinguish microbial strains. In collaboration with&nbsp;<a href=\"https:\/\/vetmed.wsu.edu\/departments\/paul-g-allen-school-for-global-health\/\">Prof. Kelly Brayton<\/a>&nbsp;(Allen School for Global Animal Health, WSU), we developed&nbsp;<em>RepeatAnalyzer<\/em>\u2014a robust software tool for managing and analyzing SSR data on bacterial strains infecting cattle. The initial release included an&nbsp;<em>Anaplasma marginale<\/em>&nbsp;dataset and supported storing, searching, and identifying (i) SSR motifs, (ii) strains containing those motifs, and (iii) the publications reporting them. The prototype was deployed to track&nbsp;<em>A. marginale<\/em>&nbsp;strains in South Africa, Asia, and multiple U.S. groups. Although our first focus was&nbsp;<em>A. marginale<\/em>, the software was designed to remain flexible for any species whose strains are defined by SSRs.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n\n\n\n<p><strong>Sequence Similarity Network Model and Applications<\/strong><\/p>\n\n\n\n<p>Sequence similarity networks are useful for classifying and characterizing biologically important\u202fproteins and other structures. Threshold-based approaches to similarity network construction using exact distance measures are\u202fprohibitively slow to compute and rely on the difficult task of selecting an appropriate threshold, while similarity\u202fnetworks based on approximate distance calculations compromise useful structural information.&nbsp;<\/p>\n\n\n\n<p>We introduced the Directed Weighted Average Nearest-Neighbor (DiWANN) framework that overcomes both of these drawbacks. DiWANN creates sparse but information-rich similarity graphs by representing each sequence by a node and connecting the node via a directed edge to only the closest sequence, or sequences in the case of ties. The weights on the edges represent the edit distance between the sequences. Our DiWANN model is accompanied by an efficient pruning and bounding-based algorithm for constructing the network. This loss-aware pruning lets researchers map relationships across tens of thousands of genomes without the noise or computational burden of dense networks, revealing clusters and outliers that dense graphs often obscure.<\/p>\n\n\n\n<p>Building on DiWANN\u2019s concise representation, we examined various applications, including analysis of cancer\u2019s driver-mutation landscape through two complementary graph views. A reduced-data DiWANN sequence-similarity network links tumors with highly similar driver-gene profiles, spotlighting cancer types whose samples cluster tightly\u2014and may therefore respond to common therapies. In parallel, a bipartite tumor-to-gene network (and its one-mode gene projection) shows which driver genes frequently co-occur and which remain exclusive to particular cancers. Together, these network perspectives uncover both pan-cancer and cancer-specific driver mutations, offering insight into tumour heterogeneity and potential therapeutic targets.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\" \/>\n\n\n\n<h2 class=\"wp-block-heading\">Papers<\/h2>\n\n\n\n<ul>\n<li>H. Catanese, K. Brayton and A.H. Gebremedhin.&nbsp;<strong>A Nearest-Neighbors Network Model for Sequence Data Reveals New Insight into Genotype Distribution of a Pathogen,<\/strong>&nbsp;BMC Bioinformatics (2018) 19:475. https:\/\/doi.org\/10.1186\/s12859-018-2453-2.<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#nearest_neighbor_pathogen_2018\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"https:\/\/rdcu.be\/bdivz\">Paper<\/a><\/li>\n<\/ul>\n\n\n\n<ul>\n<li>S. Patil, S. Roberts, A. Gebremedhin.<br><strong>Network Analysis of Driver Genes in Human Cancers<\/strong>, <em>Frontiers in Bioinformatics<\/em> 4 (2024).<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#network_analysis_driver_genes_2024\">Abstract<\/a>&nbsp;&nbsp;<a href=\"https:\/\/www.frontiersin.org\/journals\/bioinformatics\/articles\/10.3389\/fbinf.2024.1365200\/full\">Paper<\/a><\/li>\n<\/ul>\n\n\n\n<ul>\n<li>S. Patil, H. Catanese, K. Brayton, E. Lofgren, A. H. Gebremedhin.<br><strong>Sequence Similarity Network Analysis Provides Insight into the Temporal and Geographical Distribution of Mutations in SARS-CoV-2 Spike Protein<\/strong>, <em>Viruses<\/em> 14, 1672 (2022).<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#sars_cov2_spike_2022\">Abstract<\/a>&nbsp;&nbsp;<a href=\"#\">Paper<\/a><\/li>\n<\/ul>\n\n\n\n<ul>\n<li>H.N. Catanese, K.A. Brayton and A.H. Gebremedhin,&nbsp;<strong>RepeatAnalyzer: a tool for analysing and managing short-sequence repeat data<\/strong>, BMC Genomics 2016 17:422. DOI: 10.1186\/s12864-016-2686-2<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#repeat_analyzer_2016\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/bmcgenomics.biomedcentral.com\/articles\/10.1186\/s12864-016-2686-2\">Paper<\/a><\/li>\n\n\n\n<li>Z.T.H. Khumalo, H.N. Catanese, N. Leisching, P. Hove, N.E. Collins, M.E. Chaisit,&nbsp;&nbsp;A.H. Gebremedhin, M.C. Oosthuizen and K.A. Brayton,&nbsp;<strong>Characterization of&nbsp;<em>Anaplasma marginale<\/em>&nbsp;subspecies&nbsp;<em>centrale<\/em>&nbsp;using msp1aS genotyping reveals wildfire reservoir<\/strong>,&nbsp; Journal of Clinical Microbiology, 2016 54:10, 2503-2512<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#msp1as_2016\">Abstract<\/a>&nbsp;&nbsp;&nbsp;&nbsp;<a href=\"http:\/\/bmcgenomics.biomedcentral.com\/articles\/10.1186\/s12864-016-2686-2\">Paper<\/a><\/li>\n\n\n\n<li>P. Hove, M.E. Chaisi, K.A. Brayton, H. Ganesan, H.N. Catanese, M.S. Mtshali, A.M. Mutshembele, M.C. Oosthuizen, and N.E. Collins,&nbsp;<strong>Co-infections with multiple genotypes of&nbsp;<em>Anaplasma marginale<\/em>&nbsp;in cattle indicate pathogen diversity<\/strong>, Parasites &amp; Vectors (2018) 11:5<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#coinfection_2017\">Abstract<\/a>&nbsp;&nbsp;<a href=\"https:\/\/parasitesandvectors.biomedcentral.com\/articles\/10.1186\/s13071-017-2595-5\">Paper<\/a><\/li>\n\n\n\n<li>E. Khaledian, A. H. Gebremedhin, K. Brayton, S. Broschat.<br><strong>A Network Science Approach for Determining the Ancestral Phylum of Bacteria<\/strong>, <em>ACM-BCB 2018<\/em>.<br><a href=\"https:\/\/labs.wsu.edu\/scads\/research\/#network_science_bacteria_2018\">Abstract<\/a>&nbsp;|&nbsp;<a href=\"#\">Paper<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Bioinformatics Software Tool for Short Sequence Repeat Analysis Patterns of short-sequence repeats (SSRs) at specific loci are widely used to distinguish microbial strains. In collaboration with&nbsp;Prof. Kelly Brayton&nbsp;(Allen School for Global Animal Health, WSU), we developed&nbsp;RepeatAnalyzer\u2014a robust software tool for managing and analyzing SSR data on bacterial strains infecting cattle. The initial release included an&nbsp;Anaplasma [&hellip;]<\/p>\n","protected":false},"author":5234,"featured_media":0,"parent":861,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"categories":[],"tags":[],"university_category":[],"location":[],"organization":[],"_links":{"self":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/888"}],"collection":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/users\/5234"}],"replies":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/comments?post=888"}],"version-history":[{"count":22,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/888\/revisions"}],"predecessor-version":[{"id":1852,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/888\/revisions\/1852"}],"up":[{"embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/pages\/861"}],"wp:attachment":[{"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/media?parent=888"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/categories?post=888"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/tags?post=888"},{"taxonomy":"wsuwp_university_category","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/university_category?post=888"},{"taxonomy":"wsuwp_university_location","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/location?post=888"},{"taxonomy":"wsuwp_university_org","embeddable":true,"href":"https:\/\/labs.wsu.edu\/scads\/wp-json\/wp\/v2\/organization?post=888"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}