Virus Host Classifier

Step 1: Predict the overall host of a viral sequence (human vs non-human) and identify the regions that push the prediction most strongly in either direction.
Step 2: Explore subregions to inspect local feature influence, feature distribution, GC content, and related statistics.
Step 3: Analyze gene features and their contributions.
Step 4: Compare sequences and analyze differences.
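
As an illustration of the subregion statistics mentioned in Step 2, here is a minimal sketch of windowed GC content. The function names (`gc_content`, `windowed_gc`) and the `window`/`step` parameters are hypothetical and are not taken from the tool itself; ambiguous bases such as N are simply ignored here, which is an assumption.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among unambiguous A/C/G/T bases; 0.0 if there are none."""
    seq = seq.upper()
    gc = sum(seq.count(base) for base in "GC")
    total = gc + sum(seq.count(base) for base in "AT")
    return gc / total if total else 0.0


def windowed_gc(seq: str, window: int, step: int) -> list[tuple[int, int, float]]:
    """Return (start, end, gc_fraction) for each full window along seq."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        (start, start + window, gc_content(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


# Toy sequence, for illustration only.
genome = "ATGCGCGCATATATATGGCC"
for start, end, gc in windowed_gc(genome, window=8, step=4):
    print(f"{start:>3}-{end:<3} GC={gc:.2f}")
```

Only full windows are reported, so a sequence shorter than `window` yields an empty list rather than a partial, noisier estimate.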

Color Scale: Negative values = Blue, Zero = White, Positive values = Red.
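
The color scale above can be sketched as a linear blend from blue through white to red. This is an illustrative assumption about how such a mapping might work, not the tool's actual rendering code; `diverging_color` and `symmetric_limit` are hypothetical names.

```python
def symmetric_limit(values) -> float:
    """Largest absolute value, so the color range is symmetric around zero.

    Falls back to 1.0 when there are no values or all of them are zero.
    """
    return max((abs(v) for v in values), default=0.0) or 1.0


def diverging_color(value: float, limit: float) -> str:
    """Map value in [-limit, limit] to a hex color: blue -> white -> red."""
    if limit <= 0:
        raise ValueError("limit must be positive")
    # Clamp to [-1, 1] so out-of-range values saturate instead of overflowing.
    t = max(-1.0, min(1.0, value / limit))
    fade = round(255 * (1 - abs(t)))  # 255 at zero (white), 0 at the extremes
    if t < 0:
        r, g, b = fade, fade, 255     # toward blue
    else:
        r, g, b = 255, fade, fade     # toward red
    return f"#{r:02x}{g:02x}{b:02x}"


scores = [-0.4, 0.0, 0.1, 0.8]
limit = symmetric_limit(scores)
print([diverging_color(s, limit) for s in scores])
```

Using one shared limit for both signs keeps zero at white, so color intensity is comparable between negative and positive values.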


Interface Features

  • Overall Classification (human vs non-human) using k-mer frequencies
  • Feature Importance Analysis shows which k-mers push classification toward or away from human
  • White-Centered Gradient:
    • Negative (blue), 0 (white), Positive (red)
    • Symmetrical color range around 0
  • Subregion Identification finds the regions that push most strongly toward human or non-human
  • Gene Feature Analysis:
    • Analyze individual genes' contributions
    • Interactive genome viewer
    • Gene-level statistics and classification
  • Sequence Comparison:
    • Compare two sequences to identify regions of difference
    • Normalized comparison to handle different lengths
    • Statistical summary of differences
  • Data Export:
    • Download results as CSV files
    • Download k-mer importance values
    • Save analysis outputs for further processing
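
To make the k-mer frequency, normalized comparison, and CSV export items above concrete, here is a minimal sketch. It assumes that normalization means dividing each k-mer count by the total number of k-mers, so sequences of different lengths become comparable; the tool's actual method may differ. All function names and the CSV column names are hypothetical.

```python
import csv
import io
from collections import Counter


def kmer_frequencies(seq: str, k: int) -> dict[str, float]:
    """Relative frequency of each k-mer made only of A/C/G/T."""
    if k <= 0:
        raise ValueError("k must be positive")
    seq = seq.upper()
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= set("ACGT")  # skip k-mers with ambiguous bases
    )
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


def frequency_differences(seq_a: str, seq_b: str, k: int) -> dict[str, float]:
    """freq_a - freq_b for every k-mer seen in either sequence, sorted by k-mer."""
    fa, fb = kmer_frequencies(seq_a, k), kmer_frequencies(seq_b, k)
    return {
        kmer: fa.get(kmer, 0.0) - fb.get(kmer, 0.0)
        for kmer in sorted(set(fa) | set(fb))
    }


def to_csv(differences: dict[str, float]) -> str:
    """Serialize the differences as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["kmer", "frequency_difference"])
    for kmer, diff in differences.items():
        writer.writerow([kmer, f"{diff:.4f}"])
    return buffer.getvalue()


# Two toy sequences of different lengths, for illustration only.
print(to_csv(frequency_differences("ACGTACGT", "ACGTACGTACGTACGT", k=2)))
```

Because frequencies rather than raw counts are compared, a sequence and a longer repeat of the same motif produce differences near zero instead of differences that scale with length.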