Amino acid dipepetide frequency for Wuhan Insect virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.525AlaAla: 3.525 ± 1.0
1.007AlaCys: 1.007 ± 0.587
5.035AlaAsp: 5.035 ± 4.061
3.021AlaGlu: 3.021 ± 0.984
1.762AlaPhe: 1.762 ± 1.134
2.518AlaGly: 2.518 ± 1.062
1.007AlaHis: 1.007 ± 0.32
3.273AlaIle: 3.273 ± 0.894
3.273AlaLys: 3.273 ± 1.548
4.532AlaLeu: 4.532 ± 0.838
1.259AlaMet: 1.259 ± 0.496
1.762AlaAsn: 1.762 ± 0.569
1.007AlaPro: 1.007 ± 0.507
1.511AlaGln: 1.511 ± 0.624
3.273AlaArg: 3.273 ± 1.35
2.769AlaSer: 2.769 ± 0.791
2.014AlaThr: 2.014 ± 0.932
2.518AlaVal: 2.518 ± 1.421
0.0AlaTrp: 0.0 ± 0.0
2.518AlaTyr: 2.518 ± 1.414
0.0AlaXaa: 0.0 ± 0.0
Cys
1.259CysAla: 1.259 ± 0.984
0.252CysCys: 0.252 ± 0.313
1.007CysAsp: 1.007 ± 0.852
0.504CysGlu: 0.504 ± 0.213
0.755CysPhe: 0.755 ± 0.286
1.259CysGly: 1.259 ± 0.558
0.0CysHis: 0.0 ± 0.0
2.014CysIle: 2.014 ± 1.417
1.762CysLys: 1.762 ± 0.476
1.762CysLeu: 1.762 ± 0.832
0.755CysMet: 0.755 ± 0.44
1.007CysAsn: 1.007 ± 0.427
1.511CysPro: 1.511 ± 0.627
0.755CysGln: 0.755 ± 0.286
0.252CysArg: 0.252 ± 0.213
2.014CysSer: 2.014 ± 0.532
1.007CysThr: 1.007 ± 0.308
1.007CysVal: 1.007 ± 0.487
0.252CysTrp: 0.252 ± 0.147
0.755CysTyr: 0.755 ± 0.44
0.0CysXaa: 0.0 ± 0.0
Asp
3.525AspAla: 3.525 ± 0.594
1.511AspCys: 1.511 ± 0.494
5.539AspAsp: 5.539 ± 1.192
3.273AspGlu: 3.273 ± 1.751
1.762AspPhe: 1.762 ± 0.492
3.273AspGly: 3.273 ± 0.443
1.511AspHis: 1.511 ± 0.542
7.301AspIle: 7.301 ± 0.867
4.28AspLys: 4.28 ± 1.397
6.294AspLeu: 6.294 ± 1.195
1.259AspMet: 1.259 ± 0.36
3.525AspAsn: 3.525 ± 0.905
2.769AspPro: 2.769 ± 0.835
2.266AspGln: 2.266 ± 0.954
2.769AspArg: 2.769 ± 0.591
5.539AspSer: 5.539 ± 1.422
2.014AspThr: 2.014 ± 0.577
2.518AspVal: 2.518 ± 0.54
0.755AspTrp: 0.755 ± 0.298
1.762AspTyr: 1.762 ± 0.58
0.0AspXaa: 0.0 ± 0.0
Glu
3.021GluAla: 3.021 ± 1.229
0.755GluCys: 0.755 ± 0.4
2.769GluAsp: 2.769 ± 0.475
3.273GluGlu: 3.273 ± 0.892
2.518GluPhe: 2.518 ± 0.675
3.021GluGly: 3.021 ± 1.501
1.259GluHis: 1.259 ± 0.424
5.791GluIle: 5.791 ± 1.165
3.525GluLys: 3.525 ± 1.313
7.805GluLeu: 7.805 ± 1.53
1.762GluMet: 1.762 ± 1.024
2.518GluAsn: 2.518 ± 0.373
2.014GluPro: 2.014 ± 0.558
1.511GluGln: 1.511 ± 0.452
2.014GluArg: 2.014 ± 0.975
4.532GluSer: 4.532 ± 1.492
4.532GluThr: 4.532 ± 1.325
3.776GluVal: 3.776 ± 0.691
0.755GluTrp: 0.755 ± 0.505
2.014GluTyr: 2.014 ± 0.509
0.0GluXaa: 0.0 ± 0.0
Phe
1.762PheAla: 1.762 ± 0.581
0.504PheCys: 0.504 ± 0.213
1.762PheAsp: 1.762 ± 0.676
3.021PheGlu: 3.021 ± 0.526
0.504PhePhe: 0.504 ± 0.293
2.014PheGly: 2.014 ± 0.734
1.511PheHis: 1.511 ± 0.532
1.762PheIle: 1.762 ± 0.697
2.266PheLys: 2.266 ± 0.755
2.266PheLeu: 2.266 ± 0.846
1.259PheMet: 1.259 ± 0.527
2.518PheAsn: 2.518 ± 0.675
2.014PhePro: 2.014 ± 0.806
1.259PheGln: 1.259 ± 0.551
2.014PheArg: 2.014 ± 0.771
2.769PheSer: 2.769 ± 0.582
1.511PheThr: 1.511 ± 0.533
1.511PheVal: 1.511 ± 0.596
0.755PheTrp: 0.755 ± 0.298
0.755PheTyr: 0.755 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
2.014GlyAla: 2.014 ± 0.44
0.755GlyCys: 0.755 ± 0.363
2.266GlyAsp: 2.266 ± 0.656
2.769GlyGlu: 2.769 ± 0.867
2.266GlyPhe: 2.266 ± 0.749
2.014GlyGly: 2.014 ± 0.566
1.762GlyHis: 1.762 ± 0.303
3.021GlyIle: 3.021 ± 0.664
2.769GlyLys: 2.769 ± 0.727
4.532GlyLeu: 4.532 ± 1.075
3.525GlyMet: 3.525 ± 1.049
3.776GlyAsn: 3.776 ± 0.542
0.504GlyPro: 0.504 ± 0.42
0.504GlyGln: 0.504 ± 0.213
1.762GlyArg: 1.762 ± 0.492
5.035GlySer: 5.035 ± 1.274
1.511GlyThr: 1.511 ± 0.301
4.028GlyVal: 4.028 ± 0.972
0.755GlyTrp: 0.755 ± 0.44
2.266GlyTyr: 2.266 ± 0.717
0.0GlyXaa: 0.0 ± 0.0
His
1.259HisAla: 1.259 ± 0.363
0.252HisCys: 0.252 ± 0.213
2.518HisAsp: 2.518 ± 0.678
0.252HisGlu: 0.252 ± 0.147
1.511HisPhe: 1.511 ± 0.69
0.755HisGly: 0.755 ± 0.298
1.007HisHis: 1.007 ± 0.416
2.518HisIle: 2.518 ± 1.487
1.762HisLys: 1.762 ± 0.597
2.266HisLeu: 2.266 ± 1.118
0.504HisMet: 0.504 ± 0.293
0.755HisAsn: 0.755 ± 0.353
1.259HisPro: 1.259 ± 0.99
0.504HisGln: 0.504 ± 0.293
2.014HisArg: 2.014 ± 0.566
0.755HisSer: 0.755 ± 0.286
0.755HisThr: 0.755 ± 0.298
1.511HisVal: 1.511 ± 0.461
0.504HisTrp: 0.504 ± 0.293
1.259HisTyr: 1.259 ± 0.513
0.0HisXaa: 0.0 ± 0.0
Ile
3.776IleAla: 3.776 ± 0.329
1.511IleCys: 1.511 ± 0.937
4.783IleAsp: 4.783 ± 0.785
4.532IleGlu: 4.532 ± 1.462
2.266IlePhe: 2.266 ± 0.434
5.539IleGly: 5.539 ± 1.297
2.769IleHis: 2.769 ± 1.148
4.28IleIle: 4.28 ± 0.63
7.805IleLys: 7.805 ± 1.602
6.546IleLeu: 6.546 ± 1.024
2.518IleMet: 2.518 ± 0.568
3.273IleAsn: 3.273 ± 0.905
1.762IlePro: 1.762 ± 0.495
1.259IleGln: 1.259 ± 0.477
3.776IleArg: 3.776 ± 0.704
8.308IleSer: 8.308 ± 1.194
4.028IleThr: 4.028 ± 0.405
3.021IleVal: 3.021 ± 0.726
0.504IleTrp: 0.504 ± 0.213
2.769IleTyr: 2.769 ± 0.586
0.0IleXaa: 0.0 ± 0.0
Lys
1.007LysAla: 1.007 ± 1.067
1.007LysCys: 1.007 ± 1.002
4.028LysAsp: 4.028 ± 0.775
6.798LysGlu: 6.798 ± 1.111
1.511LysPhe: 1.511 ± 0.724
5.035LysGly: 5.035 ± 1.288
1.007LysHis: 1.007 ± 0.427
5.035LysIle: 5.035 ± 1.307
6.042LysLys: 6.042 ± 2.979
4.28LysLeu: 4.28 ± 0.915
3.273LysMet: 3.273 ± 0.803
4.532LysAsn: 4.532 ± 1.31
2.266LysPro: 2.266 ± 0.778
1.762LysGln: 1.762 ± 0.937
4.532LysArg: 4.532 ± 0.764
6.798LysSer: 6.798 ± 0.981
4.532LysThr: 4.532 ± 1.23
6.042LysVal: 6.042 ± 0.882
0.504LysTrp: 0.504 ± 0.293
3.776LysTyr: 3.776 ± 1.075
0.0LysXaa: 0.0 ± 0.0
Leu
3.776LeuAla: 3.776 ± 1.909
3.273LeuCys: 3.273 ± 0.85
5.035LeuAsp: 5.035 ± 1.099
4.532LeuGlu: 4.532 ± 1.088
3.021LeuPhe: 3.021 ± 0.71
3.776LeuGly: 3.776 ± 0.623
2.266LeuHis: 2.266 ± 0.745
7.553LeuIle: 7.553 ± 1.296
5.539LeuLys: 5.539 ± 0.613
6.546LeuLeu: 6.546 ± 0.9
4.532LeuMet: 4.532 ± 0.954
4.783LeuAsn: 4.783 ± 1.069
3.525LeuPro: 3.525 ± 0.629
2.266LeuGln: 2.266 ± 0.632
6.798LeuArg: 6.798 ± 1.158
10.826LeuSer: 10.826 ± 1.508
4.532LeuThr: 4.532 ± 0.692
3.776LeuVal: 3.776 ± 0.833
0.504LeuTrp: 0.504 ± 0.268
4.028LeuTyr: 4.028 ± 1.421
0.0LeuXaa: 0.0 ± 0.0
Met
2.266MetAla: 2.266 ± 1.096
0.504MetCys: 0.504 ± 0.426
2.769MetAsp: 2.769 ± 0.522
1.007MetGlu: 1.007 ± 0.553
1.762MetPhe: 1.762 ± 0.432
2.518MetGly: 2.518 ± 0.881
0.252MetHis: 0.252 ± 0.313
5.035MetIle: 5.035 ± 1.054
1.511MetLys: 1.511 ± 0.531
3.021MetLeu: 3.021 ± 0.945
2.014MetMet: 2.014 ± 1.084
2.266MetAsn: 2.266 ± 1.168
1.259MetPro: 1.259 ± 0.733
0.504MetGln: 0.504 ± 0.268
2.518MetArg: 2.518 ± 1.194
4.783MetSer: 4.783 ± 1.019
2.518MetThr: 2.518 ± 0.458
1.259MetVal: 1.259 ± 0.681
0.252MetTrp: 0.252 ± 0.147
0.755MetTyr: 0.755 ± 0.4
0.0MetXaa: 0.0 ± 0.0
Asn
3.776AsnAla: 3.776 ± 0.477
1.259AsnCys: 1.259 ± 0.624
3.021AsnAsp: 3.021 ± 1.541
3.021AsnGlu: 3.021 ± 0.821
1.511AsnPhe: 1.511 ± 0.88
1.007AsnGly: 1.007 ± 0.507
1.259AsnHis: 1.259 ± 0.522
3.273AsnIle: 3.273 ± 1.002
4.28AsnLys: 4.28 ± 0.499
6.294AsnLeu: 6.294 ± 1.292
2.266AsnMet: 2.266 ± 0.477
2.769AsnAsn: 2.769 ± 0.605
3.273AsnPro: 3.273 ± 0.701
1.762AsnGln: 1.762 ± 0.561
3.273AsnArg: 3.273 ± 1.132
3.021AsnSer: 3.021 ± 0.534
2.014AsnThr: 2.014 ± 0.806
2.518AsnVal: 2.518 ± 1.265
1.007AsnTrp: 1.007 ± 0.587
2.518AsnTyr: 2.518 ± 0.493
0.0AsnXaa: 0.0 ± 0.0
Pro
1.007ProAla: 1.007 ± 0.427
0.504ProCys: 0.504 ± 0.312
1.259ProAsp: 1.259 ± 0.362
2.266ProGlu: 2.266 ± 1.046
1.007ProPhe: 1.007 ± 0.62
1.007ProGly: 1.007 ± 0.305
1.259ProHis: 1.259 ± 0.515
1.762ProIle: 1.762 ± 0.532
3.021ProLys: 3.021 ± 1.1
4.532ProLeu: 4.532 ± 0.695
0.755ProMet: 0.755 ± 0.286
1.511ProAsn: 1.511 ± 0.542
1.007ProPro: 1.007 ± 0.487
1.007ProGln: 1.007 ± 0.416
2.769ProArg: 2.769 ± 0.693
3.776ProSer: 3.776 ± 0.909
3.273ProThr: 3.273 ± 1.065
2.518ProVal: 2.518 ± 0.669
0.504ProTrp: 0.504 ± 0.293
1.007ProTyr: 1.007 ± 0.592
0.0ProXaa: 0.0 ± 0.0
Gln
0.755GlnAla: 0.755 ± 0.94
0.755GlnCys: 0.755 ± 0.639
1.762GlnAsp: 1.762 ± 0.307
0.755GlnGlu: 0.755 ± 0.393
0.755GlnPhe: 0.755 ± 0.298
0.504GlnGly: 0.504 ± 0.733
0.252GlnHis: 0.252 ± 0.213
3.273GlnIle: 3.273 ± 0.809
2.769GlnLys: 2.769 ± 0.77
2.518GlnLeu: 2.518 ± 1.061
1.259GlnMet: 1.259 ± 0.705
1.259GlnAsn: 1.259 ± 0.496
0.504GlnPro: 0.504 ± 0.312
1.007GlnGln: 1.007 ± 0.336
1.007GlnArg: 1.007 ± 0.416
3.021GlnSer: 3.021 ± 0.535
2.014GlnThr: 2.014 ± 0.668
1.762GlnVal: 1.762 ± 0.56
0.252GlnTrp: 0.252 ± 0.213
0.755GlnTyr: 0.755 ± 0.4
0.0GlnXaa: 0.0 ± 0.0
Arg
3.273ArgAla: 3.273 ± 1.751
1.511ArgCys: 1.511 ± 0.582
1.007ArgAsp: 1.007 ± 0.336
4.028ArgGlu: 4.028 ± 0.592
2.014ArgPhe: 2.014 ± 0.523
1.762ArgGly: 1.762 ± 1.027
0.755ArgHis: 0.755 ± 0.286
2.769ArgIle: 2.769 ± 0.785
4.28ArgLys: 4.28 ± 0.836
5.791ArgLeu: 5.791 ± 0.767
0.755ArgMet: 0.755 ± 0.44
2.266ArgAsn: 2.266 ± 0.586
1.762ArgPro: 1.762 ± 0.671
2.014ArgGln: 2.014 ± 0.806
2.266ArgArg: 2.266 ± 0.473
4.783ArgSer: 4.783 ± 1.032
1.259ArgThr: 1.259 ± 0.605
3.776ArgVal: 3.776 ± 1.268
1.511ArgTrp: 1.511 ± 0.448
3.776ArgTyr: 3.776 ± 0.89
0.0ArgXaa: 0.0 ± 0.0
Ser
2.769SerAla: 2.769 ± 1.288
1.007SerCys: 1.007 ± 1.217
7.301SerAsp: 7.301 ± 1.19
6.546SerGlu: 6.546 ± 0.758
4.028SerPhe: 4.028 ± 0.794
6.294SerGly: 6.294 ± 0.779
3.021SerHis: 3.021 ± 0.636
6.042SerIle: 6.042 ± 1.407
6.042SerLys: 6.042 ± 1.275
7.805SerLeu: 7.805 ± 2.206
4.28SerMet: 4.28 ± 1.754
5.539SerAsn: 5.539 ± 1.583
3.273SerPro: 3.273 ± 1.827
3.021SerGln: 3.021 ± 1.49
3.525SerArg: 3.525 ± 0.707
9.063SerSer: 9.063 ± 1.951
6.042SerThr: 6.042 ± 1.595
6.294SerVal: 6.294 ± 0.978
1.259SerTrp: 1.259 ± 0.362
3.525SerTyr: 3.525 ± 1.208
0.0SerXaa: 0.0 ± 0.0
Thr
2.266ThrAla: 2.266 ± 1.07
1.259ThrCys: 1.259 ± 0.318
2.769ThrAsp: 2.769 ± 0.522
5.035ThrGlu: 5.035 ± 1.285
1.259ThrPhe: 1.259 ± 0.445
0.755ThrGly: 0.755 ± 0.531
0.0ThrHis: 0.0 ± 0.0
3.021ThrIle: 3.021 ± 1.057
3.525ThrLys: 3.525 ± 0.774
3.776ThrLeu: 3.776 ± 1.104
2.518ThrMet: 2.518 ± 0.514
1.762ThrAsn: 1.762 ± 0.367
2.266ThrPro: 2.266 ± 0.64
1.511ThrGln: 1.511 ± 0.887
2.769ThrArg: 2.769 ± 0.765
6.546ThrSer: 6.546 ± 1.62
2.518ThrThr: 2.518 ± 1.371
3.021ThrVal: 3.021 ± 0.526
1.762ThrTrp: 1.762 ± 0.599
2.266ThrTyr: 2.266 ± 0.741
0.0ThrXaa: 0.0 ± 0.0
Val
4.028ValAla: 4.028 ± 1.01
0.755ValCys: 0.755 ± 0.298
5.287ValAsp: 5.287 ± 1.263
1.511ValGlu: 1.511 ± 1.202
1.511ValPhe: 1.511 ± 0.63
2.266ValGly: 2.266 ± 0.421
1.762ValHis: 1.762 ± 0.711
4.532ValIle: 4.532 ± 1.142
4.532ValLys: 4.532 ± 1.088
5.791ValLeu: 5.791 ± 1.591
1.511ValMet: 1.511 ± 0.626
3.776ValAsn: 3.776 ± 0.519
2.014ValPro: 2.014 ± 0.771
1.511ValGln: 1.511 ± 0.402
2.266ValArg: 2.266 ± 0.669
7.805ValSer: 7.805 ± 1.019
2.266ValThr: 2.266 ± 0.741
3.776ValVal: 3.776 ± 1.051
0.755ValTrp: 0.755 ± 0.298
1.511ValTyr: 1.511 ± 0.301
0.0ValXaa: 0.0 ± 0.0
Trp
0.504TrpAla: 0.504 ± 0.293
0.504TrpCys: 0.504 ± 0.268
0.504TrpAsp: 0.504 ± 0.38
1.007TrpGlu: 1.007 ± 0.587
1.007TrpPhe: 1.007 ± 0.32
0.755TrpGly: 0.755 ± 0.44
0.252TrpHis: 0.252 ± 0.213
0.0TrpIle: 0.0 ± 0.0
1.762TrpLys: 1.762 ± 0.674
0.252TrpLeu: 0.252 ± 0.147
1.007TrpMet: 1.007 ± 0.418
0.755TrpAsn: 0.755 ± 0.298
0.252TrpPro: 0.252 ± 0.147
0.0TrpGln: 0.0 ± 0.0
0.755TrpArg: 0.755 ± 0.298
1.259TrpSer: 1.259 ± 0.626
1.007TrpThr: 1.007 ± 0.418
1.007TrpVal: 1.007 ± 0.305
0.252TrpTrp: 0.252 ± 0.147
0.252TrpTyr: 0.252 ± 0.313
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.014TyrAla: 2.014 ± 0.697
1.007TyrCys: 1.007 ± 0.587
3.021TyrAsp: 3.021 ± 0.681
2.014TyrGlu: 2.014 ± 0.48
1.259TyrPhe: 1.259 ± 0.827
1.762TyrGly: 1.762 ± 0.417
1.007TyrHis: 1.007 ± 0.305
2.769TyrIle: 2.769 ± 0.771
3.021TyrLys: 3.021 ± 1.28
4.28TyrLeu: 4.28 ± 0.848
1.511TyrMet: 1.511 ± 0.461
2.769TyrAsn: 2.769 ± 0.815
1.511TyrPro: 1.511 ± 0.632
1.007TyrGln: 1.007 ± 0.62
1.007TyrArg: 1.007 ± 0.549
3.273TyrSer: 3.273 ± 1.395
1.259TyrThr: 1.259 ± 0.303
3.525TyrVal: 3.525 ± 1.162
0.252TyrTrp: 0.252 ± 0.213
1.007TyrTyr: 1.007 ± 0.418
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski