Amino acid dipeptide frequency for Banna virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
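As an illustration, a dipeptide frequency can be obtained by counting every pair of adjacent residues in each protein and dividing by the total number of pairs. The sketch below is only one plausible way to do this: it assumes overlapping pairs, one-letter residue codes, and normalisation by the total pair count, none of which is stated on this page. The function name and the toy sequences are made up for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Pairs are counted inside each protein only, never across the
    boundary between two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of adjacent residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration, not Banna virus proteins
toy = ["MKLLA", "KLA"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting within each protein separately avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.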

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
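The arithmetic above can be checked with a short sketch. It computes the uniform expectation in per mille (1000/400 = 2.5‰, i.e. 0.25%), compares a few values copied from the table against it, and converts one value to a plain fraction. The exact rule used to colour the table is not stated here, so the simple greater-than/less-than comparison is an assumption, and the function names are hypothetical.

```python
STANDARD = 20   # standard amino acids
WITH_XAA = 21   # plus Xaa for non-standard or unknown residues


def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Assumed rule: plain comparison against the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(STANDARD)  # 2.5 per mille = 0.25 %

# Three values copied from the table (per mille)
observed = {"LysLeu": 9.702, "AlaPhe": 2.342, "TrpTrp": 0.0}
labels = {pair: classify(value, expected) for pair, value in observed.items()}

# Per mille to fraction: multiply by 10^-3
fraction = observed["LysLeu"] * 1e-3

print(expected, labels, fraction)
```

With 21 symbols the expectation drops to 1000/441, roughly 2.27‰, so a value between 2.27‰ and 2.5‰ would be classified differently depending on which alphabet is used.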

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.186 ± 0.984
AlaCys: 0.335 ± 0.188
AlaAsp: 4.517 ± 1.053
AlaGlu: 2.509 ± 0.541
AlaPhe: 2.342 ± 0.384
AlaGly: 3.513 ± 1.071
AlaHis: 0.669 ± 0.394
AlaIle: 3.68 ± 0.635
AlaLys: 3.011 ± 0.738
AlaLeu: 7.193 ± 1.379
AlaMet: 2.007 ± 0.428
AlaAsn: 3.513 ± 0.587
AlaPro: 2.342 ± 0.56
AlaGln: 2.342 ± 0.707
AlaArg: 2.007 ± 0.568
AlaSer: 4.182 ± 0.868
AlaThr: 4.182 ± 0.816
AlaVal: 4.851 ± 0.748
AlaTrp: 0.669 ± 0.276
AlaTyr: 2.175 ± 0.48
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.502 ± 0.194
CysCys: 0.502 ± 0.342
CysAsp: 0.502 ± 0.336
CysGlu: 1.004 ± 0.418
CysPhe: 0.502 ± 0.309
CysGly: 1.004 ± 0.389
CysHis: 0.0 ± 0.0
CysIle: 0.502 ± 0.44
CysLys: 1.004 ± 0.436
CysLeu: 1.171 ± 0.363
CysMet: 1.004 ± 0.394
CysAsn: 0.836 ± 0.429
CysPro: 0.167 ± 0.243
CysGln: 0.0 ± 0.0
CysArg: 0.502 ± 0.27
CysSer: 0.669 ± 0.289
CysThr: 0.836 ± 0.361
CysVal: 1.506 ± 0.421
CysTrp: 0.0 ± 0.0
CysTyr: 0.335 ± 0.218
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.349 ± 0.745
AspCys: 0.836 ± 0.362
AspAsp: 4.349 ± 0.82
AspGlu: 5.353 ± 0.953
AspPhe: 3.011 ± 0.645
AspGly: 5.353 ± 0.879
AspHis: 1.84 ± 0.523
AspIle: 6.858 ± 0.85
AspLys: 3.178 ± 0.864
AspLeu: 4.349 ± 1.124
AspMet: 1.004 ± 0.276
AspAsn: 3.513 ± 0.605
AspPro: 2.676 ± 0.671
AspGln: 1.338 ± 0.423
AspArg: 2.509 ± 0.455
AspSer: 3.847 ± 0.637
AspThr: 3.346 ± 0.648
AspVal: 5.353 ± 0.684
AspTrp: 0.669 ± 0.322
AspTyr: 3.011 ± 0.72
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.673 ± 0.425
GluCys: 1.004 ± 0.478
GluAsp: 2.175 ± 0.784
GluGlu: 2.007 ± 0.541
GluPhe: 1.673 ± 0.502
GluGly: 3.011 ± 0.896
GluHis: 1.673 ± 0.424
GluIle: 4.349 ± 0.65
GluLys: 3.011 ± 0.627
GluLeu: 6.357 ± 0.527
GluMet: 1.506 ± 0.491
GluAsn: 2.844 ± 0.681
GluPro: 1.506 ± 0.446
GluGln: 1.506 ± 0.552
GluArg: 4.182 ± 0.818
GluSer: 4.015 ± 0.751
GluThr: 2.509 ± 0.587
GluVal: 4.015 ± 0.805
GluTrp: 0.335 ± 0.158
GluTyr: 2.007 ± 0.645
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.342 ± 0.589
PheCys: 0.502 ± 0.191
PheAsp: 3.513 ± 0.74
PheGlu: 2.509 ± 0.532
PhePhe: 1.171 ± 0.383
PheGly: 2.342 ± 0.392
PheHis: 0.502 ± 0.241
PheIle: 3.68 ± 0.649
PheLys: 4.182 ± 0.714
PheLeu: 2.342 ± 0.717
PheMet: 1.171 ± 0.45
PheAsn: 4.015 ± 0.734
PhePro: 0.167 ± 0.218
PheGln: 1.338 ± 0.477
PheArg: 2.509 ± 0.71
PheSer: 1.506 ± 0.574
PheThr: 1.84 ± 0.814
PheVal: 2.509 ± 0.64
PheTrp: 0.167 ± 0.143
PheTyr: 0.669 ± 0.282
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.676 ± 0.483
GlyCys: 0.335 ± 0.243
GlyAsp: 2.342 ± 0.629
GlyGlu: 1.171 ± 0.315
GlyPhe: 2.342 ± 0.485
GlyGly: 2.844 ± 0.613
GlyHis: 1.338 ± 0.272
GlyIle: 3.68 ± 1.002
GlyLys: 3.513 ± 0.429
GlyLeu: 7.026 ± 1.382
GlyMet: 0.669 ± 0.28
GlyAsn: 3.847 ± 0.638
GlyPro: 1.506 ± 0.488
GlyGln: 2.007 ± 0.314
GlyArg: 3.513 ± 0.878
GlySer: 5.018 ± 0.474
GlyThr: 3.011 ± 0.914
GlyVal: 5.353 ± 0.467
GlyTrp: 0.669 ± 0.252
GlyTyr: 2.175 ± 0.475
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.007 ± 0.332
HisCys: 0.335 ± 0.196
HisAsp: 2.342 ± 0.603
HisGlu: 0.669 ± 0.226
HisPhe: 1.338 ± 0.461
HisGly: 1.84 ± 0.647
HisHis: 0.502 ± 0.358
HisIle: 1.004 ± 0.494
HisLys: 1.84 ± 0.606
HisLeu: 2.007 ± 0.54
HisMet: 0.669 ± 0.419
HisAsn: 1.506 ± 0.582
HisPro: 1.004 ± 0.413
HisGln: 0.669 ± 0.418
HisArg: 0.335 ± 0.435
HisSer: 1.673 ± 0.357
HisThr: 0.836 ± 0.245
HisVal: 1.338 ± 0.506
HisTrp: 0.167 ± 0.145
HisTyr: 1.338 ± 0.429
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.178 ± 1.254
IleCys: 1.004 ± 0.343
IleAsp: 6.022 ± 0.61
IleGlu: 3.513 ± 0.777
IlePhe: 1.673 ± 0.623
IleGly: 5.018 ± 1.113
IleHis: 1.506 ± 0.364
IleIle: 3.68 ± 0.743
IleLys: 4.684 ± 0.756
IleLeu: 5.52 ± 0.82
IleMet: 2.676 ± 0.705
IleAsn: 4.851 ± 0.74
IlePro: 2.844 ± 0.983
IleGln: 2.509 ± 0.877
IleArg: 3.513 ± 0.458
IleSer: 4.182 ± 0.89
IleThr: 5.186 ± 1.104
IleVal: 4.851 ± 1.142
IleTrp: 0.335 ± 0.158
IleTyr: 1.84 ± 0.268
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.011 ± 0.814
LysCys: 0.502 ± 0.449
LysAsp: 4.684 ± 0.645
LysGlu: 3.847 ± 0.696
LysPhe: 2.007 ± 0.468
LysGly: 2.844 ± 0.56
LysHis: 1.004 ± 0.378
LysIle: 3.178 ± 0.447
LysLys: 4.517 ± 0.551
LysLeu: 9.702 ± 1.108
LysMet: 3.178 ± 0.662
LysAsn: 3.513 ± 0.598
LysPro: 2.509 ± 0.559
LysGln: 2.342 ± 0.613
LysArg: 2.844 ± 0.761
LysSer: 4.517 ± 1.291
LysThr: 3.847 ± 0.519
LysVal: 4.851 ± 0.972
LysTrp: 0.335 ± 0.341
LysTyr: 2.509 ± 0.423
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.022 ± 0.935
LeuCys: 0.669 ± 0.366
LeuAsp: 7.36 ± 1.393
LeuGlu: 5.186 ± 0.932
LeuPhe: 2.175 ± 0.38
LeuGly: 3.178 ± 0.817
LeuHis: 1.506 ± 0.658
LeuIle: 7.026 ± 1.264
LeuLys: 7.026 ± 1.098
LeuLeu: 7.026 ± 1.334
LeuMet: 2.007 ± 0.504
LeuAsn: 7.695 ± 1.082
LeuPro: 4.015 ± 0.804
LeuGln: 1.84 ± 0.525
LeuArg: 5.52 ± 0.869
LeuSer: 7.862 ± 1.017
LeuThr: 6.691 ± 1.52
LeuVal: 6.022 ± 0.994
LeuTrp: 0.167 ± 0.133
LeuTyr: 3.847 ± 0.631
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.175 ± 0.723
MetCys: 0.502 ± 0.286
MetAsp: 1.673 ± 0.465
MetGlu: 0.502 ± 0.25
MetPhe: 1.171 ± 0.292
MetGly: 1.171 ± 0.301
MetHis: 0.836 ± 0.375
MetIle: 1.84 ± 0.461
MetLys: 2.007 ± 0.606
MetLeu: 2.676 ± 0.772
MetMet: 1.004 ± 0.381
MetAsn: 2.175 ± 0.416
MetPro: 1.004 ± 0.288
MetGln: 0.669 ± 0.301
MetArg: 1.004 ± 0.402
MetSer: 1.673 ± 0.442
MetThr: 2.007 ± 0.526
MetVal: 1.84 ± 0.499
MetTrp: 0.167 ± 0.133
MetTyr: 1.171 ± 0.491
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.513 ± 0.675
AsnCys: 1.004 ± 0.383
AsnAsp: 5.688 ± 1.183
AsnGlu: 3.346 ± 0.665
AsnPhe: 3.178 ± 0.882
AsnGly: 4.684 ± 0.898
AsnHis: 1.338 ± 0.376
AsnIle: 5.186 ± 0.649
AsnLys: 3.346 ± 0.668
AsnLeu: 5.855 ± 0.928
AsnMet: 1.84 ± 0.527
AsnAsn: 4.517 ± 0.912
AsnPro: 2.342 ± 0.759
AsnGln: 3.011 ± 0.765
AsnArg: 3.011 ± 0.452
AsnSer: 4.349 ± 1.131
AsnThr: 3.513 ± 0.751
AsnVal: 5.186 ± 0.717
AsnTrp: 1.338 ± 0.348
AsnTyr: 3.346 ± 0.493
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.342 ± 0.573
ProCys: 0.167 ± 0.145
ProAsp: 1.004 ± 0.298
ProGlu: 2.342 ± 0.791
ProPhe: 1.84 ± 0.696
ProGly: 1.338 ± 0.433
ProHis: 1.171 ± 0.313
ProIle: 1.673 ± 0.484
ProLys: 1.84 ± 0.528
ProLeu: 3.68 ± 0.955
ProMet: 1.171 ± 0.55
ProAsn: 3.513 ± 0.608
ProPro: 0.0 ± 0.0
ProGln: 0.836 ± 0.323
ProArg: 1.673 ± 0.565
ProSer: 1.673 ± 0.348
ProThr: 1.506 ± 0.34
ProVal: 3.346 ± 0.731
ProTrp: 0.167 ± 0.17
ProTyr: 1.506 ± 0.441
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.84 ± 0.583
GlnCys: 0.502 ± 0.34
GlnAsp: 1.673 ± 0.558
GlnGlu: 1.673 ± 0.299
GlnPhe: 1.84 ± 0.595
GlnGly: 1.506 ± 0.404
GlnHis: 1.338 ± 0.339
GlnIle: 1.84 ± 0.634
GlnLys: 1.673 ± 0.365
GlnLeu: 3.011 ± 0.665
GlnMet: 0.836 ± 0.36
GlnAsn: 2.509 ± 0.931
GlnPro: 0.502 ± 0.231
GlnGln: 0.335 ± 0.288
GlnArg: 1.506 ± 0.535
GlnSer: 2.342 ± 0.82
GlnThr: 2.007 ± 0.615
GlnVal: 1.84 ± 0.465
GlnTrp: 0.502 ± 0.246
GlnTyr: 1.338 ± 0.508
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.182 ± 0.6
ArgCys: 0.335 ± 0.29
ArgAsp: 2.676 ± 0.586
ArgGlu: 3.011 ± 0.723
ArgPhe: 2.175 ± 0.609
ArgGly: 1.84 ± 0.571
ArgHis: 1.338 ± 0.341
ArgIle: 3.346 ± 0.46
ArgLys: 2.007 ± 0.421
ArgLeu: 4.349 ± 0.745
ArgMet: 0.836 ± 0.307
ArgAsn: 4.349 ± 1.111
ArgPro: 1.171 ± 0.338
ArgGln: 1.506 ± 0.456
ArgArg: 2.007 ± 0.486
ArgSer: 3.847 ± 0.596
ArgThr: 2.342 ± 0.601
ArgVal: 3.178 ± 0.885
ArgTrp: 0.167 ± 0.143
ArgTyr: 2.342 ± 0.602
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.353 ± 0.801
SerCys: 0.335 ± 0.347
SerAsp: 4.517 ± 0.73
SerGlu: 3.513 ± 0.663
SerPhe: 2.676 ± 0.456
SerGly: 3.68 ± 0.924
SerHis: 2.509 ± 0.451
SerIle: 5.186 ± 1.064
SerLys: 5.018 ± 1.095
SerLeu: 6.189 ± 0.873
SerMet: 2.007 ± 0.587
SerAsn: 4.182 ± 0.838
SerPro: 2.844 ± 0.588
SerGln: 2.007 ± 0.643
SerArg: 3.68 ± 0.991
SerSer: 5.855 ± 0.899
SerThr: 4.182 ± 0.548
SerVal: 4.349 ± 0.949
SerTrp: 0.502 ± 0.248
SerTyr: 3.847 ± 0.692
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.68 ± 0.872
ThrCys: 0.502 ± 0.278
ThrAsp: 3.011 ± 0.858
ThrGlu: 2.509 ± 0.669
ThrPhe: 1.84 ± 0.563
ThrGly: 3.011 ± 0.525
ThrHis: 1.171 ± 0.374
ThrIle: 4.015 ± 0.843
ThrLys: 4.182 ± 0.647
ThrLeu: 4.684 ± 0.762
ThrMet: 0.836 ± 0.384
ThrAsn: 3.68 ± 0.749
ThrPro: 2.175 ± 0.629
ThrGln: 2.844 ± 0.465
ThrArg: 2.175 ± 0.383
ThrSer: 6.189 ± 0.914
ThrThr: 4.517 ± 0.763
ThrVal: 5.353 ± 0.831
ThrTrp: 0.167 ± 0.17
ThrTyr: 3.513 ± 0.499
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.517 ± 1.377
ValCys: 1.84 ± 0.423
ValAsp: 5.353 ± 0.631
ValGlu: 4.517 ± 1.512
ValPhe: 3.68 ± 0.469
ValGly: 4.349 ± 0.788
ValHis: 1.506 ± 0.35
ValIle: 4.684 ± 0.833
ValLys: 6.022 ± 1.016
ValLeu: 4.684 ± 0.97
ValMet: 0.836 ± 0.374
ValAsn: 5.688 ± 0.749
ValPro: 2.509 ± 0.771
ValGln: 2.509 ± 0.741
ValArg: 2.175 ± 0.551
ValSer: 5.186 ± 0.938
ValThr: 5.353 ± 0.746
ValVal: 5.52 ± 0.683
ValTrp: 0.167 ± 0.143
ValTyr: 3.513 ± 0.726
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.167 ± 0.17
TrpCys: 0.335 ± 0.195
TrpAsp: 0.502 ± 0.24
TrpGlu: 0.335 ± 0.158
TrpPhe: 0.0 ± 0.0
TrpGly: 0.335 ± 0.158
TrpHis: 0.335 ± 0.196
TrpIle: 0.335 ± 0.203
TrpLys: 0.167 ± 0.145
TrpLeu: 1.004 ± 0.39
TrpMet: 0.167 ± 0.187
TrpAsn: 0.502 ± 0.253
TrpPro: 0.335 ± 0.257
TrpGln: 0.335 ± 0.188
TrpArg: 0.167 ± 0.133
TrpSer: 0.335 ± 0.158
TrpThr: 0.669 ± 0.23
TrpVal: 0.335 ± 0.203
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.335 ± 0.269
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.676 ± 0.421
TyrCys: 1.004 ± 0.385
TyrAsp: 2.844 ± 0.676
TyrGlu: 1.84 ± 0.451
TyrPhe: 2.342 ± 0.689
TyrGly: 2.175 ± 0.532
TyrHis: 1.338 ± 0.58
TyrIle: 2.844 ± 0.641
TyrLys: 3.513 ± 0.844
TyrLeu: 4.015 ± 0.88
TyrMet: 1.673 ± 0.337
TyrAsn: 2.342 ± 0.575
TyrPro: 1.171 ± 0.415
TyrGln: 0.669 ± 0.314
TyrArg: 2.175 ± 0.808
TyrSer: 3.513 ± 0.828
TyrThr: 1.673 ± 0.451
TyrVal: 3.011 ± 0.519
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.673 ± 0.51
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (5979 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
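A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is reported as the error. This is a minimal sketch, not the database's own code. It assumes that the ± value is a standard deviation, and the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over `rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins (not residues) with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    # Assumption: the reported error is the sample standard deviation
    return stdev(estimates)


# Toy sequences for illustration, not Banna virus proteins
toy = ["MKLLA", "KLAMK", "LLKLA"]
err = bootstrap_error(toy, "KL")
print(per_mille(toy, "KL"), err)
```

Resampling at the protein level keeps each protein intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome rather than on individual residues.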

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski