Amino acid dipeptide frequency for Tioman virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
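As a rough illustration of the calculation, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input, and the toy sequences are assumptions made for illustration, not the Proteome-pI implementation. In particular, it assumes that pairs are counted within each protein only and normalized by the total number of pairs.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Hypothetical helper: the exact normalization used by the
    database is not stated on this page.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not Tioman virus sequences); U is non-standard and becomes X.
freqs = dipeptide_frequencies(["MLSLS", "ALSU"])
```

Here `freqs["LS"]` is the per mille share of Leu-Ser among the seven pairs found in the two toy sequences.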

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random, with all amino acids equally likely, each of the 400 would therefore be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
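The uniform expectation and the over/under labelling can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform value is an assumption: the page does not say whether the colouring also takes the error estimate into account.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_per_mille, expected):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

expected = expected_per_mille()  # 1000 / 400 = 2.5 per mille, i.e. 0.25%

# Values copied from the table on this page (per mille).
examples = {"LeuSer": 13.68, "AlaAla": 6.058, "HisSer": 0.195, "CysTrp": 0.0}
labels = {name: classify(value, expected) for name, value in examples.items()}
```

With 21 residue types (Xaa included), `expected_per_mille(21)` gives roughly 2.27 per mille instead of 2.5.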

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
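As a small worked example of the unit conversion, the snippet below turns one table value into a fraction and a percentage. The function names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

leu_ser = 13.68  # LeuSer value from the table, in per mille
fraction = per_mille_to_fraction(leu_ser)  # about 0.01368
percent = per_mille_to_percent(leu_ser)    # about 1.368
```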

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.058 ± 2.093
AlaCys: 1.759 ± 0.508
AlaAsp: 3.909 ± 1.008
AlaGlu: 3.518 ± 1.333
AlaPhe: 2.931 ± 0.522
AlaGly: 3.322 ± 0.947
AlaHis: 1.563 ± 0.308
AlaIle: 5.863 ± 1.141
AlaLys: 2.736 ± 0.693
AlaLeu: 5.667 ± 1.634
AlaMet: 1.563 ± 0.588
AlaAsn: 3.518 ± 0.584
AlaPro: 2.931 ± 1.252
AlaGln: 2.345 ± 1.104
AlaArg: 4.495 ± 1.122
AlaSer: 3.909 ± 0.435
AlaThr: 4.299 ± 1.262
AlaVal: 2.736 ± 1.052
AlaTrp: 1.173 ± 0.455
AlaTyr: 1.563 ± 0.378
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.368 ± 0.495
CysCys: 0.586 ± 0.311
CysAsp: 1.173 ± 0.235
CysGlu: 1.173 ± 0.549
CysPhe: 1.173 ± 0.235
CysGly: 0.977 ± 0.373
CysHis: 0.586 ± 0.345
CysIle: 1.759 ± 0.467
CysLys: 1.563 ± 0.377
CysLeu: 1.954 ± 0.525
CysMet: 0.586 ± 0.217
CysAsn: 1.173 ± 0.583
CysPro: 1.173 ± 0.583
CysGln: 0.977 ± 0.449
CysArg: 0.782 ± 0.337
CysSer: 1.759 ± 0.603
CysThr: 0.977 ± 0.686
CysVal: 0.977 ± 0.326
CysTrp: 0.0 ± 0.0
CysTyr: 1.173 ± 0.597
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.518 ± 0.747
AspCys: 0.977 ± 0.571
AspAsp: 2.345 ± 0.838
AspGlu: 4.104 ± 1.238
AspPhe: 1.173 ± 0.442
AspGly: 1.173 ± 0.376
AspHis: 0.586 ± 0.37
AspIle: 4.299 ± 1.236
AspLys: 2.736 ± 0.489
AspLeu: 6.645 ± 1.543
AspMet: 1.173 ± 0.628
AspAsn: 1.954 ± 0.643
AspPro: 4.495 ± 1.291
AspGln: 3.127 ± 0.923
AspArg: 2.541 ± 0.6
AspSer: 1.954 ± 1.032
AspThr: 3.127 ± 0.769
AspVal: 2.345 ± 1.122
AspTrp: 0.977 ± 0.355
AspTyr: 1.759 ± 0.739
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.931 ± 0.749
GluCys: 0.977 ± 0.326
GluAsp: 1.563 ± 0.597
GluGlu: 3.518 ± 1.429
GluPhe: 1.759 ± 0.714
GluGly: 2.541 ± 0.59
GluHis: 0.586 ± 0.511
GluIle: 4.886 ± 1.052
GluLys: 3.713 ± 1.449
GluLeu: 5.277 ± 0.909
GluMet: 0.977 ± 0.412
GluAsn: 1.954 ± 0.398
GluPro: 1.368 ± 0.446
GluGln: 2.736 ± 1.008
GluArg: 1.368 ± 0.747
GluSer: 4.69 ± 1.335
GluThr: 3.518 ± 1.198
GluVal: 3.518 ± 0.475
GluTrp: 0.586 ± 0.294
GluTyr: 1.368 ± 0.369
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.541 ± 0.503
PheCys: 0.977 ± 0.281
PheAsp: 2.345 ± 0.72
PheGlu: 1.368 ± 0.601
PhePhe: 1.563 ± 0.492
PheGly: 1.759 ± 1.066
PheHis: 0.782 ± 0.284
PheIle: 2.345 ± 0.753
PheLys: 1.954 ± 0.836
PheLeu: 4.886 ± 0.986
PheMet: 0.586 ± 0.261
PheAsn: 1.759 ± 0.648
PhePro: 1.759 ± 0.59
PheGln: 1.368 ± 0.619
PheArg: 0.977 ± 0.452
PheSer: 2.15 ± 0.601
PheThr: 1.954 ± 0.578
PheVal: 0.782 ± 0.284
PheTrp: 0.0 ± 0.0
PheTyr: 0.586 ± 0.254
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.909 ± 1.566
GlyCys: 0.586 ± 0.513
GlyAsp: 2.345 ± 0.768
GlyGlu: 3.127 ± 0.902
GlyPhe: 1.563 ± 0.938
GlyGly: 2.345 ± 0.421
GlyHis: 1.173 ± 0.384
GlyIle: 4.69 ± 1.11
GlyLys: 2.541 ± 0.56
GlyLeu: 5.472 ± 0.896
GlyMet: 0.782 ± 0.247
GlyAsn: 3.518 ± 1.246
GlyPro: 1.563 ± 1.083
GlyGln: 2.15 ± 0.524
GlyArg: 3.713 ± 1.252
GlySer: 4.886 ± 0.755
GlyThr: 3.909 ± 1.185
GlyVal: 2.541 ± 0.799
GlyTrp: 0.586 ± 0.283
GlyTyr: 1.368 ± 0.625
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.782 ± 0.284
HisCys: 0.782 ± 0.487
HisAsp: 0.977 ± 0.464
HisGlu: 1.173 ± 0.457
HisPhe: 0.782 ± 0.543
HisGly: 0.586 ± 0.256
HisHis: 1.368 ± 0.746
HisIle: 1.563 ± 0.597
HisLys: 0.391 ± 0.334
HisLeu: 2.736 ± 0.993
HisMet: 0.391 ± 0.332
HisAsn: 0.391 ± 0.362
HisPro: 0.977 ± 0.468
HisGln: 1.563 ± 0.835
HisArg: 1.368 ± 0.747
HisSer: 0.195 ± 0.123
HisThr: 0.782 ± 0.354
HisVal: 0.977 ± 0.429
HisTrp: 0.195 ± 0.123
HisTyr: 0.391 ± 0.323
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.495 ± 0.989
IleCys: 1.173 ± 0.474
IleAsp: 3.322 ± 0.771
IleGlu: 3.127 ± 0.751
IlePhe: 1.954 ± 0.46
IleGly: 3.322 ± 0.774
IleHis: 1.368 ± 0.442
IleIle: 5.277 ± 1.555
IleLys: 3.909 ± 1.007
IleLeu: 8.99 ± 1.847
IleMet: 2.15 ± 0.394
IleAsn: 4.886 ± 1.951
IlePro: 4.69 ± 1.042
IleGln: 2.931 ± 1.444
IleArg: 3.713 ± 0.632
IleSer: 7.426 ± 1.185
IleThr: 4.69 ± 0.781
IleVal: 6.058 ± 1.506
IleTrp: 0.977 ± 0.334
IleTyr: 1.954 ± 0.587
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.15 ± 0.502
LysCys: 1.173 ± 0.391
LysAsp: 1.563 ± 0.622
LysGlu: 4.104 ± 0.957
LysPhe: 1.368 ± 0.376
LysGly: 3.518 ± 1.259
LysHis: 0.782 ± 0.341
LysIle: 1.954 ± 0.674
LysLys: 1.759 ± 0.551
LysLeu: 4.69 ± 1.253
LysMet: 1.368 ± 0.656
LysAsn: 2.345 ± 0.553
LysPro: 1.954 ± 1.28
LysGln: 2.541 ± 0.706
LysArg: 2.541 ± 0.614
LysSer: 5.472 ± 1.218
LysThr: 2.541 ± 0.584
LysVal: 4.104 ± 1.025
LysTrp: 0.195 ± 0.212
LysTyr: 1.759 ± 0.503
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.449 ± 1.118
LeuCys: 2.345 ± 0.855
LeuAsp: 9.185 ± 1.271
LeuGlu: 3.909 ± 0.663
LeuPhe: 3.713 ± 1.25
LeuGly: 5.081 ± 1.002
LeuHis: 1.368 ± 0.456
LeuIle: 6.254 ± 0.857
LeuLys: 5.277 ± 1.405
LeuLeu: 8.013 ± 1.415
LeuMet: 2.345 ± 0.554
LeuAsn: 6.058 ± 1.547
LeuPro: 4.299 ± 1.166
LeuGln: 5.277 ± 1.015
LeuArg: 3.518 ± 1.063
LeuSer: 13.68 ± 1.947
LeuThr: 9.771 ± 2.25
LeuVal: 7.035 ± 1.317
LeuTrp: 1.368 ± 0.304
LeuTyr: 2.541 ± 0.513
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.15 ± 1.074
MetCys: 0.586 ± 0.261
MetAsp: 0.977 ± 0.664
MetGlu: 0.782 ± 0.247
MetPhe: 0.0 ± 0.0
MetGly: 0.977 ± 0.545
MetHis: 0.195 ± 0.219
MetIle: 2.15 ± 1.132
MetLys: 0.195 ± 0.123
MetLeu: 1.368 ± 0.481
MetMet: 1.563 ± 1.153
MetAsn: 1.173 ± 0.539
MetPro: 0.586 ± 0.261
MetGln: 1.368 ± 0.747
MetArg: 2.541 ± 0.462
MetSer: 3.127 ± 1.081
MetThr: 1.759 ± 0.46
MetVal: 1.368 ± 0.407
MetTrp: 0.586 ± 0.283
MetTyr: 0.586 ± 0.301
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.345 ± 0.828
AsnCys: 1.368 ± 1.001
AsnAsp: 2.541 ± 0.558
AsnGlu: 1.368 ± 0.681
AsnPhe: 1.368 ± 0.633
AsnGly: 2.931 ± 0.759
AsnHis: 0.782 ± 0.433
AsnIle: 4.104 ± 0.665
AsnLys: 1.759 ± 0.377
AsnLeu: 8.013 ± 1.642
AsnMet: 0.782 ± 0.398
AsnAsn: 2.345 ± 0.738
AsnPro: 4.495 ± 1.153
AsnGln: 2.931 ± 0.631
AsnArg: 2.345 ± 0.725
AsnSer: 4.69 ± 1.054
AsnThr: 3.518 ± 0.998
AsnVal: 2.15 ± 0.437
AsnTrp: 0.782 ± 0.494
AsnTyr: 1.173 ± 0.407
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.104 ± 0.763
ProCys: 0.586 ± 0.273
ProAsp: 3.127 ± 1.628
ProGlu: 2.736 ± 0.887
ProPhe: 2.541 ± 1.037
ProGly: 3.322 ± 0.924
ProHis: 0.782 ± 0.266
ProIle: 4.299 ± 1.125
ProLys: 3.127 ± 1.376
ProLeu: 6.058 ± 0.894
ProMet: 1.173 ± 0.24
ProAsn: 2.541 ± 0.633
ProPro: 4.104 ± 1.391
ProGln: 1.173 ± 0.377
ProArg: 2.931 ± 1.089
ProSer: 4.495 ± 1.675
ProThr: 3.713 ± 0.718
ProVal: 2.736 ± 0.568
ProTrp: 0.195 ± 0.231
ProTyr: 1.954 ± 0.632
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.127 ± 1.429
GlnCys: 1.173 ± 0.444
GlnAsp: 1.759 ± 0.889
GlnGlu: 1.954 ± 1.139
GlnPhe: 0.977 ± 0.557
GlnGly: 3.322 ± 1.592
GlnHis: 0.977 ± 0.604
GlnIle: 4.495 ± 1.213
GlnLys: 2.931 ± 0.682
GlnLeu: 5.863 ± 0.868
GlnMet: 1.173 ± 0.448
GlnAsn: 2.931 ± 0.793
GlnPro: 1.759 ± 0.534
GlnGln: 2.345 ± 0.97
GlnArg: 1.173 ± 0.24
GlnSer: 3.713 ± 1.109
GlnThr: 2.541 ± 0.401
GlnVal: 2.345 ± 0.512
GlnTrp: 0.195 ± 0.258
GlnTyr: 0.977 ± 0.427
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.954 ± 0.813
ArgCys: 0.782 ± 0.4
ArgAsp: 2.15 ± 0.717
ArgGlu: 1.563 ± 0.542
ArgPhe: 1.563 ± 0.421
ArgGly: 4.495 ± 1.586
ArgHis: 1.954 ± 0.618
ArgIle: 4.104 ± 0.483
ArgLys: 2.541 ± 0.977
ArgLeu: 6.254 ± 0.922
ArgMet: 0.977 ± 0.345
ArgAsn: 2.931 ± 0.818
ArgPro: 2.931 ± 0.966
ArgGln: 2.541 ± 1.062
ArgArg: 4.104 ± 0.577
ArgSer: 3.909 ± 0.64
ArgThr: 2.15 ± 0.328
ArgVal: 2.736 ± 1.061
ArgTrp: 0.391 ± 0.217
ArgTyr: 1.759 ± 0.58
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.035 ± 1.698
SerCys: 2.736 ± 0.909
SerAsp: 4.495 ± 0.786
SerGlu: 3.518 ± 1.048
SerPhe: 1.368 ± 0.481
SerGly: 4.495 ± 0.825
SerHis: 0.391 ± 0.247
SerIle: 5.667 ± 2.028
SerLys: 2.931 ± 1.0
SerLeu: 8.794 ± 1.061
SerMet: 2.345 ± 0.702
SerAsn: 2.931 ± 0.453
SerPro: 6.645 ± 1.846
SerGln: 3.127 ± 0.658
SerArg: 3.127 ± 1.003
SerSer: 6.645 ± 1.587
SerThr: 5.863 ± 1.755
SerVal: 6.449 ± 1.341
SerTrp: 1.173 ± 0.387
SerTyr: 3.127 ± 0.919
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.69 ± 0.941
ThrCys: 1.173 ± 0.4
ThrAsp: 2.736 ± 1.098
ThrGlu: 4.299 ± 1.083
ThrPhe: 2.345 ± 0.483
ThrGly: 3.909 ± 1.178
ThrHis: 1.563 ± 0.412
ThrIle: 3.909 ± 0.988
ThrLys: 2.15 ± 0.835
ThrLeu: 6.058 ± 0.866
ThrMet: 1.368 ± 0.351
ThrAsn: 3.713 ± 1.272
ThrPro: 4.495 ± 0.423
ThrGln: 3.713 ± 0.853
ThrArg: 3.909 ± 0.901
ThrSer: 4.299 ± 1.375
ThrThr: 5.472 ± 1.032
ThrVal: 3.322 ± 0.787
ThrTrp: 0.782 ± 0.378
ThrTyr: 2.541 ± 0.996
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.931 ± 1.096
ValCys: 0.977 ± 0.566
ValAsp: 3.322 ± 0.54
ValGlu: 2.541 ± 0.429
ValPhe: 2.736 ± 0.441
ValGly: 3.127 ± 1.132
ValHis: 1.173 ± 0.601
ValIle: 4.69 ± 0.744
ValLys: 3.909 ± 1.454
ValLeu: 5.081 ± 1.686
ValMet: 1.759 ± 0.483
ValAsn: 2.931 ± 0.917
ValPro: 3.127 ± 0.589
ValGln: 2.345 ± 0.645
ValArg: 5.081 ± 1.217
ValSer: 2.541 ± 1.0
ValThr: 3.713 ± 0.721
ValVal: 2.541 ± 1.628
ValTrp: 0.391 ± 0.331
ValTyr: 1.954 ± 0.309
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.782 ± 0.278
TrpCys: 0.195 ± 0.31
TrpAsp: 0.195 ± 0.123
TrpGlu: 0.586 ± 0.261
TrpPhe: 0.586 ± 0.255
TrpGly: 0.977 ± 0.449
TrpHis: 0.195 ± 0.231
TrpIle: 1.368 ± 0.373
TrpLys: 0.782 ± 0.354
TrpLeu: 0.782 ± 0.33
TrpMet: 0.0 ± 0.0
TrpAsn: 0.782 ± 0.284
TrpPro: 1.173 ± 0.375
TrpGln: 0.0 ± 0.0
TrpArg: 0.195 ± 0.123
TrpSer: 0.977 ± 0.603
TrpThr: 0.195 ± 0.123
TrpVal: 0.391 ± 0.334
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.391 ± 0.209
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.541 ± 0.725
TyrCys: 0.977 ± 0.316
TyrAsp: 0.977 ± 0.338
TyrGlu: 1.368 ± 0.473
TyrPhe: 0.977 ± 0.36
TyrGly: 0.586 ± 0.283
TyrHis: 0.195 ± 0.123
TyrIle: 2.736 ± 0.794
TyrLys: 0.977 ± 0.276
TyrLeu: 4.886 ± 1.341
TyrMet: 0.782 ± 0.312
TyrAsn: 1.954 ± 1.032
TyrPro: 1.173 ± 0.431
TyrGln: 1.173 ± 0.376
TyrArg: 1.368 ± 0.606
TyrSer: 2.541 ± 0.621
TyrThr: 1.954 ± 0.564
TyrVal: 1.759 ± 0.643
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.368 ± 0.48
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5118 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
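One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those values is taken as the error. This is an assumed procedure for illustration. The function names, the fixed seed, and the toy sequences are hypothetical, and the database may compute the error differently.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of a pair frequency over resamples.

    Resampling is done at the protein level: each replicate draws
    len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy sequences in one-letter code (not Tioman virus proteins).
toy_proteins = ["MLSLSA", "ALSKK", "GGLSP", "MKKLV"]
ls_error = bootstrap_error(toy_proteins, "LS")
```

With only a handful of proteins, as here and in the 8-protein proteome on this page, the resamples differ a lot from one another, which is consistent with the fairly wide ± values in the table.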

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski