Amino acid dipepetide frequency for Icoaraci virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.318AlaAla: 6.318 ± 2.489
1.769AlaCys: 1.769 ± 0.325
2.527AlaAsp: 2.527 ± 0.307
3.285AlaGlu: 3.285 ± 0.798
1.516AlaPhe: 1.516 ± 0.448
3.791AlaGly: 3.791 ± 0.502
1.769AlaHis: 1.769 ± 0.928
3.791AlaIle: 3.791 ± 0.472
2.78AlaLys: 2.78 ± 0.673
4.802AlaLeu: 4.802 ± 0.529
3.538AlaMet: 3.538 ± 0.94
1.264AlaAsn: 1.264 ± 0.558
2.022AlaPro: 2.022 ± 0.367
0.758AlaGln: 0.758 ± 0.15
3.791AlaArg: 3.791 ± 0.899
3.538AlaSer: 3.538 ± 0.493
2.78AlaThr: 2.78 ± 0.707
3.791AlaVal: 3.791 ± 0.65
0.0AlaTrp: 0.0 ± 0.0
2.022AlaTyr: 2.022 ± 0.626
0.0AlaXaa: 0.0 ± 0.0
Cys
0.505CysAla: 0.505 ± 0.113
0.505CysCys: 0.505 ± 0.277
0.758CysAsp: 0.758 ± 0.15
0.758CysGlu: 0.758 ± 0.415
1.516CysPhe: 1.516 ± 0.666
1.011CysGly: 1.011 ± 0.497
1.011CysHis: 1.011 ± 0.497
1.264CysIle: 1.264 ± 0.575
1.516CysLys: 1.516 ± 0.3
1.769CysLeu: 1.769 ± 0.546
0.758CysMet: 0.758 ± 0.298
1.516CysAsn: 1.516 ± 0.595
1.264CysPro: 1.264 ± 0.402
2.022CysGln: 2.022 ± 0.654
0.505CysArg: 0.505 ± 0.113
3.791CysSer: 3.791 ± 0.803
1.769CysThr: 1.769 ± 0.794
1.516CysVal: 1.516 ± 1.218
0.0CysTrp: 0.0 ± 0.0
1.264CysTyr: 1.264 ± 0.402
0.0CysXaa: 0.0 ± 0.0
Asp
3.285AspAla: 3.285 ± 1.115
1.516AspCys: 1.516 ± 1.218
5.054AspAsp: 5.054 ± 1.277
4.549AspGlu: 4.549 ± 1.272
2.022AspPhe: 2.022 ± 0.803
3.538AspGly: 3.538 ± 0.661
1.769AspHis: 1.769 ± 0.327
1.769AspIle: 1.769 ± 0.479
3.033AspLys: 3.033 ± 1.062
4.296AspLeu: 4.296 ± 0.68
2.527AspMet: 2.527 ± 0.544
2.527AspAsn: 2.527 ± 0.563
1.769AspPro: 1.769 ± 0.272
1.769AspGln: 1.769 ± 0.465
1.769AspArg: 1.769 ± 0.509
4.296AspSer: 4.296 ± 0.835
3.791AspThr: 3.791 ± 0.415
2.527AspVal: 2.527 ± 0.307
1.264AspTrp: 1.264 ± 0.226
2.022AspTyr: 2.022 ± 0.438
0.0AspXaa: 0.0 ± 0.0
Glu
5.054GluAla: 5.054 ± 1.808
1.264GluCys: 1.264 ± 0.402
5.812GluAsp: 5.812 ± 1.356
6.318GluGlu: 6.318 ± 1.49
3.791GluPhe: 3.791 ± 0.373
4.802GluGly: 4.802 ± 0.457
0.505GluHis: 0.505 ± 0.113
5.56GluIle: 5.56 ± 0.318
4.296GluLys: 4.296 ± 0.687
6.318GluLeu: 6.318 ± 1.118
1.264GluMet: 1.264 ± 0.804
3.538GluAsn: 3.538 ± 1.307
2.022GluPro: 2.022 ± 0.804
1.011GluGln: 1.011 ± 0.306
4.043GluArg: 4.043 ± 0.849
4.549GluSer: 4.549 ± 0.457
2.274GluThr: 2.274 ± 0.45
5.054GluVal: 5.054 ± 1.293
0.505GluTrp: 0.505 ± 0.113
2.022GluTyr: 2.022 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.78PheAla: 2.78 ± 1.071
1.264PheCys: 1.264 ± 0.402
2.78PheAsp: 2.78 ± 1.186
1.769PheGlu: 1.769 ± 0.689
2.527PhePhe: 2.527 ± 1.079
1.516PheGly: 1.516 ± 0.531
0.505PheHis: 0.505 ± 0.277
2.78PheIle: 2.78 ± 0.625
2.274PheLys: 2.274 ± 0.941
3.538PheLeu: 3.538 ± 0.419
1.516PheMet: 1.516 ± 0.338
3.033PheAsn: 3.033 ± 0.771
2.274PhePro: 2.274 ± 1.216
0.505PheGln: 0.505 ± 0.406
1.769PheArg: 1.769 ± 0.772
4.043PheSer: 4.043 ± 0.535
2.78PheThr: 2.78 ± 0.212
3.285PheVal: 3.285 ± 0.449
0.758PheTrp: 0.758 ± 0.15
0.505PheTyr: 0.505 ± 0.451
0.0PheXaa: 0.0 ± 0.0
Gly
4.549GlyAla: 4.549 ± 0.852
1.264GlyCys: 1.264 ± 0.698
2.527GlyAsp: 2.527 ± 0.453
3.791GlyGlu: 3.791 ± 0.943
3.538GlyPhe: 3.538 ± 0.102
5.307GlyGly: 5.307 ± 0.557
1.516GlyHis: 1.516 ± 0.37
3.033GlyIle: 3.033 ± 0.387
4.549GlyLys: 4.549 ± 0.704
6.318GlyLeu: 6.318 ± 1.411
2.274GlyMet: 2.274 ± 0.211
3.538GlyAsn: 3.538 ± 0.814
2.527GlyPro: 2.527 ± 0.546
1.516GlyGln: 1.516 ± 0.666
3.033GlyArg: 3.033 ± 0.395
5.812GlySer: 5.812 ± 0.792
2.022GlyThr: 2.022 ± 0.63
4.043GlyVal: 4.043 ± 1.063
1.011GlyTrp: 1.011 ± 0.547
2.274GlyTyr: 2.274 ± 1.079
0.0GlyXaa: 0.0 ± 0.0
His
0.253HisAla: 0.253 ± 0.138
1.011HisCys: 1.011 ± 0.225
1.264HisAsp: 1.264 ± 0.31
0.505HisGlu: 0.505 ± 0.113
1.011HisPhe: 1.011 ± 0.435
3.033HisGly: 3.033 ± 0.467
0.253HisHis: 0.253 ± 0.423
1.516HisIle: 1.516 ± 0.3
1.769HisLys: 1.769 ± 0.772
2.274HisLeu: 2.274 ± 0.352
0.0HisMet: 0.0 ± 0.0
1.011HisAsn: 1.011 ± 0.266
1.516HisPro: 1.516 ± 0.766
1.516HisGln: 1.516 ± 0.595
1.264HisArg: 1.264 ± 0.551
1.011HisSer: 1.011 ± 0.225
1.264HisThr: 1.264 ± 0.226
1.264HisVal: 1.264 ± 0.396
0.0HisTrp: 0.0 ± 0.0
1.769HisTyr: 1.769 ± 0.667
0.0HisXaa: 0.0 ± 0.0
Ile
4.043IleAla: 4.043 ± 0.876
0.758IleCys: 0.758 ± 0.15
2.527IleAsp: 2.527 ± 0.513
3.791IleGlu: 3.791 ± 1.189
2.022IlePhe: 2.022 ± 0.532
4.296IleGly: 4.296 ± 1.064
1.264IleHis: 1.264 ± 0.396
2.78IleIle: 2.78 ± 0.346
4.296IleLys: 4.296 ± 0.471
4.549IleLeu: 4.549 ± 0.888
3.033IleMet: 3.033 ± 0.592
2.78IleAsn: 2.78 ± 0.32
3.033IlePro: 3.033 ± 1.041
2.78IleGln: 2.78 ± 0.646
4.296IleArg: 4.296 ± 1.312
4.043IleSer: 4.043 ± 0.992
3.285IleThr: 3.285 ± 0.456
4.043IleVal: 4.043 ± 1.014
0.758IleTrp: 0.758 ± 0.15
2.022IleTyr: 2.022 ± 0.292
0.0IleXaa: 0.0 ± 0.0
Lys
3.033LysAla: 3.033 ± 0.548
2.274LysCys: 2.274 ± 0.893
3.285LysAsp: 3.285 ± 0.802
5.812LysGlu: 5.812 ± 0.313
2.78LysPhe: 2.78 ± 0.64
2.78LysGly: 2.78 ± 0.476
1.516LysHis: 1.516 ± 0.656
5.054LysIle: 5.054 ± 0.959
4.802LysLys: 4.802 ± 0.88
3.538LysLeu: 3.538 ± 0.439
3.791LysMet: 3.791 ± 1.347
2.022LysAsn: 2.022 ± 0.292
4.802LysPro: 4.802 ± 0.867
1.769LysGln: 1.769 ± 0.325
2.022LysArg: 2.022 ± 0.451
6.823LysSer: 6.823 ± 1.019
3.285LysThr: 3.285 ± 0.591
4.802LysVal: 4.802 ± 0.68
1.769LysTrp: 1.769 ± 0.232
1.769LysTyr: 1.769 ± 0.409
0.0LysXaa: 0.0 ± 0.0
Leu
4.802LeuAla: 4.802 ± 0.642
1.769LeuCys: 1.769 ± 0.534
5.054LeuAsp: 5.054 ± 0.734
8.087LeuGlu: 8.087 ± 1.695
3.791LeuPhe: 3.791 ± 1.04
5.812LeuGly: 5.812 ± 0.437
2.274LeuHis: 2.274 ± 0.493
6.065LeuIle: 6.065 ± 2.51
6.823LeuLys: 6.823 ± 1.702
7.329LeuLeu: 7.329 ± 0.972
3.285LeuMet: 3.285 ± 0.773
2.022LeuAsn: 2.022 ± 0.63
1.516LeuPro: 1.516 ± 0.435
3.285LeuGln: 3.285 ± 0.591
5.812LeuArg: 5.812 ± 1.224
9.351LeuSer: 9.351 ± 0.815
2.78LeuThr: 2.78 ± 0.707
4.549LeuVal: 4.549 ± 0.704
0.253LeuTrp: 0.253 ± 0.203
1.264LeuTyr: 1.264 ± 0.396
0.0LeuXaa: 0.0 ± 0.0
Met
1.264MetAla: 1.264 ± 0.24
0.253MetCys: 0.253 ± 0.138
2.527MetAsp: 2.527 ± 0.44
2.274MetGlu: 2.274 ± 0.394
0.758MetPhe: 0.758 ± 0.415
1.769MetGly: 1.769 ± 0.232
1.264MetHis: 1.264 ± 0.594
2.78MetIle: 2.78 ± 1.078
1.516MetLys: 1.516 ± 0.338
4.296MetLeu: 4.296 ± 1.193
1.516MetMet: 1.516 ± 0.766
1.769MetAsn: 1.769 ± 1.565
1.264MetPro: 1.264 ± 0.325
1.516MetGln: 1.516 ± 0.216
1.011MetArg: 1.011 ± 0.266
3.033MetSer: 3.033 ± 0.509
1.516MetThr: 1.516 ± 0.448
3.538MetVal: 3.538 ± 0.996
0.505MetTrp: 0.505 ± 0.113
0.505MetTyr: 0.505 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
1.516AsnAla: 1.516 ± 0.37
0.758AsnCys: 0.758 ± 0.15
1.011AsnAsp: 1.011 ± 0.266
3.791AsnGlu: 3.791 ± 1.726
2.527AsnPhe: 2.527 ± 0.243
2.022AsnGly: 2.022 ± 0.24
0.505AsnHis: 0.505 ± 0.113
2.527AsnIle: 2.527 ± 0.544
3.538AsnLys: 3.538 ± 0.803
4.802AsnLeu: 4.802 ± 1.537
0.505AsnMet: 0.505 ± 0.451
2.527AsnAsn: 2.527 ± 0.453
4.549AsnPro: 4.549 ± 1.65
1.011AsnGln: 1.011 ± 0.497
3.538AsnArg: 3.538 ± 0.772
1.011AsnSer: 1.011 ± 0.435
1.769AsnThr: 1.769 ± 0.325
2.274AsnVal: 2.274 ± 0.751
0.253AsnTrp: 0.253 ± 0.203
1.264AsnTyr: 1.264 ± 0.457
0.0AsnXaa: 0.0 ± 0.0
Pro
2.022ProAla: 2.022 ± 0.477
0.758ProCys: 0.758 ± 0.585
3.791ProAsp: 3.791 ± 0.371
4.296ProGlu: 4.296 ± 1.622
1.516ProPhe: 1.516 ± 0.447
3.285ProGly: 3.285 ± 1.181
0.253ProHis: 0.253 ± 0.305
2.78ProIle: 2.78 ± 0.346
4.043ProLys: 4.043 ± 0.876
4.549ProLeu: 4.549 ± 0.485
1.516ProMet: 1.516 ± 0.216
2.022ProAsn: 2.022 ± 0.852
1.011ProPro: 1.011 ± 0.497
1.769ProGln: 1.769 ± 0.325
1.264ProArg: 1.264 ± 0.484
4.296ProSer: 4.296 ± 0.319
2.022ProThr: 2.022 ± 1.116
2.022ProVal: 2.022 ± 0.63
1.516ProTrp: 1.516 ± 0.37
0.758ProTyr: 0.758 ± 0.405
0.0ProXaa: 0.0 ± 0.0
Gln
2.527GlnAla: 2.527 ± 0.546
1.011GlnCys: 1.011 ± 0.497
1.264GlnAsp: 1.264 ± 0.226
1.264GlnGlu: 1.264 ± 0.226
0.758GlnPhe: 0.758 ± 0.25
2.527GlnGly: 2.527 ± 0.48
1.516GlnHis: 1.516 ± 0.3
2.78GlnIle: 2.78 ± 0.673
3.033GlnLys: 3.033 ± 0.141
1.769GlnLeu: 1.769 ± 1.104
1.011GlnMet: 1.011 ± 0.327
0.758GlnAsn: 0.758 ± 0.415
1.516GlnPro: 1.516 ± 0.501
1.011GlnGln: 1.011 ± 0.48
1.011GlnArg: 1.011 ± 0.266
3.033GlnSer: 3.033 ± 0.676
1.769GlnThr: 1.769 ± 1.103
1.516GlnVal: 1.516 ± 0.656
0.253GlnTrp: 0.253 ± 0.138
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.538ArgAla: 3.538 ± 0.102
1.516ArgCys: 1.516 ± 1.218
2.274ArgAsp: 2.274 ± 0.792
3.033ArgGlu: 3.033 ± 0.241
1.011ArgPhe: 1.011 ± 0.554
3.285ArgGly: 3.285 ± 0.918
1.264ArgHis: 1.264 ± 0.226
3.791ArgIle: 3.791 ± 0.696
3.033ArgLys: 3.033 ± 0.692
3.538ArgLeu: 3.538 ± 0.439
2.022ArgMet: 2.022 ± 0.668
2.022ArgAsn: 2.022 ± 0.367
3.033ArgPro: 3.033 ± 0.548
1.264ArgGln: 1.264 ± 0.31
2.78ArgArg: 2.78 ± 0.497
4.296ArgSer: 4.296 ± 1.036
3.538ArgThr: 3.538 ± 0.599
4.296ArgVal: 4.296 ± 0.421
1.011ArgTrp: 1.011 ± 0.463
1.011ArgTyr: 1.011 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
4.043SerAla: 4.043 ± 0.486
2.022SerCys: 2.022 ± 1.305
6.318SerAsp: 6.318 ± 1.405
5.307SerGlu: 5.307 ± 0.408
3.791SerPhe: 3.791 ± 1.017
5.56SerGly: 5.56 ± 0.776
2.274SerHis: 2.274 ± 0.43
4.043SerIle: 4.043 ± 0.853
6.571SerLys: 6.571 ± 1.211
8.087SerLeu: 8.087 ± 1.652
1.516SerMet: 1.516 ± 0.3
2.78SerAsn: 2.78 ± 0.212
4.802SerPro: 4.802 ± 0.546
2.78SerGln: 2.78 ± 0.346
4.802SerArg: 4.802 ± 1.383
10.109SerSer: 10.109 ± 2.837
4.549SerThr: 4.549 ± 0.338
5.054SerVal: 5.054 ± 0.973
1.516SerTrp: 1.516 ± 0.719
2.274SerTyr: 2.274 ± 0.366
0.0SerXaa: 0.0 ± 0.0
Thr
2.274ThrAla: 2.274 ± 0.43
2.022ThrCys: 2.022 ± 0.759
2.274ThrAsp: 2.274 ± 0.493
3.033ThrGlu: 3.033 ± 0.548
2.78ThrPhe: 2.78 ± 0.346
2.78ThrGly: 2.78 ± 0.995
0.758ThrHis: 0.758 ± 0.415
3.538ThrIle: 3.538 ± 0.543
2.78ThrLys: 2.78 ± 0.32
6.065ThrLeu: 6.065 ± 0.317
1.011ThrMet: 1.011 ± 0.247
1.516ThrAsn: 1.516 ± 0.692
2.78ThrPro: 2.78 ± 0.32
0.758ThrGln: 0.758 ± 0.415
3.033ThrArg: 3.033 ± 0.141
3.538ThrSer: 3.538 ± 0.287
4.549ThrThr: 4.549 ± 0.457
3.538ThrVal: 3.538 ± 0.705
0.505ThrTrp: 0.505 ± 0.591
1.516ThrTyr: 1.516 ± 1.31
0.0ThrXaa: 0.0 ± 0.0
Val
2.78ValAla: 2.78 ± 0.89
1.769ValCys: 1.769 ± 0.325
3.033ValAsp: 3.033 ± 0.686
5.307ValGlu: 5.307 ± 1.089
2.78ValPhe: 2.78 ± 0.538
4.802ValGly: 4.802 ± 0.907
2.274ValHis: 2.274 ± 0.366
3.033ValIle: 3.033 ± 0.387
4.549ValLys: 4.549 ± 1.584
5.812ValLeu: 5.812 ± 0.596
2.022ValMet: 2.022 ± 0.23
3.538ValAsn: 3.538 ± 1.092
1.516ValPro: 1.516 ± 0.531
2.274ValGln: 2.274 ± 0.601
3.033ValArg: 3.033 ± 0.62
8.845ValSer: 8.845 ± 0.532
1.769ValThr: 1.769 ± 0.325
5.56ValVal: 5.56 ± 1.68
0.505ValTrp: 0.505 ± 0.277
1.769ValTyr: 1.769 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
0.758TrpAla: 0.758 ± 0.415
0.0TrpCys: 0.0 ± 0.0
0.253TrpAsp: 0.253 ± 0.138
0.758TrpGlu: 0.758 ± 0.15
0.505TrpPhe: 0.505 ± 0.39
0.758TrpGly: 0.758 ± 0.36
0.0TrpHis: 0.0 ± 0.0
0.758TrpIle: 0.758 ± 0.36
1.011TrpLys: 1.011 ± 0.558
0.758TrpLeu: 0.758 ± 0.15
0.758TrpMet: 0.758 ± 0.298
0.758TrpAsn: 0.758 ± 0.298
0.505TrpPro: 0.505 ± 0.39
0.253TrpGln: 0.253 ± 0.138
0.505TrpArg: 0.505 ± 0.113
0.505TrpSer: 0.505 ± 0.113
1.516TrpThr: 1.516 ± 0.415
2.022TrpVal: 2.022 ± 0.63
0.253TrpTrp: 0.253 ± 0.138
0.253TrpTyr: 0.253 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.505TyrAla: 0.505 ± 0.317
1.516TyrCys: 1.516 ± 0.666
0.758TyrAsp: 0.758 ± 0.405
2.274TyrGlu: 2.274 ± 0.999
1.264TyrPhe: 1.264 ± 0.396
1.769TyrGly: 1.769 ± 0.365
1.011TyrHis: 1.011 ± 0.463
1.011TyrIle: 1.011 ± 0.266
1.516TyrLys: 1.516 ± 0.3
1.516TyrLeu: 1.516 ± 0.338
0.758TyrMet: 0.758 ± 0.15
0.758TyrAsn: 0.758 ± 0.415
1.769TyrPro: 1.769 ± 1.085
0.758TyrGln: 0.758 ± 0.25
2.274TyrArg: 2.274 ± 0.601
2.274TyrSer: 2.274 ± 0.755
2.022TyrThr: 2.022 ± 0.292
2.274TyrVal: 2.274 ± 0.941
0.253TyrTrp: 0.253 ± 0.203
0.505TyrTyr: 0.505 ± 0.451
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3958 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski