Amino acid dipepetide frequency for Mikumi yellow baboon virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.767AlaAla: 8.767 ± 1.252
2.466AlaCys: 2.466 ± 0.605
2.74AlaAsp: 2.74 ± 0.372
2.329AlaGlu: 2.329 ± 0.412
3.973AlaPhe: 3.973 ± 1.041
4.247AlaGly: 4.247 ± 0.288
3.014AlaHis: 3.014 ± 0.73
5.479AlaIle: 5.479 ± 1.091
3.425AlaLys: 3.425 ± 0.774
9.178AlaLeu: 9.178 ± 0.908
1.918AlaMet: 1.918 ± 0.44
2.603AlaAsn: 2.603 ± 0.358
5.205AlaPro: 5.205 ± 0.832
3.973AlaGln: 3.973 ± 0.482
5.342AlaArg: 5.342 ± 1.113
6.986AlaSer: 6.986 ± 1.135
6.575AlaThr: 6.575 ± 1.498
5.068AlaVal: 5.068 ± 0.609
1.781AlaTrp: 1.781 ± 0.306
3.973AlaTyr: 3.973 ± 0.542
0.0AlaXaa: 0.0 ± 0.0
Cys
3.014CysAla: 3.014 ± 0.334
1.37CysCys: 1.37 ± 0.335
1.781CysAsp: 1.781 ± 0.447
0.822CysGlu: 0.822 ± 0.226
1.507CysPhe: 1.507 ± 0.592
2.192CysGly: 2.192 ± 0.409
1.37CysHis: 1.37 ± 0.417
0.822CysIle: 0.822 ± 0.289
1.507CysLys: 1.507 ± 0.439
3.699CysLeu: 3.699 ± 0.793
0.411CysMet: 0.411 ± 0.141
0.548CysAsn: 0.548 ± 0.288
1.233CysPro: 1.233 ± 0.363
0.0CysGln: 0.0 ± 0.0
1.37CysArg: 1.37 ± 0.674
2.466CysSer: 2.466 ± 1.004
3.425CysThr: 3.425 ± 0.466
1.781CysVal: 1.781 ± 0.623
1.233CysTrp: 1.233 ± 0.33
1.507CysTyr: 1.507 ± 0.353
0.0CysXaa: 0.0 ± 0.0
Asp
4.247AspAla: 4.247 ± 0.608
1.096AspCys: 1.096 ± 0.36
2.466AspAsp: 2.466 ± 0.462
2.055AspGlu: 2.055 ± 0.535
2.192AspPhe: 2.192 ± 0.391
2.603AspGly: 2.603 ± 0.499
1.096AspHis: 1.096 ± 0.326
2.055AspIle: 2.055 ± 0.466
2.466AspLys: 2.466 ± 0.515
3.699AspLeu: 3.699 ± 1.099
1.644AspMet: 1.644 ± 0.386
1.507AspAsn: 1.507 ± 0.355
3.425AspPro: 3.425 ± 1.021
1.233AspGln: 1.233 ± 0.358
2.055AspArg: 2.055 ± 0.626
2.74AspSer: 2.74 ± 0.984
1.37AspThr: 1.37 ± 0.26
3.014AspVal: 3.014 ± 0.399
0.548AspTrp: 0.548 ± 0.193
2.329AspTyr: 2.329 ± 0.644
0.0AspXaa: 0.0 ± 0.0
Glu
3.973GluAla: 3.973 ± 0.885
0.411GluCys: 0.411 ± 0.557
1.37GluAsp: 1.37 ± 0.355
1.507GluGlu: 1.507 ± 0.324
1.096GluPhe: 1.096 ± 0.436
3.699GluGly: 3.699 ± 0.85
1.37GluHis: 1.37 ± 0.309
2.192GluIle: 2.192 ± 0.64
0.822GluLys: 0.822 ± 0.289
2.877GluLeu: 2.877 ± 0.537
0.548GluMet: 0.548 ± 0.241
0.822GluAsn: 0.822 ± 0.342
2.74GluPro: 2.74 ± 0.478
1.37GluGln: 1.37 ± 0.327
1.37GluArg: 1.37 ± 0.306
1.918GluSer: 1.918 ± 0.316
1.644GluThr: 1.644 ± 0.382
2.055GluVal: 2.055 ± 0.302
0.548GluTrp: 0.548 ± 0.465
1.507GluTyr: 1.507 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
3.288PheAla: 3.288 ± 1.526
2.466PheCys: 2.466 ± 0.913
1.781PheAsp: 1.781 ± 0.541
2.055PheGlu: 2.055 ± 0.493
2.877PhePhe: 2.877 ± 0.67
4.384PheGly: 4.384 ± 0.531
1.918PheHis: 1.918 ± 0.497
2.055PheIle: 2.055 ± 1.215
2.329PheLys: 2.329 ± 0.279
5.205PheLeu: 5.205 ± 1.365
1.233PheMet: 1.233 ± 0.428
0.685PheAsn: 0.685 ± 0.933
1.37PhePro: 1.37 ± 0.341
2.055PheGln: 2.055 ± 0.671
0.822PheArg: 0.822 ± 0.566
3.151PheSer: 3.151 ± 0.389
2.877PheThr: 2.877 ± 0.828
3.973PheVal: 3.973 ± 0.573
0.274PheTrp: 0.274 ± 0.263
0.548PheTyr: 0.548 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
4.384GlyAla: 4.384 ± 0.74
1.781GlyCys: 1.781 ± 0.632
4.932GlyAsp: 4.932 ± 1.203
2.192GlyGlu: 2.192 ± 0.407
3.014GlyPhe: 3.014 ± 0.69
3.836GlyGly: 3.836 ± 1.016
1.781GlyHis: 1.781 ± 0.492
5.342GlyIle: 5.342 ± 1.148
4.11GlyLys: 4.11 ± 0.47
4.658GlyLeu: 4.658 ± 0.539
1.233GlyMet: 1.233 ± 0.332
1.781GlyAsn: 1.781 ± 0.553
3.699GlyPro: 3.699 ± 0.45
0.822GlyGln: 0.822 ± 0.31
3.973GlyArg: 3.973 ± 0.705
6.575GlySer: 6.575 ± 0.992
5.068GlyThr: 5.068 ± 0.595
5.479GlyVal: 5.479 ± 0.846
0.411GlyTrp: 0.411 ± 0.262
3.699GlyTyr: 3.699 ± 0.906
0.0GlyXaa: 0.0 ± 0.0
His
3.151HisAla: 3.151 ± 0.433
0.685HisCys: 0.685 ± 0.224
1.233HisAsp: 1.233 ± 0.286
0.822HisGlu: 0.822 ± 0.252
1.918HisPhe: 1.918 ± 0.442
0.959HisGly: 0.959 ± 0.652
1.096HisHis: 1.096 ± 0.636
1.918HisIle: 1.918 ± 0.454
1.096HisLys: 1.096 ± 0.431
3.562HisLeu: 3.562 ± 0.838
0.685HisMet: 0.685 ± 0.226
0.822HisAsn: 0.822 ± 0.504
2.192HisPro: 2.192 ± 1.78
1.644HisGln: 1.644 ± 0.606
1.507HisArg: 1.507 ± 0.461
1.507HisSer: 1.507 ± 0.442
1.781HisThr: 1.781 ± 0.686
1.781HisVal: 1.781 ± 0.424
0.411HisTrp: 0.411 ± 0.441
1.37HisTyr: 1.37 ± 0.49
0.0HisXaa: 0.0 ± 0.0
Ile
4.384IleAla: 4.384 ± 0.975
3.288IleCys: 3.288 ± 0.418
2.329IleAsp: 2.329 ± 0.521
1.233IleGlu: 1.233 ± 0.278
1.507IlePhe: 1.507 ± 0.707
3.699IleGly: 3.699 ± 1.086
2.055IleHis: 2.055 ± 0.73
4.932IleIle: 4.932 ± 1.022
2.466IleLys: 2.466 ± 0.415
4.521IleLeu: 4.521 ± 0.601
0.274IleMet: 0.274 ± 0.247
1.918IleAsn: 1.918 ± 1.079
2.877IlePro: 2.877 ± 0.703
0.959IleGln: 0.959 ± 0.531
1.781IleArg: 1.781 ± 0.988
4.247IleSer: 4.247 ± 0.587
3.151IleThr: 3.151 ± 0.685
3.562IleVal: 3.562 ± 0.551
0.0IleTrp: 0.0 ± 0.0
0.959IleTyr: 0.959 ± 0.569
0.0IleXaa: 0.0 ± 0.0
Lys
3.699LysAla: 3.699 ± 0.831
0.959LysCys: 0.959 ± 0.266
1.096LysAsp: 1.096 ± 0.564
1.507LysGlu: 1.507 ± 0.405
2.877LysPhe: 2.877 ± 0.628
2.603LysGly: 2.603 ± 0.341
0.822LysHis: 0.822 ± 0.417
1.096LysIle: 1.096 ± 0.311
2.74LysLys: 2.74 ± 0.613
5.616LysLeu: 5.616 ± 1.109
0.959LysMet: 0.959 ± 0.392
0.685LysAsn: 0.685 ± 0.224
3.014LysPro: 3.014 ± 0.779
1.644LysGln: 1.644 ± 0.345
1.096LysArg: 1.096 ± 0.293
1.644LysSer: 1.644 ± 0.554
2.603LysThr: 2.603 ± 0.519
4.521LysVal: 4.521 ± 0.648
0.137LysTrp: 0.137 ± 0.203
1.781LysTyr: 1.781 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
11.096LeuAla: 11.096 ± 1.366
2.055LeuCys: 2.055 ± 0.368
1.781LeuAsp: 1.781 ± 0.638
3.288LeuGlu: 3.288 ± 0.626
4.932LeuPhe: 4.932 ± 0.82
6.986LeuGly: 6.986 ± 0.857
2.466LeuHis: 2.466 ± 0.745
3.151LeuIle: 3.151 ± 1.715
2.055LeuLys: 2.055 ± 0.512
13.562LeuLeu: 13.562 ± 3.333
0.959LeuMet: 0.959 ± 0.77
3.425LeuAsn: 3.425 ± 0.608
7.397LeuPro: 7.397 ± 0.912
4.247LeuGln: 4.247 ± 0.51
4.384LeuArg: 4.384 ± 1.248
9.863LeuSer: 9.863 ± 1.281
8.356LeuThr: 8.356 ± 0.762
9.178LeuVal: 9.178 ± 0.858
0.548LeuTrp: 0.548 ± 0.173
3.014LeuTyr: 3.014 ± 0.891
0.0LeuXaa: 0.0 ± 0.0
Met
1.644MetAla: 1.644 ± 0.58
0.548MetCys: 0.548 ± 0.253
0.685MetAsp: 0.685 ± 0.253
0.274MetGlu: 0.274 ± 0.275
0.548MetPhe: 0.548 ± 0.394
0.959MetGly: 0.959 ± 0.303
0.274MetHis: 0.274 ± 0.402
1.644MetIle: 1.644 ± 0.34
0.548MetLys: 0.548 ± 0.314
1.781MetLeu: 1.781 ± 0.665
0.274MetMet: 0.274 ± 0.263
0.548MetAsn: 0.548 ± 0.252
0.548MetPro: 0.548 ± 0.409
0.137MetGln: 0.137 ± 0.326
0.685MetArg: 0.685 ± 0.224
0.959MetSer: 0.959 ± 0.397
0.548MetThr: 0.548 ± 0.735
2.329MetVal: 2.329 ± 0.636
0.274MetTrp: 0.274 ± 0.096
0.274MetTyr: 0.274 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.836AsnAla: 3.836 ± 0.641
0.959AsnCys: 0.959 ± 0.267
1.781AsnAsp: 1.781 ± 0.382
1.233AsnGlu: 1.233 ± 0.294
1.37AsnPhe: 1.37 ± 0.858
3.288AsnGly: 3.288 ± 0.43
0.685AsnHis: 0.685 ± 1.013
1.918AsnIle: 1.918 ± 0.943
0.548AsnLys: 0.548 ± 0.216
2.055AsnLeu: 2.055 ± 0.548
0.411AsnMet: 0.411 ± 0.349
1.507AsnAsn: 1.507 ± 0.916
1.507AsnPro: 1.507 ± 0.512
1.37AsnGln: 1.37 ± 1.167
1.233AsnArg: 1.233 ± 0.354
1.507AsnSer: 1.507 ± 0.373
3.014AsnThr: 3.014 ± 0.803
1.233AsnVal: 1.233 ± 0.615
0.137AsnTrp: 0.137 ± 0.326
1.644AsnTyr: 1.644 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
6.301ProAla: 6.301 ± 1.251
1.644ProCys: 1.644 ± 0.421
2.192ProAsp: 2.192 ± 0.511
2.192ProGlu: 2.192 ± 0.588
2.192ProPhe: 2.192 ± 0.471
4.932ProGly: 4.932 ± 0.891
1.781ProHis: 1.781 ± 0.417
2.603ProIle: 2.603 ± 0.671
1.37ProLys: 1.37 ± 0.492
5.205ProLeu: 5.205 ± 1.066
1.096ProMet: 1.096 ± 0.302
1.781ProAsn: 1.781 ± 0.519
4.521ProPro: 4.521 ± 0.863
3.288ProGln: 3.288 ± 0.701
4.247ProArg: 4.247 ± 0.981
3.973ProSer: 3.973 ± 1.584
5.753ProThr: 5.753 ± 0.763
4.932ProVal: 4.932 ± 1.01
0.548ProTrp: 0.548 ± 0.488
3.562ProTyr: 3.562 ± 1.049
0.0ProXaa: 0.0 ± 0.0
Gln
2.877GlnAla: 2.877 ± 0.596
0.137GlnCys: 0.137 ± 0.326
1.37GlnAsp: 1.37 ± 0.447
0.822GlnGlu: 0.822 ± 0.234
0.959GlnPhe: 0.959 ± 0.369
2.74GlnGly: 2.74 ± 0.692
0.548GlnHis: 0.548 ± 0.33
0.959GlnIle: 0.959 ± 0.541
1.37GlnLys: 1.37 ± 0.575
3.562GlnLeu: 3.562 ± 0.625
0.274GlnMet: 0.274 ± 0.263
1.644GlnAsn: 1.644 ± 0.703
3.288GlnPro: 3.288 ± 0.866
0.959GlnGln: 0.959 ± 0.235
2.055GlnArg: 2.055 ± 0.525
1.918GlnSer: 1.918 ± 0.519
1.096GlnThr: 1.096 ± 0.475
2.192GlnVal: 2.192 ± 0.525
0.548GlnTrp: 0.548 ± 0.344
1.644GlnTyr: 1.644 ± 0.279
0.0GlnXaa: 0.0 ± 0.0
Arg
3.014ArgAla: 3.014 ± 0.675
1.918ArgCys: 1.918 ± 0.464
3.562ArgAsp: 3.562 ± 0.988
1.644ArgGlu: 1.644 ± 0.318
2.192ArgPhe: 2.192 ± 0.639
3.288ArgGly: 3.288 ± 0.703
1.233ArgHis: 1.233 ± 0.816
2.192ArgIle: 2.192 ± 0.446
1.781ArgLys: 1.781 ± 0.648
6.301ArgLeu: 6.301 ± 0.965
0.959ArgMet: 0.959 ± 0.533
1.644ArgAsn: 1.644 ± 0.496
2.192ArgPro: 2.192 ± 0.277
0.822ArgGln: 0.822 ± 0.371
2.192ArgArg: 2.192 ± 0.808
2.055ArgSer: 2.055 ± 0.673
2.877ArgThr: 2.877 ± 0.924
4.11ArgVal: 4.11 ± 0.413
0.137ArgTrp: 0.137 ± 0.21
2.74ArgTyr: 2.74 ± 0.558
0.0ArgXaa: 0.0 ± 0.0
Ser
6.575SerAla: 6.575 ± 0.597
3.014SerCys: 3.014 ± 0.634
3.151SerAsp: 3.151 ± 0.833
1.644SerGlu: 1.644 ± 0.442
2.329SerPhe: 2.329 ± 0.571
5.205SerGly: 5.205 ± 1.01
2.466SerHis: 2.466 ± 1.123
3.288SerIle: 3.288 ± 0.353
3.425SerLys: 3.425 ± 0.981
9.178SerLeu: 9.178 ± 1.377
0.959SerMet: 0.959 ± 0.3
1.781SerAsn: 1.781 ± 0.37
6.712SerPro: 6.712 ± 0.869
2.055SerGln: 2.055 ± 0.551
3.425SerArg: 3.425 ± 0.551
4.932SerSer: 4.932 ± 0.999
6.575SerThr: 6.575 ± 0.84
5.479SerVal: 5.479 ± 0.808
0.822SerTrp: 0.822 ± 0.329
1.781SerTyr: 1.781 ± 0.798
0.0SerXaa: 0.0 ± 0.0
Thr
5.753ThrAla: 5.753 ± 0.7
1.507ThrCys: 1.507 ± 0.327
2.192ThrAsp: 2.192 ± 0.555
1.233ThrGlu: 1.233 ± 0.363
2.877ThrPhe: 2.877 ± 0.717
6.986ThrGly: 6.986 ± 0.959
2.466ThrHis: 2.466 ± 0.611
2.877ThrIle: 2.877 ± 0.869
4.658ThrLys: 4.658 ± 0.752
7.123ThrLeu: 7.123 ± 2.061
0.685ThrMet: 0.685 ± 0.241
3.151ThrAsn: 3.151 ± 1.346
4.521ThrPro: 4.521 ± 0.433
1.781ThrGln: 1.781 ± 0.632
4.11ThrArg: 4.11 ± 0.798
7.123ThrSer: 7.123 ± 0.727
6.712ThrThr: 6.712 ± 1.074
4.11ThrVal: 4.11 ± 0.717
0.548ThrTrp: 0.548 ± 0.226
2.055ThrTyr: 2.055 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
5.205ValAla: 5.205 ± 0.629
3.973ValCys: 3.973 ± 0.866
4.932ValAsp: 4.932 ± 1.202
4.521ValGlu: 4.521 ± 0.58
4.384ValPhe: 4.384 ± 1.296
3.562ValGly: 3.562 ± 0.552
2.055ValHis: 2.055 ± 0.328
2.603ValIle: 2.603 ± 0.565
2.74ValLys: 2.74 ± 0.357
5.89ValLeu: 5.89 ± 0.513
0.411ValMet: 0.411 ± 0.235
3.425ValAsn: 3.425 ± 0.653
4.932ValPro: 4.932 ± 0.994
1.507ValGln: 1.507 ± 0.394
3.288ValArg: 3.288 ± 0.834
7.397ValSer: 7.397 ± 1.074
5.616ValThr: 5.616 ± 1.149
7.808ValVal: 7.808 ± 0.821
0.548ValTrp: 0.548 ± 0.216
1.918ValTyr: 1.918 ± 0.996
0.0ValXaa: 0.0 ± 0.0
Trp
1.37TrpAla: 1.37 ± 0.414
0.137TrpCys: 0.137 ± 0.09
0.685TrpAsp: 0.685 ± 0.252
0.137TrpGlu: 0.137 ± 0.326
0.822TrpPhe: 0.822 ± 0.286
0.0TrpGly: 0.0 ± 0.0
0.685TrpHis: 0.685 ± 0.23
0.0TrpIle: 0.0 ± 0.0
0.411TrpLys: 0.411 ± 0.141
1.918TrpLeu: 1.918 ± 0.347
0.0TrpMet: 0.0 ± 0.285
0.137TrpAsn: 0.137 ± 0.09
0.822TrpPro: 0.822 ± 0.281
0.274TrpGln: 0.274 ± 0.275
0.411TrpArg: 0.411 ± 0.262
0.548TrpSer: 0.548 ± 0.773
1.233TrpThr: 1.233 ± 0.94
0.548TrpVal: 0.548 ± 0.304
0.0TrpTrp: 0.0 ± 0.0
0.548TrpTyr: 0.548 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.466TyrAla: 2.466 ± 0.725
1.644TyrCys: 1.644 ± 0.439
1.781TyrAsp: 1.781 ± 0.622
2.329TyrGlu: 2.329 ± 0.586
1.781TyrPhe: 1.781 ± 0.611
2.603TyrGly: 2.603 ± 0.484
0.959TyrHis: 0.959 ± 0.588
3.014TyrIle: 3.014 ± 0.62
1.233TyrLys: 1.233 ± 0.407
3.151TyrLeu: 3.151 ± 0.506
0.411TyrMet: 0.411 ± 0.281
0.822TyrAsn: 0.822 ± 0.323
1.918TyrPro: 1.918 ± 0.375
0.822TyrGln: 0.822 ± 0.258
1.507TyrArg: 1.507 ± 0.469
3.288TyrSer: 3.288 ± 0.606
2.192TyrThr: 2.192 ± 0.689
3.562TyrVal: 3.562 ± 0.381
1.37TyrTrp: 1.37 ± 0.327
2.877TyrTyr: 2.877 ± 0.505
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (7301 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski