Amino acid dipeptide frequency for Kafue kinda chacma baboon virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
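As a rough illustration of how such a frequency can be obtained, the sketch below counts adjacent residue pairs in a list of protein sequences. It is not the database's own code: the function name dipeptide_frequencies, the one-letter sequence encoding, the sliding window that never crosses protein boundaries, and the normalisation by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein (never across protein boundaries) and normalised by the
    total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable to the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences in one-letter code, not taken from this proteome.
freqs = dipeptide_frequencies(["MKVLAA", "AAK"])
print(sorted(freqs.items()))
```

The two toy sequences contain seven pairs in total, two of which are "AA", so freqs["AA"] is 2/7 expressed in per mille.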

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
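The arithmetic behind the expected value can be sketched as follows. The comparison rule in classify is an assumption: the page does not state the exact threshold used for the red and blue marking, so this example simply compares each value with the uniform expectation. The three observed values are the LeuLeu, AspAsp and TrpTrp entries from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 20 * 20 = 400 equally likely dipeptides -> 1/400 = 0.25% = 2.5 per mille
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison; the database
    may apply a different criterion for its colouring.
    """
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
observed = {"LeuLeu": 12.097, "AspAsp": 2.515, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in observed.items()}
print(expected_permille, labels)
```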

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

1st/2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.582 ± 1.088 | 1.437 ± 0.479 | 2.395 ± 0.562 | 1.677 ± 0.345 | 4.192 ± 0.952 | 4.312 ± 0.478 | 2.395 ± 0.451 | 5.51 ± 0.683 | 3.234 ± 0.332 | 9.582 ± 1.131 | 0.838 ± 0.218 | 2.395 ± 0.345 | 5.749 ± 0.779 | 2.036 ± 0.363 | 3.713 ± 0.608 | 6.468 ± 0.484 | 6.228 ± 0.92 | 6.109 ± 0.56 | 0.838 ± 0.247 | 4.192 ± 0.499 | 0.0 ± 0.0
Cys | 3.833 ± 0.487 | 0.838 ± 0.269 | 1.797 ± 0.355 | 0.838 ± 0.143 | 1.078 ± 0.612 | 2.515 ± 0.349 | 2.036 ± 0.31 | 1.198 ± 0.401 | 0.479 ± 0.207 | 3.473 ± 0.766 | 0.359 ± 0.117 | 0.479 ± 0.177 | 1.198 ± 0.263 | 1.437 ± 0.369 | 0.958 ± 0.313 | 1.557 ± 0.509 | 2.994 ± 0.702 | 1.198 ± 0.737 | 0.958 ± 0.387 | 1.797 ± 0.527 | 0.0 ± 0.0
Asp | 3.833 ± 0.714 | 0.719 ± 0.289 | 2.515 ± 0.523 | 2.635 ± 0.361 | 1.677 ± 0.37 | 2.755 ± 0.491 | 1.078 ± 0.298 | 2.635 ± 0.519 | 1.677 ± 0.371 | 3.713 ± 0.464 | 0.24 ± 0.166 | 2.276 ± 0.295 | 4.312 ± 0.812 | 1.078 ± 0.343 | 1.198 ± 0.433 | 2.515 ± 0.418 | 0.838 ± 0.32 | 2.395 ± 0.474 | 0.12 ± 0.083 | 2.635 ± 0.658 | 0.0 ± 0.0
Glu | 2.395 ± 0.529 | 1.318 ± 0.313 | 1.198 ± 0.281 | 0.838 ± 0.285 | 2.036 ± 0.581 | 3.473 ± 0.451 | 0.958 ± 0.263 | 2.395 ± 0.469 | 1.797 ± 0.528 | 3.593 ± 0.469 | 0.479 ± 0.177 | 0.479 ± 0.331 | 1.797 ± 0.359 | 1.437 ± 0.329 | 1.677 ± 0.265 | 2.994 ± 0.429 | 2.156 ± 0.287 | 1.677 ± 0.469 | 0.479 ± 0.195 | 0.599 ± 0.294 | 0.0 ± 0.0
Phe | 3.593 ± 0.697 | 2.635 ± 0.349 | 1.797 ± 0.331 | 2.276 ± 0.52 | 2.515 ± 0.59 | 4.551 ± 1.032 | 1.557 ± 0.375 | 2.755 ± 0.328 | 2.156 ± 0.374 | 4.312 ± 0.975 | 0.838 ± 0.377 | 1.198 ± 0.453 | 1.916 ± 0.376 | 1.318 ± 0.48 | 1.677 ± 0.612 | 3.234 ± 0.538 | 1.797 ± 0.745 | 4.551 ± 0.707 | 0.719 ± 0.283 | 1.437 ± 0.259 | 0.0 ± 0.0
Gly | 3.833 ± 0.511 | 1.916 ± 0.287 | 4.312 ± 0.814 | 2.994 ± 0.709 | 4.192 ± 0.572 | 4.192 ± 0.732 | 2.635 ± 0.578 | 2.755 ± 1.003 | 4.312 ± 0.621 | 6.468 ± 0.592 | 1.198 ± 0.264 | 2.036 ± 0.331 | 3.953 ± 0.353 | 1.916 ± 0.267 | 2.395 ± 0.612 | 5.989 ± 0.634 | 4.551 ± 0.805 | 6.228 ± 0.951 | 0.12 ± 0.172 | 3.833 ± 0.485 | 0.0 ± 0.0
His | 2.156 ± 0.376 | 0.719 ± 0.298 | 1.437 ± 0.305 | 0.719 ± 0.26 | 1.557 ± 0.527 | 1.797 ± 0.334 | 1.078 ± 0.683 | 2.276 ± 0.342 | 1.797 ± 0.418 | 3.713 ± 0.847 | 0.958 ± 0.237 | 0.838 ± 0.599 | 2.276 ± 1.403 | 1.078 ± 0.311 | 0.479 ± 0.354 | 1.198 ± 0.335 | 2.755 ± 0.417 | 0.958 ± 0.383 | 0.479 ± 0.269 | 2.276 ± 0.53 | 0.0 ± 0.0
Ile | 4.911 ± 0.827 | 2.276 ± 0.514 | 1.078 ± 0.468 | 1.198 ± 0.243 | 1.677 ± 0.383 | 3.833 ± 0.516 | 2.036 ± 0.62 | 2.755 ± 0.95 | 1.916 ± 0.334 | 5.989 ± 0.468 | 0.599 ± 0.153 | 2.156 ± 0.955 | 4.192 ± 0.589 | 1.198 ± 0.264 | 2.994 ± 0.333 | 3.234 ± 1.024 | 2.635 ± 0.607 | 5.031 ± 0.496 | 0.12 ± 0.153 | 2.276 ± 0.468 | 0.0 ± 0.0
Lys | 3.833 ± 0.582 | 1.078 ± 0.325 | 2.156 ± 0.611 | 1.916 ± 0.369 | 2.156 ± 0.536 | 2.036 ± 0.451 | 0.838 ± 0.266 | 2.395 ± 0.29 | 1.318 ± 0.348 | 5.989 ± 0.365 | 0.359 ± 0.241 | 0.838 ± 0.399 | 1.916 ± 0.429 | 1.078 ± 0.331 | 1.557 ± 0.333 | 1.916 ± 0.339 | 2.156 ± 0.507 | 3.354 ± 0.652 | 0.359 ± 0.142 | 0.719 ± 0.236 | 0.0 ± 0.0
Leu | 11.738 ± 0.799 | 4.072 ± 0.585 | 2.635 ± 0.646 | 2.515 ± 0.663 | 4.072 ± 0.639 | 7.067 ± 0.879 | 2.156 ± 0.439 | 4.312 ± 0.97 | 2.875 ± 0.808 | 12.097 ± 1.899 | 1.916 ± 0.47 | 4.192 ± 0.5 | 6.468 ± 0.615 | 3.593 ± 0.921 | 5.031 ± 1.078 | 11.738 ± 1.032 | 6.947 ± 1.04 | 8.504 ± 0.491 | 0.719 ± 0.168 | 2.635 ± 1.024 | 0.0 ± 0.0
Met | 1.677 ± 0.358 | 0.479 ± 0.554 | 0.24 ± 0.166 | 0.359 ± 0.336 | 0.0 ± 0.0 | 0.958 ± 0.358 | 0.0 ± 0.0 | 1.198 ± 0.35 | 0.12 ± 0.083 | 1.916 ± 0.251 | 0.359 ± 0.123 | 0.24 ± 0.166 | 0.958 ± 0.402 | 0.599 ± 0.368 | 0.359 ± 0.289 | 1.557 ± 0.461 | 0.719 ± 0.246 | 3.354 ± 0.543 | 0.359 ± 0.123 | 0.719 ± 0.168 | 0.0 ± 0.0
Asn | 3.713 ± 0.553 | 0.479 ± 0.347 | 0.719 ± 0.307 | 0.838 ± 0.207 | 2.156 ± 1.428 | 3.473 ± 0.48 | 1.198 ± 0.719 | 2.036 ± 0.768 | 0.838 ± 0.261 | 2.156 ± 0.595 | 0.24 ± 0.148 | 1.198 ± 0.502 | 1.916 ± 0.53 | 0.958 ± 0.603 | 2.036 ± 0.452 | 4.192 ± 0.739 | 2.635 ± 0.611 | 3.354 ± 0.661 | 0.479 ± 0.195 | 1.318 ± 0.363 | 0.0 ± 0.0
Pro | 2.994 ± 0.375 | 1.916 ± 0.505 | 3.114 ± 0.784 | 3.114 ± 0.538 | 2.036 ± 0.509 | 4.551 ± 0.65 | 1.797 ± 0.301 | 2.875 ± 0.455 | 3.114 ± 0.655 | 5.031 ± 1.254 | 0.958 ± 0.246 | 2.395 ± 0.393 | 7.067 ± 1.318 | 3.354 ± 0.716 | 3.234 ± 0.618 | 6.947 ± 0.691 | 4.911 ± 0.701 | 4.551 ± 0.592 | 0.599 ± 0.433 | 2.156 ± 0.485 | 0.0 ± 0.0
Gln | 2.635 ± 0.387 | 0.838 ± 0.55 | 1.078 ± 0.387 | 0.838 ± 0.306 | 1.557 ± 0.376 | 1.677 ± 0.185 | 1.318 ± 0.282 | 1.198 ± 0.374 | 0.599 ± 0.416 | 4.551 ± 0.749 | 0.599 ± 0.151 | 1.437 ± 0.468 | 2.276 ± 0.537 | 1.437 ± 0.299 | 1.557 ± 0.287 | 1.677 ± 0.383 | 1.198 ± 0.693 | 1.916 ± 0.626 | 0.719 ± 0.374 | 2.036 ± 0.403 | 0.0 ± 0.0
Arg | 3.593 ± 0.538 | 0.359 ± 0.266 | 2.156 ± 0.488 | 1.677 ± 0.305 | 2.515 ± 0.514 | 4.072 ± 0.87 | 1.318 ± 0.427 | 1.198 ± 0.22 | 2.395 ± 0.319 | 4.911 ± 0.619 | 1.557 ± 0.475 | 1.437 ± 0.453 | 2.994 ± 0.312 | 1.318 ± 0.564 | 2.755 ± 0.49 | 3.114 ± 0.405 | 2.994 ± 0.425 | 3.354 ± 0.371 | 0.838 ± 0.297 | 2.036 ± 0.264 | 0.0 ± 0.0
Ser | 6.228 ± 0.362 | 3.114 ± 0.935 | 3.833 ± 0.37 | 3.593 ± 0.303 | 3.354 ± 0.442 | 5.031 ± 0.973 | 2.276 ± 0.965 | 4.551 ± 0.794 | 2.276 ± 0.66 | 9.103 ± 1.21 | 1.437 ± 0.322 | 3.354 ± 0.485 | 4.671 ± 0.693 | 0.838 ± 0.515 | 5.15 ± 0.544 | 7.905 ± 1.339 | 6.468 ± 1.056 | 3.473 ± 0.589 | 1.318 ± 0.303 | 1.916 ± 0.51 | 0.0 ± 0.0
Thr | 4.312 ± 0.648 | 1.916 ± 0.353 | 2.036 ± 0.561 | 1.557 ± 0.293 | 2.755 ± 0.458 | 5.989 ± 1.058 | 2.635 ± 0.568 | 3.354 ± 0.896 | 3.354 ± 0.516 | 5.15 ± 1.044 | 0.719 ± 0.32 | 3.234 ± 0.884 | 7.785 ± 0.722 | 2.036 ± 0.567 | 3.354 ± 0.415 | 4.192 ± 0.955 | 4.312 ± 1.101 | 6.468 ± 0.534 | 0.719 ± 0.377 | 1.437 ± 0.342 | 0.0 ± 0.0
Val | 4.911 ± 0.482 | 2.635 ± 0.433 | 4.072 ± 0.686 | 3.713 ± 0.495 | 4.312 ± 0.468 | 3.354 ± 0.693 | 1.797 ± 0.483 | 2.755 ± 0.352 | 2.395 ± 0.496 | 8.145 ± 0.717 | 1.078 ± 0.258 | 4.192 ± 0.697 | 3.953 ± 0.621 | 1.078 ± 0.333 | 4.432 ± 0.54 | 6.588 ± 0.605 | 6.588 ± 0.498 | 7.186 ± 1.269 | 0.479 ± 0.218 | 2.276 ± 0.427 | 0.0 ± 0.0
Trp | 1.318 ± 0.442 | 0.359 ± 0.274 | 0.479 ± 0.215 | 0.0 ± 0.0 | 0.719 ± 0.279 | 0.599 ± 0.266 | 0.12 ± 0.195 | 0.958 ± 0.41 | 0.12 ± 0.083 | 1.198 ± 0.358 | 0.12 ± 0.191 | 0.0 ± 0.0 | 0.599 ± 0.17 | 0.599 ± 0.271 | 0.599 ± 0.151 | 0.359 ± 0.441 | 2.156 ± 0.805 | 0.359 ± 0.384 | 0.0 ± 0.0 | 0.359 ± 0.123 | 0.0 ± 0.0
Tyr | 1.797 ± 0.254 | 1.916 ± 0.206 | 1.797 ± 0.381 | 0.479 ± 0.175 | 2.276 ± 0.328 | 3.593 ± 0.88 | 1.318 ± 0.607 | 2.994 ± 0.393 | 1.198 ± 0.481 | 4.192 ± 0.53 | 1.198 ± 0.451 | 1.557 ± 0.274 | 0.719 ± 0.195 | 2.755 ± 0.497 | 1.437 ± 0.329 | 2.515 ± 0.625 | 2.395 ± 0.68 | 1.916 ± 0.379 | 0.599 ± 0.152 | 1.557 ± 0.594 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 14 proteins (8350 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
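A minimal sketch of a protein-level bootstrap is given below. It resamples whole proteins with replacement, recomputes the per-mille frequency of one dipeptide in each resample, and reports the spread of those estimates. The function names, the use of the sample standard deviation as the error, the fixed random seed and the toy one-letter sequences are all assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the sample standard deviation over rounds.
    """
    rng = random.Random(seed)  # seeded so the example is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not taken from this proteome.
toy = ["MKVLAA", "AAK", "GLLSA", "LLLA"]
err = bootstrap_error(toy, "LL")
print(pair_permille(toy, "LL"), err)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the set.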

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski