Amino acid dipepetide frequency for Natrinema versiforme icosahedral virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.88AlaAla: 14.88 ± 3.206
0.753AlaCys: 0.753 ± 0.332
9.606AlaAsp: 9.606 ± 1.493
7.534AlaGlu: 7.534 ± 1.718
3.014AlaPhe: 3.014 ± 0.651
10.548AlaGly: 10.548 ± 2.368
1.319AlaHis: 1.319 ± 0.495
4.521AlaIle: 4.521 ± 0.624
1.695AlaLys: 1.695 ± 0.422
9.983AlaLeu: 9.983 ± 1.821
2.637AlaMet: 2.637 ± 0.968
1.695AlaAsn: 1.695 ± 0.777
4.521AlaPro: 4.521 ± 0.891
3.014AlaGln: 3.014 ± 0.712
5.274AlaArg: 5.274 ± 1.281
5.086AlaSer: 5.086 ± 0.71
6.781AlaThr: 6.781 ± 1.388
8.476AlaVal: 8.476 ± 1.903
1.695AlaTrp: 1.695 ± 0.507
2.26AlaTyr: 2.26 ± 0.608
0.0AlaXaa: 0.0 ± 0.0
Cys
0.565CysAla: 0.565 ± 0.433
0.0CysCys: 0.0 ± 0.0
0.753CysAsp: 0.753 ± 0.287
0.753CysGlu: 0.753 ± 0.401
0.0CysPhe: 0.0 ± 0.0
0.565CysGly: 0.565 ± 0.264
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.188CysLys: 0.188 ± 0.187
0.377CysLeu: 0.377 ± 0.274
0.188CysMet: 0.188 ± 0.209
0.188CysAsn: 0.188 ± 0.137
0.377CysPro: 0.377 ± 0.247
0.377CysGln: 0.377 ± 0.311
0.0CysArg: 0.0 ± 0.0
0.377CysSer: 0.377 ± 0.208
0.188CysThr: 0.188 ± 0.204
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.377CysTyr: 0.377 ± 0.244
0.0CysXaa: 0.0 ± 0.0
Asp
13.374AspAla: 13.374 ± 3.063
0.0AspCys: 0.0 ± 0.0
15.257AspAsp: 15.257 ± 3.667
10.36AspGlu: 10.36 ± 1.656
2.26AspPhe: 2.26 ± 0.794
10.925AspGly: 10.925 ± 1.429
2.072AspHis: 2.072 ± 0.727
3.767AspIle: 3.767 ± 0.849
1.13AspLys: 1.13 ± 0.374
9.041AspLeu: 9.041 ± 1.239
1.13AspMet: 1.13 ± 0.41
1.507AspAsn: 1.507 ± 0.542
5.651AspPro: 5.651 ± 1.142
3.579AspGln: 3.579 ± 1.186
7.723AspArg: 7.723 ± 1.879
6.593AspSer: 6.593 ± 0.922
3.767AspThr: 3.767 ± 0.778
6.593AspVal: 6.593 ± 1.486
1.884AspTrp: 1.884 ± 0.465
1.319AspTyr: 1.319 ± 0.518
0.0AspXaa: 0.0 ± 0.0
Glu
11.302GluAla: 11.302 ± 2.135
0.565GluCys: 0.565 ± 0.301
7.911GluAsp: 7.911 ± 1.325
6.969GluGlu: 6.969 ± 1.46
2.072GluPhe: 2.072 ± 0.904
7.158GluGly: 7.158 ± 1.513
0.942GluHis: 0.942 ± 0.458
5.086GluIle: 5.086 ± 1.115
1.13GluLys: 1.13 ± 0.567
2.825GluLeu: 2.825 ± 0.691
1.507GluMet: 1.507 ± 0.556
3.014GluAsn: 3.014 ± 0.518
4.709GluPro: 4.709 ± 0.958
2.825GluGln: 2.825 ± 1.122
7.158GluArg: 7.158 ± 1.304
7.158GluSer: 7.158 ± 1.428
5.462GluThr: 5.462 ± 0.953
4.709GluVal: 4.709 ± 1.216
1.884GluTrp: 1.884 ± 0.543
3.014GluTyr: 3.014 ± 0.823
0.0GluXaa: 0.0 ± 0.0
Phe
2.825PheAla: 2.825 ± 0.768
0.188PheCys: 0.188 ± 0.166
2.637PheAsp: 2.637 ± 0.559
2.449PheGlu: 2.449 ± 0.816
0.565PhePhe: 0.565 ± 0.429
2.637PheGly: 2.637 ± 0.708
0.188PheHis: 0.188 ± 0.166
0.0PheIle: 0.0 ± 0.0
0.188PheLys: 0.188 ± 0.221
2.825PheLeu: 2.825 ± 0.527
0.0PheMet: 0.0 ± 0.0
1.319PheAsn: 1.319 ± 0.628
0.753PhePro: 0.753 ± 0.34
0.188PheGln: 0.188 ± 0.166
2.825PheArg: 2.825 ± 0.605
3.014PheSer: 3.014 ± 0.628
1.13PheThr: 1.13 ± 0.487
2.072PheVal: 2.072 ± 0.533
0.188PheTrp: 0.188 ± 0.162
0.188PheTyr: 0.188 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
7.158GlyAla: 7.158 ± 2.025
0.377GlyCys: 0.377 ± 0.274
9.983GlyAsp: 9.983 ± 3.128
6.593GlyGlu: 6.593 ± 0.95
2.637GlyPhe: 2.637 ± 0.583
11.302GlyGly: 11.302 ± 3.078
1.13GlyHis: 1.13 ± 0.392
2.449GlyIle: 2.449 ± 0.549
1.695GlyLys: 1.695 ± 0.624
7.158GlyLeu: 7.158 ± 1.113
2.072GlyMet: 2.072 ± 0.87
3.39GlyAsn: 3.39 ± 0.824
4.521GlyPro: 4.521 ± 1.122
1.507GlyGln: 1.507 ± 0.596
3.767GlyArg: 3.767 ± 0.94
6.781GlySer: 6.781 ± 1.049
5.651GlyThr: 5.651 ± 0.971
6.216GlyVal: 6.216 ± 0.932
1.507GlyTrp: 1.507 ± 0.707
3.767GlyTyr: 3.767 ± 0.649
0.0GlyXaa: 0.0 ± 0.0
His
0.942HisAla: 0.942 ± 0.401
0.377HisCys: 0.377 ± 0.258
0.942HisAsp: 0.942 ± 0.397
0.565HisGlu: 0.565 ± 0.252
0.188HisPhe: 0.188 ± 0.186
1.13HisGly: 1.13 ± 0.391
0.188HisHis: 0.188 ± 0.142
0.753HisIle: 0.753 ± 0.335
0.377HisLys: 0.377 ± 0.241
1.13HisLeu: 1.13 ± 0.372
0.0HisMet: 0.0 ± 0.0
0.188HisAsn: 0.188 ± 0.22
0.565HisPro: 0.565 ± 0.27
0.565HisGln: 0.565 ± 0.304
1.13HisArg: 1.13 ± 0.524
0.377HisSer: 0.377 ± 0.267
0.565HisThr: 0.565 ± 0.29
1.884HisVal: 1.884 ± 0.457
0.0HisTrp: 0.0 ± 0.0
0.565HisTyr: 0.565 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
2.637IleAla: 2.637 ± 0.637
0.0IleCys: 0.0 ± 0.0
5.462IleAsp: 5.462 ± 0.93
4.897IleGlu: 4.897 ± 0.728
0.377IlePhe: 0.377 ± 0.244
3.956IleGly: 3.956 ± 0.554
0.0IleHis: 0.0 ± 0.0
1.319IleIle: 1.319 ± 0.55
1.13IleLys: 1.13 ± 0.38
2.825IleLeu: 2.825 ± 0.692
1.13IleMet: 1.13 ± 0.38
1.507IleAsn: 1.507 ± 0.39
1.507IlePro: 1.507 ± 0.471
1.319IleGln: 1.319 ± 0.512
3.202IleArg: 3.202 ± 0.685
1.507IleSer: 1.507 ± 0.505
1.507IleThr: 1.507 ± 0.477
2.637IleVal: 2.637 ± 0.539
0.565IleTrp: 0.565 ± 0.448
1.695IleTyr: 1.695 ± 0.698
0.0IleXaa: 0.0 ± 0.0
Lys
3.202LysAla: 3.202 ± 0.862
0.0LysCys: 0.0 ± 0.0
1.695LysAsp: 1.695 ± 0.481
1.507LysGlu: 1.507 ± 0.492
0.942LysPhe: 0.942 ± 0.363
1.319LysGly: 1.319 ± 0.45
0.565LysHis: 0.565 ± 0.343
1.13LysIle: 1.13 ± 0.422
0.377LysLys: 0.377 ± 0.369
1.13LysLeu: 1.13 ± 0.363
0.188LysMet: 0.188 ± 0.217
0.377LysAsn: 0.377 ± 0.285
0.753LysPro: 0.753 ± 0.416
0.942LysGln: 0.942 ± 0.349
1.507LysArg: 1.507 ± 0.706
0.942LysSer: 0.942 ± 0.32
1.13LysThr: 1.13 ± 0.373
1.319LysVal: 1.319 ± 0.493
0.565LysTrp: 0.565 ± 0.266
0.565LysTyr: 0.565 ± 0.301
0.0LysXaa: 0.0 ± 0.0
Leu
8.288LeuAla: 8.288 ± 1.258
0.942LeuCys: 0.942 ± 0.31
6.028LeuAsp: 6.028 ± 1.031
12.243LeuGlu: 12.243 ± 2.072
2.26LeuPhe: 2.26 ± 0.518
6.028LeuGly: 6.028 ± 1.404
0.942LeuHis: 0.942 ± 0.361
3.202LeuIle: 3.202 ± 0.813
1.695LeuLys: 1.695 ± 0.52
8.099LeuLeu: 8.099 ± 1.978
0.942LeuMet: 0.942 ± 0.382
1.695LeuAsn: 1.695 ± 0.641
5.462LeuPro: 5.462 ± 0.966
3.579LeuGln: 3.579 ± 1.112
4.521LeuArg: 4.521 ± 0.942
5.462LeuSer: 5.462 ± 0.982
4.144LeuThr: 4.144 ± 0.726
5.462LeuVal: 5.462 ± 1.06
2.072LeuTrp: 2.072 ± 0.775
0.942LeuTyr: 0.942 ± 0.39
0.0LeuXaa: 0.0 ± 0.0
Met
2.825MetAla: 2.825 ± 0.936
0.0MetCys: 0.0 ± 0.0
1.13MetAsp: 1.13 ± 0.365
0.942MetGlu: 0.942 ± 0.364
0.188MetPhe: 0.188 ± 0.186
0.942MetGly: 0.942 ± 0.391
0.0MetHis: 0.0 ± 0.0
0.753MetIle: 0.753 ± 0.438
0.188MetLys: 0.188 ± 0.142
1.319MetLeu: 1.319 ± 0.664
0.188MetMet: 0.188 ± 0.178
0.565MetAsn: 0.565 ± 0.343
0.565MetPro: 0.565 ± 0.328
0.753MetGln: 0.753 ± 0.467
0.377MetArg: 0.377 ± 0.243
1.884MetSer: 1.884 ± 0.566
1.507MetThr: 1.507 ± 0.478
1.695MetVal: 1.695 ± 0.67
0.188MetTrp: 0.188 ± 0.203
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.202AsnAla: 3.202 ± 0.553
0.0AsnCys: 0.0 ± 0.0
2.637AsnAsp: 2.637 ± 0.664
1.884AsnGlu: 1.884 ± 0.489
0.565AsnPhe: 0.565 ± 0.363
3.014AsnGly: 3.014 ± 1.117
0.377AsnHis: 0.377 ± 0.208
1.13AsnIle: 1.13 ± 0.53
0.188AsnLys: 0.188 ± 0.162
2.449AsnLeu: 2.449 ± 0.746
0.565AsnMet: 0.565 ± 0.286
0.377AsnAsn: 0.377 ± 0.232
1.319AsnPro: 1.319 ± 0.6
1.319AsnGln: 1.319 ± 0.489
0.942AsnArg: 0.942 ± 0.405
1.884AsnSer: 1.884 ± 0.87
2.26AsnThr: 2.26 ± 0.749
1.507AsnVal: 1.507 ± 0.51
0.942AsnTrp: 0.942 ± 0.372
0.377AsnTyr: 0.377 ± 0.276
0.0AsnXaa: 0.0 ± 0.0
Pro
5.274ProAla: 5.274 ± 1.064
0.565ProCys: 0.565 ± 0.315
6.781ProAsp: 6.781 ± 1.014
4.709ProGlu: 4.709 ± 0.915
1.319ProPhe: 1.319 ± 0.443
4.144ProGly: 4.144 ± 0.799
0.565ProHis: 0.565 ± 0.313
1.695ProIle: 1.695 ± 0.736
1.13ProLys: 1.13 ± 0.498
3.579ProLeu: 3.579 ± 0.733
0.565ProMet: 0.565 ± 0.244
0.942ProAsn: 0.942 ± 0.284
1.884ProPro: 1.884 ± 0.512
0.942ProGln: 0.942 ± 0.344
1.884ProArg: 1.884 ± 0.658
3.579ProSer: 3.579 ± 1.054
3.014ProThr: 3.014 ± 0.636
2.637ProVal: 2.637 ± 0.618
1.13ProTrp: 1.13 ± 0.363
1.319ProTyr: 1.319 ± 0.488
0.0ProXaa: 0.0 ± 0.0
Gln
3.956GlnAla: 3.956 ± 0.805
0.188GlnCys: 0.188 ± 0.212
2.072GlnAsp: 2.072 ± 0.822
2.26GlnGlu: 2.26 ± 0.467
1.13GlnPhe: 1.13 ± 0.52
2.449GlnGly: 2.449 ± 0.732
0.377GlnHis: 0.377 ± 0.178
1.507GlnIle: 1.507 ± 0.451
0.942GlnLys: 0.942 ± 0.437
3.014GlnLeu: 3.014 ± 1.117
0.565GlnMet: 0.565 ± 0.366
0.565GlnAsn: 0.565 ± 0.267
1.507GlnPro: 1.507 ± 0.562
2.072GlnGln: 2.072 ± 1.056
2.449GlnArg: 2.449 ± 0.547
1.695GlnSer: 1.695 ± 0.514
2.072GlnThr: 2.072 ± 0.708
1.13GlnVal: 1.13 ± 0.599
0.565GlnTrp: 0.565 ± 0.268
1.13GlnTyr: 1.13 ± 0.542
0.0GlnXaa: 0.0 ± 0.0
Arg
4.521ArgAla: 4.521 ± 1.473
0.377ArgCys: 0.377 ± 0.265
7.158ArgAsp: 7.158 ± 1.577
4.897ArgGlu: 4.897 ± 1.251
3.014ArgPhe: 3.014 ± 0.715
2.825ArgGly: 2.825 ± 0.505
0.565ArgHis: 0.565 ± 0.315
1.695ArgIle: 1.695 ± 0.486
3.014ArgLys: 3.014 ± 0.998
8.099ArgLeu: 8.099 ± 2.819
1.13ArgMet: 1.13 ± 0.394
2.26ArgAsn: 2.26 ± 0.594
1.319ArgPro: 1.319 ± 0.405
1.507ArgGln: 1.507 ± 0.42
7.158ArgArg: 7.158 ± 1.316
3.956ArgSer: 3.956 ± 0.681
3.579ArgThr: 3.579 ± 1.001
3.014ArgVal: 3.014 ± 0.651
0.753ArgTrp: 0.753 ± 0.312
0.942ArgTyr: 0.942 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
4.521SerAla: 4.521 ± 0.711
0.0SerCys: 0.0 ± 0.0
7.723SerAsp: 7.723 ± 1.332
4.332SerGlu: 4.332 ± 0.62
1.507SerPhe: 1.507 ± 0.474
5.651SerGly: 5.651 ± 1.3
0.753SerHis: 0.753 ± 0.361
3.202SerIle: 3.202 ± 0.685
1.884SerLys: 1.884 ± 0.721
5.839SerLeu: 5.839 ± 0.921
1.13SerMet: 1.13 ± 0.549
2.449SerAsn: 2.449 ± 0.755
3.956SerPro: 3.956 ± 0.873
1.884SerGln: 1.884 ± 0.58
3.579SerArg: 3.579 ± 0.974
6.404SerSer: 6.404 ± 1.455
5.274SerThr: 5.274 ± 1.057
4.521SerVal: 4.521 ± 0.884
1.884SerTrp: 1.884 ± 0.541
1.884SerTyr: 1.884 ± 0.566
0.0SerXaa: 0.0 ± 0.0
Thr
6.028ThrAla: 6.028 ± 1.049
0.188ThrCys: 0.188 ± 0.162
7.346ThrAsp: 7.346 ± 1.888
5.086ThrGlu: 5.086 ± 0.751
1.13ThrPhe: 1.13 ± 0.388
4.709ThrGly: 4.709 ± 1.069
1.13ThrHis: 1.13 ± 0.336
3.014ThrIle: 3.014 ± 0.677
0.753ThrLys: 0.753 ± 0.43
5.274ThrLeu: 5.274 ± 1.243
0.565ThrMet: 0.565 ± 0.368
2.072ThrAsn: 2.072 ± 0.636
3.579ThrPro: 3.579 ± 0.92
1.884ThrGln: 1.884 ± 0.757
2.072ThrArg: 2.072 ± 0.459
5.086ThrSer: 5.086 ± 0.906
7.723ThrThr: 7.723 ± 1.491
4.332ThrVal: 4.332 ± 1.031
0.942ThrTrp: 0.942 ± 0.32
0.565ThrTyr: 0.565 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
6.781ValAla: 6.781 ± 1.036
0.377ValCys: 0.377 ± 0.223
9.418ValAsp: 9.418 ± 1.011
6.781ValGlu: 6.781 ± 1.187
1.319ValPhe: 1.319 ± 0.604
6.404ValGly: 6.404 ± 1.069
0.377ValHis: 0.377 ± 0.259
2.072ValIle: 2.072 ± 0.695
1.13ValLys: 1.13 ± 0.461
5.651ValLeu: 5.651 ± 1.053
1.13ValMet: 1.13 ± 0.604
0.942ValAsn: 0.942 ± 0.304
2.072ValPro: 2.072 ± 0.616
1.884ValGln: 1.884 ± 0.457
3.202ValArg: 3.202 ± 0.742
4.332ValSer: 4.332 ± 1.235
5.086ValThr: 5.086 ± 1.187
4.332ValVal: 4.332 ± 0.903
0.565ValTrp: 0.565 ± 0.355
1.319ValTyr: 1.319 ± 0.431
0.0ValXaa: 0.0 ± 0.0
Trp
0.753TrpAla: 0.753 ± 0.27
0.0TrpCys: 0.0 ± 0.0
1.507TrpAsp: 1.507 ± 0.444
1.319TrpGlu: 1.319 ± 0.549
0.565TrpPhe: 0.565 ± 0.254
1.884TrpGly: 1.884 ± 0.977
0.377TrpHis: 0.377 ± 0.259
0.753TrpIle: 0.753 ± 0.384
0.565TrpLys: 0.565 ± 0.333
1.695TrpLeu: 1.695 ± 0.7
0.188TrpMet: 0.188 ± 0.276
0.753TrpAsn: 0.753 ± 0.489
1.319TrpPro: 1.319 ± 0.523
0.188TrpGln: 0.188 ± 0.221
1.319TrpArg: 1.319 ± 0.507
0.753TrpSer: 0.753 ± 0.287
1.507TrpThr: 1.507 ± 0.367
1.507TrpVal: 1.507 ± 0.579
0.377TrpTrp: 0.377 ± 0.313
0.565TrpTyr: 0.565 ± 0.329
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.26TyrAla: 2.26 ± 0.785
0.377TyrCys: 0.377 ± 0.239
1.884TyrAsp: 1.884 ± 0.512
1.319TyrGlu: 1.319 ± 0.442
1.13TyrPhe: 1.13 ± 0.516
2.072TyrGly: 2.072 ± 0.787
0.565TyrHis: 0.565 ± 0.338
1.13TyrIle: 1.13 ± 0.572
0.753TyrLys: 0.753 ± 0.335
1.695TyrLeu: 1.695 ± 0.446
0.0TyrMet: 0.0 ± 0.0
1.13TyrAsn: 1.13 ± 0.567
1.319TyrPro: 1.319 ± 0.585
1.319TyrGln: 1.319 ± 0.578
1.695TyrArg: 1.695 ± 0.673
1.695TyrSer: 1.695 ± 0.527
1.13TyrThr: 1.13 ± 0.49
1.13TyrVal: 1.13 ± 0.354
0.188TyrTrp: 0.188 ± 0.137
0.753TyrTyr: 0.753 ± 0.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (5310 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski