Amino acid dipepetide frequency for Arthrobacter phage Whytu

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.833AlaAla: 20.833 ± 3.9
0.204AlaCys: 0.204 ± 0.178
11.234AlaAsp: 11.234 ± 1.871
5.719AlaGlu: 5.719 ± 1.292
3.064AlaPhe: 3.064 ± 0.806
18.382AlaGly: 18.382 ± 2.66
2.042AlaHis: 2.042 ± 0.675
4.902AlaIle: 4.902 ± 0.94
3.881AlaLys: 3.881 ± 0.999
10.621AlaLeu: 10.621 ± 1.301
3.064AlaMet: 3.064 ± 0.728
4.085AlaAsn: 4.085 ± 1.016
4.902AlaPro: 4.902 ± 1.146
3.472AlaGln: 3.472 ± 0.619
6.332AlaArg: 6.332 ± 1.754
4.902AlaSer: 4.902 ± 1.047
7.353AlaThr: 7.353 ± 1.21
9.6AlaVal: 9.6 ± 1.319
0.817AlaTrp: 0.817 ± 0.558
1.838AlaTyr: 1.838 ± 0.602
0.0AlaXaa: 0.0 ± 0.0
Cys
0.204CysAla: 0.204 ± 0.204
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.408CysGlu: 0.408 ± 0.272
0.204CysPhe: 0.204 ± 0.237
0.613CysGly: 0.613 ± 0.307
0.408CysHis: 0.408 ± 0.254
0.204CysIle: 0.204 ± 0.178
0.204CysLys: 0.204 ± 0.218
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.408CysAsn: 0.408 ± 0.305
0.204CysPro: 0.204 ± 0.231
0.204CysGln: 0.204 ± 0.209
0.204CysArg: 0.204 ± 0.192
0.408CysSer: 0.408 ± 0.307
0.408CysThr: 0.408 ± 0.283
0.408CysVal: 0.408 ± 0.241
0.204CysTrp: 0.204 ± 0.231
0.204CysTyr: 0.204 ± 0.204
0.0CysXaa: 0.0 ± 0.0
Asp
9.6AspAla: 9.6 ± 1.699
0.204AspCys: 0.204 ± 0.178
3.676AspAsp: 3.676 ± 0.751
3.268AspGlu: 3.268 ± 0.835
1.225AspPhe: 1.225 ± 0.552
6.536AspGly: 6.536 ± 1.133
0.817AspHis: 0.817 ± 0.403
2.451AspIle: 2.451 ± 0.522
2.859AspLys: 2.859 ± 0.838
5.719AspLeu: 5.719 ± 1.173
3.268AspMet: 3.268 ± 0.906
2.859AspAsn: 2.859 ± 0.895
5.515AspPro: 5.515 ± 1.089
1.838AspGln: 1.838 ± 0.611
3.268AspArg: 3.268 ± 0.756
3.268AspSer: 3.268 ± 1.282
5.31AspThr: 5.31 ± 1.314
6.944AspVal: 6.944 ± 0.796
1.43AspTrp: 1.43 ± 0.421
2.451AspTyr: 2.451 ± 0.547
0.0AspXaa: 0.0 ± 0.0
Glu
8.578GluAla: 8.578 ± 2.286
0.204GluCys: 0.204 ± 0.237
4.493GluAsp: 4.493 ± 0.92
1.021GluGlu: 1.021 ± 0.413
1.021GluPhe: 1.021 ± 0.344
3.268GluGly: 3.268 ± 0.833
0.817GluHis: 0.817 ± 0.355
1.838GluIle: 1.838 ± 0.576
2.859GluLys: 2.859 ± 0.963
4.289GluLeu: 4.289 ± 0.885
0.613GluMet: 0.613 ± 0.316
1.225GluAsn: 1.225 ± 0.694
2.042GluPro: 2.042 ± 0.56
2.247GluGln: 2.247 ± 0.848
4.698GluArg: 4.698 ± 1.189
2.859GluSer: 2.859 ± 0.786
3.676GluThr: 3.676 ± 0.895
3.472GluVal: 3.472 ± 0.771
0.817GluTrp: 0.817 ± 0.336
1.021GluTyr: 1.021 ± 0.507
0.0GluXaa: 0.0 ± 0.0
Phe
4.289PheAla: 4.289 ± 0.903
0.204PheCys: 0.204 ± 0.231
3.064PheAsp: 3.064 ± 0.755
1.021PheGlu: 1.021 ± 0.774
1.021PhePhe: 1.021 ± 0.43
3.881PheGly: 3.881 ± 0.607
0.204PheHis: 0.204 ± 0.204
0.817PheIle: 0.817 ± 0.366
1.225PheLys: 1.225 ± 0.475
1.225PheLeu: 1.225 ± 0.359
0.817PheMet: 0.817 ± 0.446
0.408PheAsn: 0.408 ± 0.437
1.225PhePro: 1.225 ± 0.443
0.613PheGln: 0.613 ± 0.41
0.817PheArg: 0.817 ± 0.298
0.204PheSer: 0.204 ± 0.178
1.838PheThr: 1.838 ± 0.821
1.225PheVal: 1.225 ± 0.359
0.613PheTrp: 0.613 ± 0.268
0.408PheTyr: 0.408 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
10.212GlyAla: 10.212 ± 1.479
0.408GlyCys: 0.408 ± 0.284
5.106GlyAsp: 5.106 ± 1.144
4.698GlyGlu: 4.698 ± 0.901
4.493GlyPhe: 4.493 ± 0.702
7.353GlyGly: 7.353 ± 0.844
1.634GlyHis: 1.634 ± 0.506
4.698GlyIle: 4.698 ± 1.276
2.655GlyLys: 2.655 ± 0.87
7.149GlyLeu: 7.149 ± 0.932
1.634GlyMet: 1.634 ± 0.639
3.472GlyAsn: 3.472 ± 0.635
4.289GlyPro: 4.289 ± 0.759
1.021GlyGln: 1.021 ± 0.543
7.761GlyArg: 7.761 ± 1.435
5.515GlySer: 5.515 ± 0.749
9.395GlyThr: 9.395 ± 1.498
6.74GlyVal: 6.74 ± 1.091
1.838GlyTrp: 1.838 ± 0.658
2.451GlyTyr: 2.451 ± 0.734
0.0GlyXaa: 0.0 ± 0.0
His
1.838HisAla: 1.838 ± 0.708
0.613HisCys: 0.613 ± 0.379
0.817HisAsp: 0.817 ± 0.43
0.408HisGlu: 0.408 ± 0.317
0.408HisPhe: 0.408 ± 0.288
1.225HisGly: 1.225 ± 0.473
0.204HisHis: 0.204 ± 0.178
0.613HisIle: 0.613 ± 0.309
0.204HisLys: 0.204 ± 0.198
1.021HisLeu: 1.021 ± 0.443
0.0HisMet: 0.0 ± 0.0
0.408HisAsn: 0.408 ± 0.305
0.408HisPro: 0.408 ± 0.357
0.0HisGln: 0.0 ± 0.0
1.225HisArg: 1.225 ± 0.444
0.408HisSer: 0.408 ± 0.391
0.408HisThr: 0.408 ± 0.254
1.021HisVal: 1.021 ± 0.494
0.204HisTrp: 0.204 ± 0.204
0.204HisTyr: 0.204 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
5.106IleAla: 5.106 ± 1.189
0.0IleCys: 0.0 ± 0.0
3.064IleAsp: 3.064 ± 1.128
3.064IleGlu: 3.064 ± 0.885
0.613IlePhe: 0.613 ± 0.319
3.064IleGly: 3.064 ± 0.87
0.204IleHis: 0.204 ± 0.237
1.225IleIle: 1.225 ± 0.634
1.021IleLys: 1.021 ± 0.572
3.064IleLeu: 3.064 ± 0.694
0.204IleMet: 0.204 ± 0.204
2.042IleAsn: 2.042 ± 0.467
3.064IlePro: 3.064 ± 0.547
2.859IleGln: 2.859 ± 0.868
3.881IleArg: 3.881 ± 0.786
3.472IleSer: 3.472 ± 0.95
3.268IleThr: 3.268 ± 0.902
3.881IleVal: 3.881 ± 1.126
1.021IleTrp: 1.021 ± 0.418
0.817IleTyr: 0.817 ± 0.388
0.0IleXaa: 0.0 ± 0.0
Lys
4.493LysAla: 4.493 ± 1.163
0.0LysCys: 0.0 ± 0.0
2.451LysAsp: 2.451 ± 0.705
1.225LysGlu: 1.225 ± 0.448
0.204LysPhe: 0.204 ± 0.193
3.472LysGly: 3.472 ± 0.959
0.408LysHis: 0.408 ± 0.294
2.655LysIle: 2.655 ± 0.659
3.064LysLys: 3.064 ± 1.431
3.064LysLeu: 3.064 ± 0.888
0.817LysMet: 0.817 ± 0.401
1.225LysAsn: 1.225 ± 0.491
1.634LysPro: 1.634 ± 0.692
1.225LysGln: 1.225 ± 0.617
2.247LysArg: 2.247 ± 0.606
2.247LysSer: 2.247 ± 0.868
4.085LysThr: 4.085 ± 0.745
5.31LysVal: 5.31 ± 0.774
1.021LysTrp: 1.021 ± 0.551
0.204LysTyr: 0.204 ± 0.204
0.0LysXaa: 0.0 ± 0.0
Leu
9.6LeuAla: 9.6 ± 1.795
0.408LeuCys: 0.408 ± 0.272
6.944LeuAsp: 6.944 ± 1.017
5.719LeuGlu: 5.719 ± 0.796
2.042LeuPhe: 2.042 ± 0.697
5.31LeuGly: 5.31 ± 1.123
0.408LeuHis: 0.408 ± 0.301
2.859LeuIle: 2.859 ± 0.736
2.451LeuLys: 2.451 ± 0.794
5.515LeuLeu: 5.515 ± 1.326
1.225LeuMet: 1.225 ± 0.412
1.225LeuAsn: 1.225 ± 0.415
4.085LeuPro: 4.085 ± 0.637
2.042LeuGln: 2.042 ± 0.561
5.515LeuArg: 5.515 ± 1.383
4.289LeuSer: 4.289 ± 1.148
7.557LeuThr: 7.557 ± 1.44
4.902LeuVal: 4.902 ± 0.939
1.225LeuTrp: 1.225 ± 0.505
1.838LeuTyr: 1.838 ± 0.576
0.0LeuXaa: 0.0 ± 0.0
Met
2.247MetAla: 2.247 ± 0.715
0.0MetCys: 0.0 ± 0.0
1.021MetAsp: 1.021 ± 0.545
1.021MetGlu: 1.021 ± 0.429
0.408MetPhe: 0.408 ± 0.284
0.613MetGly: 0.613 ± 0.322
0.0MetHis: 0.0 ± 0.0
0.613MetIle: 0.613 ± 0.339
1.225MetLys: 1.225 ± 0.512
2.451MetLeu: 2.451 ± 0.758
0.204MetMet: 0.204 ± 0.195
1.021MetAsn: 1.021 ± 0.541
1.021MetPro: 1.021 ± 0.646
0.817MetGln: 0.817 ± 0.405
0.817MetArg: 0.817 ± 0.429
2.042MetSer: 2.042 ± 0.582
2.859MetThr: 2.859 ± 0.812
0.817MetVal: 0.817 ± 0.496
0.204MetTrp: 0.204 ± 0.193
0.613MetTyr: 0.613 ± 0.318
0.0MetXaa: 0.0 ± 0.0
Asn
3.472AsnAla: 3.472 ± 0.869
0.0AsnCys: 0.0 ± 0.0
1.838AsnAsp: 1.838 ± 0.501
1.838AsnGlu: 1.838 ± 0.476
0.613AsnPhe: 0.613 ± 0.287
3.881AsnGly: 3.881 ± 0.896
0.0AsnHis: 0.0 ± 0.0
1.021AsnIle: 1.021 ± 0.444
1.225AsnLys: 1.225 ± 0.437
1.634AsnLeu: 1.634 ± 0.653
0.613AsnMet: 0.613 ± 0.302
0.408AsnAsn: 0.408 ± 0.243
2.451AsnPro: 2.451 ± 0.853
0.613AsnGln: 0.613 ± 0.431
1.838AsnArg: 1.838 ± 0.558
1.43AsnSer: 1.43 ± 0.461
4.902AsnThr: 4.902 ± 1.055
2.859AsnVal: 2.859 ± 0.5
1.021AsnTrp: 1.021 ± 0.621
1.021AsnTyr: 1.021 ± 0.386
0.0AsnXaa: 0.0 ± 0.0
Pro
8.987ProAla: 8.987 ± 1.729
0.408ProCys: 0.408 ± 0.273
4.493ProAsp: 4.493 ± 0.959
3.064ProGlu: 3.064 ± 0.735
0.613ProPhe: 0.613 ± 0.385
7.149ProGly: 7.149 ± 1.228
0.613ProHis: 0.613 ± 0.385
3.064ProIle: 3.064 ± 0.732
3.268ProLys: 3.268 ± 0.788
2.655ProLeu: 2.655 ± 0.598
0.204ProMet: 0.204 ± 0.194
1.43ProAsn: 1.43 ± 0.552
2.451ProPro: 2.451 ± 1.009
0.613ProGln: 0.613 ± 0.296
2.247ProArg: 2.247 ± 0.588
2.655ProSer: 2.655 ± 0.711
3.676ProThr: 3.676 ± 0.864
7.149ProVal: 7.149 ± 0.984
0.204ProTrp: 0.204 ± 0.195
0.613ProTyr: 0.613 ± 0.404
0.0ProXaa: 0.0 ± 0.0
Gln
1.838GlnAla: 1.838 ± 0.554
0.0GlnCys: 0.0 ± 0.0
2.451GlnAsp: 2.451 ± 0.683
2.451GlnGlu: 2.451 ± 0.608
0.204GlnPhe: 0.204 ± 0.207
2.247GlnGly: 2.247 ± 0.619
0.204GlnHis: 0.204 ± 0.192
2.247GlnIle: 2.247 ± 0.585
0.817GlnLys: 0.817 ± 0.462
1.021GlnLeu: 1.021 ± 0.525
0.817GlnMet: 0.817 ± 0.524
1.634GlnAsn: 1.634 ± 0.686
0.613GlnPro: 0.613 ± 0.328
0.204GlnGln: 0.204 ± 0.193
2.042GlnArg: 2.042 ± 0.695
0.613GlnSer: 0.613 ± 0.363
1.838GlnThr: 1.838 ± 0.566
2.042GlnVal: 2.042 ± 0.572
0.817GlnTrp: 0.817 ± 0.518
0.613GlnTyr: 0.613 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
6.944ArgAla: 6.944 ± 0.899
0.408ArgCys: 0.408 ± 0.283
3.676ArgAsp: 3.676 ± 0.87
4.698ArgGlu: 4.698 ± 0.989
1.634ArgPhe: 1.634 ± 0.495
3.064ArgGly: 3.064 ± 0.844
1.225ArgHis: 1.225 ± 0.43
5.106ArgIle: 5.106 ± 1.261
3.064ArgLys: 3.064 ± 0.812
4.289ArgLeu: 4.289 ± 0.984
1.225ArgMet: 1.225 ± 0.618
2.042ArgAsn: 2.042 ± 0.708
5.106ArgPro: 5.106 ± 1.13
1.838ArgGln: 1.838 ± 0.681
6.74ArgArg: 6.74 ± 1.713
3.472ArgSer: 3.472 ± 0.822
3.268ArgThr: 3.268 ± 0.837
4.289ArgVal: 4.289 ± 1.087
0.613ArgTrp: 0.613 ± 0.411
1.838ArgTyr: 1.838 ± 0.612
0.0ArgXaa: 0.0 ± 0.0
Ser
6.127SerAla: 6.127 ± 1.505
0.817SerCys: 0.817 ± 0.441
2.655SerAsp: 2.655 ± 0.787
1.838SerGlu: 1.838 ± 0.504
1.838SerPhe: 1.838 ± 0.749
6.74SerGly: 6.74 ± 1.449
0.204SerHis: 0.204 ± 0.198
2.042SerIle: 2.042 ± 0.826
2.655SerLys: 2.655 ± 0.635
4.085SerLeu: 4.085 ± 0.578
1.225SerMet: 1.225 ± 0.382
1.838SerAsn: 1.838 ± 0.62
3.881SerPro: 3.881 ± 0.963
0.408SerGln: 0.408 ± 0.287
4.698SerArg: 4.698 ± 0.841
2.247SerSer: 2.247 ± 0.618
4.085SerThr: 4.085 ± 0.879
2.859SerVal: 2.859 ± 0.958
1.225SerTrp: 1.225 ± 0.513
1.225SerTyr: 1.225 ± 0.531
0.0SerXaa: 0.0 ± 0.0
Thr
9.804ThrAla: 9.804 ± 1.439
0.0ThrCys: 0.0 ± 0.0
4.289ThrAsp: 4.289 ± 0.886
3.064ThrGlu: 3.064 ± 0.8
1.43ThrPhe: 1.43 ± 0.44
6.332ThrGly: 6.332 ± 0.917
0.408ThrHis: 0.408 ± 0.272
6.127ThrIle: 6.127 ± 0.99
3.881ThrLys: 3.881 ± 1.192
6.332ThrLeu: 6.332 ± 1.072
1.634ThrMet: 1.634 ± 0.534
3.676ThrAsn: 3.676 ± 0.921
6.127ThrPro: 6.127 ± 1.322
1.021ThrGln: 1.021 ± 0.442
3.268ThrArg: 3.268 ± 0.916
3.472ThrSer: 3.472 ± 1.084
7.557ThrThr: 7.557 ± 1.47
8.374ThrVal: 8.374 ± 1.265
1.225ThrTrp: 1.225 ± 0.559
2.451ThrTyr: 2.451 ± 0.776
0.0ThrXaa: 0.0 ± 0.0
Val
8.17ValAla: 8.17 ± 1.184
0.613ValCys: 0.613 ± 0.349
8.17ValAsp: 8.17 ± 1.255
4.698ValGlu: 4.698 ± 1.003
2.451ValPhe: 2.451 ± 0.674
6.332ValGly: 6.332 ± 0.984
1.225ValHis: 1.225 ± 0.335
1.43ValIle: 1.43 ± 0.489
2.859ValLys: 2.859 ± 0.806
6.536ValLeu: 6.536 ± 1.451
1.838ValMet: 1.838 ± 0.625
2.247ValAsn: 2.247 ± 0.868
5.106ValPro: 5.106 ± 1.232
2.451ValGln: 2.451 ± 0.695
5.31ValArg: 5.31 ± 0.833
5.923ValSer: 5.923 ± 0.931
7.149ValThr: 7.149 ± 1.076
7.761ValVal: 7.761 ± 1.207
1.43ValTrp: 1.43 ± 0.595
2.042ValTyr: 2.042 ± 0.566
0.0ValXaa: 0.0 ± 0.0
Trp
2.042TrpAla: 2.042 ± 0.648
0.204TrpCys: 0.204 ± 0.231
1.43TrpAsp: 1.43 ± 0.666
1.021TrpGlu: 1.021 ± 0.436
1.021TrpPhe: 1.021 ± 0.365
0.613TrpGly: 0.613 ± 0.325
0.613TrpHis: 0.613 ± 0.307
0.613TrpIle: 0.613 ± 0.358
0.613TrpLys: 0.613 ± 0.28
2.859TrpLeu: 2.859 ± 1.008
0.0TrpMet: 0.0 ± 0.213
0.613TrpAsn: 0.613 ± 0.398
0.204TrpPro: 0.204 ± 0.237
0.408TrpGln: 0.408 ± 0.328
0.408TrpArg: 0.408 ± 0.243
1.634TrpSer: 1.634 ± 0.47
0.613TrpThr: 0.613 ± 0.353
1.838TrpVal: 1.838 ± 0.64
0.817TrpTrp: 0.817 ± 0.463
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.859TyrAla: 2.859 ± 0.962
0.204TyrCys: 0.204 ± 0.204
1.634TyrAsp: 1.634 ± 0.643
0.817TyrGlu: 0.817 ± 0.574
0.613TyrPhe: 0.613 ± 0.425
2.451TyrGly: 2.451 ± 0.674
0.0TyrHis: 0.0 ± 0.0
0.408TyrIle: 0.408 ± 0.281
1.021TyrLys: 1.021 ± 0.451
1.634TyrLeu: 1.634 ± 0.503
0.408TyrMet: 0.408 ± 0.288
0.408TyrAsn: 0.408 ± 0.305
1.225TyrPro: 1.225 ± 0.573
0.817TyrGln: 0.817 ± 0.5
1.225TyrArg: 1.225 ± 0.47
1.634TyrSer: 1.634 ± 0.537
1.225TyrThr: 1.225 ± 0.551
2.247TyrVal: 2.247 ± 0.606
1.021TyrTrp: 1.021 ± 0.358
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (4897 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski