Amino acid dipeptide frequency for Microbacterium phage Rhysand

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
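The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the one-letter input sequences, and the choice to count overlapping pairs within each protein and divide by the total number of pairs are all assumptions, so the values may differ slightly from those in the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    and normalised by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X, as in the 441-dipeptide scheme.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy_freqs = dipeptide_per_mille(["MAAG", "AAL"])
```

For the toy input, the five observed pairs are MA, AA, AG, AA and AL, so AA accounts for two fifths of all pairs.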

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
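As a worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names and the simple greater-than or less-than rule are assumptions for illustration; the exact rule used for the red and blue marking is not stated on this page.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille (2.5).
EXPECTED_PER_MILLE = 1000.0 / 400


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def fold_vs_uniform(per_mille):
    """Ratio of an observed frequency to the uniform expectation."""
    return per_mille / EXPECTED_PER_MILLE


def representation(per_mille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if per_mille > EXPECTED_PER_MILLE:
        return "overrepresented"
    if per_mille < EXPECTED_PER_MILLE:
        return "underrepresented"
    return "as expected"


# Two values from the table: AlaAla (23.003) and AlaCys (0.181).
ala_ala_fraction = to_fraction(23.003)   # roughly 0.023
ala_ala_fold = fold_vs_uniform(23.003)   # roughly 9.2 times the expectation
```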

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.003 ± 2.868
AlaCys: 0.181 ± 0.167
AlaAsp: 8.332 ± 1.295
AlaGlu: 9.056 ± 1.256
AlaPhe: 5.253 ± 0.717
AlaGly: 16.845 ± 1.776
AlaHis: 1.63 ± 0.665
AlaIle: 5.072 ± 0.932
AlaLys: 4.528 ± 0.878
AlaLeu: 12.86 ± 1.656
AlaMet: 3.441 ± 0.988
AlaAsn: 2.717 ± 0.875
AlaPro: 8.513 ± 1.124
AlaGln: 4.709 ± 1.152
AlaArg: 7.97 ± 1.163
AlaSer: 5.434 ± 0.965
AlaThr: 7.064 ± 1.598
AlaVal: 8.513 ± 1.76
AlaTrp: 1.992 ± 0.505
AlaTyr: 2.717 ± 0.685
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.181 ± 0.188
CysCys: 0.0 ± 0.0
CysAsp: 0.181 ± 0.157
CysGlu: 0.362 ± 0.22
CysPhe: 0.181 ± 0.157
CysGly: 0.181 ± 0.18
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.181 ± 0.154
CysLeu: 0.362 ± 0.242
CysMet: 0.0 ± 0.0
CysAsn: 0.362 ± 0.23
CysPro: 0.0 ± 0.0
CysGln: 0.181 ± 0.18
CysArg: 0.362 ± 0.24
CysSer: 0.0 ± 0.0
CysThr: 0.543 ± 0.402
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.362 ± 0.232
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.962 ± 1.806
AspCys: 0.181 ± 0.18
AspAsp: 3.079 ± 0.934
AspGlu: 3.26 ± 1.025
AspPhe: 0.725 ± 0.491
AspGly: 5.615 ± 0.99
AspHis: 1.268 ± 0.581
AspIle: 0.362 ± 0.22
AspLys: 0.181 ± 0.185
AspLeu: 7.97 ± 1.52
AspMet: 0.725 ± 0.436
AspAsn: 1.087 ± 0.391
AspPro: 2.717 ± 0.667
AspGln: 1.449 ± 0.579
AspArg: 3.441 ± 0.962
AspSer: 2.174 ± 0.424
AspThr: 3.26 ± 0.765
AspVal: 4.89 ± 1.158
AspTrp: 1.087 ± 0.42
AspTyr: 1.449 ± 0.446
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.064 ± 1.225
GluCys: 0.181 ± 0.18
GluAsp: 2.898 ± 0.602
GluGlu: 0.725 ± 0.409
GluPhe: 0.725 ± 0.308
GluGly: 3.804 ± 0.749
GluHis: 0.362 ± 0.224
GluIle: 4.347 ± 0.662
GluLys: 0.362 ± 0.249
GluLeu: 8.513 ± 1.376
GluMet: 0.362 ± 0.255
GluAsn: 3.441 ± 0.972
GluPro: 1.63 ± 0.658
GluGln: 3.623 ± 0.904
GluArg: 4.89 ± 1.243
GluSer: 3.26 ± 0.74
GluThr: 3.441 ± 0.672
GluVal: 2.174 ± 0.467
GluTrp: 1.087 ± 0.449
GluTyr: 1.449 ± 0.409
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.985 ± 0.925
PheCys: 0.0 ± 0.0
PheAsp: 2.355 ± 0.674
PheGlu: 1.449 ± 0.676
PhePhe: 0.181 ± 0.167
PheGly: 4.166 ± 0.855
PheHis: 0.181 ± 0.188
PheIle: 1.087 ± 0.273
PheLys: 0.543 ± 0.219
PheLeu: 2.717 ± 0.752
PheMet: 0.906 ± 0.255
PheAsn: 1.449 ± 0.528
PhePro: 0.725 ± 0.313
PheGln: 1.087 ± 0.607
PheArg: 1.63 ± 0.627
PheSer: 0.906 ± 0.427
PheThr: 3.441 ± 0.855
PheVal: 2.536 ± 0.476
PheTrp: 0.181 ± 0.187
PheTyr: 0.725 ± 0.387
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.135 ± 2.812
GlyCys: 0.181 ± 0.154
GlyAsp: 4.709 ± 1.228
GlyGlu: 3.623 ± 0.719
GlyPhe: 2.717 ± 0.494
GlyGly: 6.883 ± 1.235
GlyHis: 1.087 ± 0.579
GlyIle: 5.434 ± 0.824
GlyLys: 2.536 ± 0.556
GlyLeu: 7.97 ± 2.375
GlyMet: 1.63 ± 0.424
GlyAsn: 2.355 ± 0.561
GlyPro: 2.898 ± 1.024
GlyGln: 3.804 ± 0.903
GlyArg: 6.339 ± 0.863
GlySer: 7.788 ± 1.177
GlyThr: 4.166 ± 0.821
GlyVal: 9.962 ± 1.08
GlyTrp: 2.355 ± 0.645
GlyTyr: 1.992 ± 0.461
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.992 ± 0.474
HisCys: 0.0 ± 0.0
HisAsp: 1.268 ± 0.63
HisGlu: 0.0 ± 0.0
HisPhe: 0.543 ± 0.333
HisGly: 1.63 ± 0.539
HisHis: 0.0 ± 0.0
HisIle: 0.181 ± 0.157
HisLys: 0.362 ± 0.293
HisLeu: 1.087 ± 0.421
HisMet: 0.181 ± 0.167
HisAsn: 0.362 ± 0.259
HisPro: 0.725 ± 0.275
HisGln: 0.181 ± 0.157
HisArg: 1.268 ± 0.549
HisSer: 0.543 ± 0.319
HisThr: 1.087 ± 0.522
HisVal: 2.355 ± 0.613
HisTrp: 0.362 ± 0.215
HisTyr: 0.725 ± 0.296
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.528 ± 0.813
IleCys: 0.181 ± 0.174
IleAsp: 3.804 ± 0.802
IleGlu: 5.796 ± 0.938
IlePhe: 0.725 ± 0.291
IleGly: 3.441 ± 0.725
IleHis: 1.449 ± 0.438
IleIle: 0.906 ± 0.531
IleLys: 0.543 ± 0.27
IleLeu: 3.441 ± 1.344
IleMet: 0.181 ± 0.185
IleAsn: 0.725 ± 0.405
IlePro: 3.079 ± 0.949
IleGln: 1.087 ± 0.416
IleArg: 3.804 ± 0.732
IleSer: 3.985 ± 0.893
IleThr: 2.898 ± 0.612
IleVal: 3.804 ± 0.832
IleTrp: 0.362 ± 0.221
IleTyr: 0.362 ± 0.244
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.63 ± 0.491
LysCys: 0.0 ± 0.0
LysAsp: 1.449 ± 0.593
LysGlu: 0.543 ± 0.267
LysPhe: 0.725 ± 0.525
LysGly: 1.087 ± 0.473
LysHis: 0.362 ± 0.223
LysIle: 1.268 ± 0.532
LysLys: 0.543 ± 0.326
LysLeu: 1.63 ± 0.565
LysMet: 0.543 ± 0.267
LysAsn: 0.906 ± 0.544
LysPro: 0.543 ± 0.363
LysGln: 0.362 ± 0.226
LysArg: 2.174 ± 0.912
LysSer: 2.174 ± 0.484
LysThr: 1.992 ± 0.558
LysVal: 0.543 ± 0.392
LysTrp: 0.362 ± 0.231
LysTyr: 0.362 ± 0.215
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.962 ± 1.032
LeuCys: 0.543 ± 0.28
LeuAsp: 6.158 ± 0.971
LeuGlu: 2.174 ± 0.572
LeuPhe: 3.804 ± 0.841
LeuGly: 8.513 ± 1.29
LeuHis: 1.63 ± 0.562
LeuIle: 4.89 ± 1.287
LeuLys: 1.087 ± 0.375
LeuLeu: 7.426 ± 2.151
LeuMet: 2.536 ± 0.672
LeuAsn: 3.623 ± 0.986
LeuPro: 7.97 ± 1.527
LeuGln: 5.615 ± 1.065
LeuArg: 6.883 ± 1.066
LeuSer: 5.615 ± 0.967
LeuThr: 8.332 ± 1.268
LeuVal: 5.977 ± 1.174
LeuTrp: 0.543 ± 0.296
LeuTyr: 1.811 ± 0.584
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.898 ± 0.54
MetCys: 0.0 ± 0.0
MetAsp: 1.268 ± 0.422
MetGlu: 0.725 ± 0.347
MetPhe: 0.543 ± 0.251
MetGly: 2.355 ± 0.737
MetHis: 0.362 ± 0.22
MetIle: 1.449 ± 0.87
MetLys: 0.181 ± 0.157
MetLeu: 0.543 ± 0.382
MetMet: 0.0 ± 0.0
MetAsn: 1.087 ± 0.416
MetPro: 1.087 ± 0.416
MetGln: 0.362 ± 0.353
MetArg: 1.449 ± 0.727
MetSer: 2.174 ± 0.99
MetThr: 1.63 ± 0.54
MetVal: 1.449 ± 0.393
MetTrp: 0.181 ± 0.157
MetTyr: 0.181 ± 0.18
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.064 ± 0.957
AsnCys: 0.181 ± 0.157
AsnAsp: 0.181 ± 0.154
AsnGlu: 1.087 ± 0.373
AsnPhe: 0.362 ± 0.219
AsnGly: 3.985 ± 0.582
AsnHis: 0.543 ± 0.296
AsnIle: 0.906 ± 0.431
AsnLys: 0.725 ± 0.319
AsnLeu: 2.898 ± 0.856
AsnMet: 0.725 ± 0.362
AsnAsn: 1.268 ± 0.497
AsnPro: 1.449 ± 0.45
AsnGln: 0.0 ± 0.0
AsnArg: 2.355 ± 0.641
AsnSer: 2.174 ± 0.552
AsnThr: 1.087 ± 0.364
AsnVal: 2.355 ± 0.636
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.543 ± 0.309
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.056 ± 1.086
ProCys: 0.362 ± 0.256
ProAsp: 2.717 ± 0.822
ProGlu: 2.898 ± 0.765
ProPhe: 1.087 ± 0.391
ProGly: 3.804 ± 0.829
ProHis: 0.725 ± 0.403
ProIle: 2.174 ± 0.396
ProLys: 1.63 ± 0.57
ProLeu: 4.166 ± 0.925
ProMet: 0.906 ± 0.393
ProAsn: 1.449 ± 0.526
ProPro: 1.63 ± 0.809
ProGln: 2.174 ± 0.674
ProArg: 3.079 ± 0.852
ProSer: 3.623 ± 0.543
ProThr: 3.623 ± 0.736
ProVal: 3.985 ± 0.706
ProTrp: 0.362 ± 0.227
ProTyr: 1.449 ± 0.43
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.528 ± 1.061
GlnCys: 0.0 ± 0.0
GlnAsp: 1.449 ± 0.408
GlnGlu: 1.268 ± 0.405
GlnPhe: 1.992 ± 0.489
GlnGly: 2.717 ± 0.635
GlnHis: 1.087 ± 0.456
GlnIle: 1.811 ± 0.556
GlnLys: 0.543 ± 0.267
GlnLeu: 9.962 ± 1.579
GlnMet: 0.725 ± 0.491
GlnAsn: 1.449 ± 0.505
GlnPro: 2.898 ± 0.783
GlnGln: 2.717 ± 0.597
GlnArg: 3.441 ± 0.645
GlnSer: 1.63 ± 0.463
GlnThr: 1.268 ± 0.408
GlnVal: 1.63 ± 0.456
GlnTrp: 0.181 ± 0.177
GlnTyr: 0.725 ± 0.355
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.056 ± 1.636
ArgCys: 0.725 ± 0.342
ArgAsp: 5.072 ± 0.803
ArgGlu: 6.702 ± 1.421
ArgPhe: 2.536 ± 0.758
ArgGly: 4.528 ± 0.784
ArgHis: 0.906 ± 0.44
ArgIle: 3.26 ± 0.658
ArgLys: 1.268 ± 0.453
ArgLeu: 7.064 ± 1.313
ArgMet: 1.087 ± 0.387
ArgAsn: 1.63 ± 0.484
ArgPro: 2.536 ± 0.798
ArgGln: 4.347 ± 0.89
ArgArg: 8.513 ± 1.907
ArgSer: 2.717 ± 0.599
ArgThr: 2.536 ± 0.629
ArgVal: 5.796 ± 1.051
ArgTrp: 2.174 ± 0.653
ArgTyr: 1.992 ± 0.444
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.056 ± 1.044
SerCys: 0.362 ± 0.347
SerAsp: 2.536 ± 0.501
SerGlu: 3.441 ± 0.763
SerPhe: 1.087 ± 0.394
SerGly: 3.985 ± 0.707
SerHis: 1.087 ± 0.39
SerIle: 3.804 ± 0.781
SerLys: 1.087 ± 0.423
SerLeu: 3.985 ± 0.966
SerMet: 2.174 ± 0.577
SerAsn: 1.268 ± 0.526
SerPro: 1.811 ± 0.476
SerGln: 2.717 ± 0.672
SerArg: 3.623 ± 0.796
SerSer: 4.528 ± 0.798
SerThr: 4.347 ± 0.847
SerVal: 4.709 ± 0.857
SerTrp: 1.087 ± 0.426
SerTyr: 1.449 ± 0.462
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.056 ± 1.525
ThrCys: 0.0 ± 0.0
ThrAsp: 2.717 ± 0.839
ThrGlu: 3.26 ± 0.841
ThrPhe: 3.985 ± 0.89
ThrGly: 7.064 ± 1.735
ThrHis: 0.906 ± 0.441
ThrIle: 3.079 ± 0.973
ThrLys: 1.087 ± 0.488
ThrLeu: 4.709 ± 1.289
ThrMet: 1.63 ± 0.547
ThrAsn: 1.811 ± 0.515
ThrPro: 4.528 ± 0.94
ThrGln: 1.087 ± 0.572
ThrArg: 4.347 ± 0.943
ThrSer: 2.536 ± 0.663
ThrThr: 5.434 ± 1.12
ThrVal: 5.615 ± 1.049
ThrTrp: 1.087 ± 0.418
ThrTyr: 1.087 ± 0.521
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.868 ± 1.479
ValCys: 0.0 ± 0.0
ValAsp: 3.26 ± 0.708
ValGlu: 5.615 ± 1.139
ValPhe: 1.449 ± 0.642
ValGly: 6.158 ± 1.171
ValHis: 0.543 ± 0.278
ValIle: 3.441 ± 1.064
ValLys: 1.268 ± 0.525
ValLeu: 4.709 ± 1.088
ValMet: 1.087 ± 0.649
ValAsn: 2.174 ± 0.914
ValPro: 3.985 ± 0.674
ValGln: 4.528 ± 1.339
ValArg: 6.158 ± 1.351
ValSer: 5.434 ± 1.02
ValThr: 6.883 ± 1.305
ValVal: 6.339 ± 1.519
ValTrp: 0.543 ± 0.312
ValTyr: 1.449 ± 0.362
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.63 ± 0.477
TrpCys: 0.0 ± 0.0
TrpAsp: 0.543 ± 0.268
TrpGlu: 0.725 ± 0.398
TrpPhe: 1.087 ± 0.303
TrpGly: 0.725 ± 0.271
TrpHis: 0.725 ± 0.384
TrpIle: 1.268 ± 0.402
TrpLys: 0.0 ± 0.0
TrpLeu: 2.536 ± 0.612
TrpMet: 0.181 ± 0.18
TrpAsn: 0.362 ± 0.215
TrpPro: 0.362 ± 0.313
TrpGln: 0.725 ± 0.33
TrpArg: 0.543 ± 0.321
TrpSer: 0.725 ± 0.397
TrpThr: 1.449 ± 0.52
TrpVal: 0.362 ± 0.268
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.362 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.355 ± 0.626
TyrCys: 0.362 ± 0.22
TyrAsp: 0.906 ± 0.391
TyrGlu: 1.63 ± 0.357
TyrPhe: 0.543 ± 0.274
TyrGly: 3.079 ± 0.559
TyrHis: 0.0 ± 0.0
TyrIle: 0.362 ± 0.335
TyrLys: 0.543 ± 0.25
TyrLeu: 0.543 ± 0.293
TyrMet: 0.725 ± 0.36
TyrAsn: 0.362 ± 0.229
TyrPro: 1.811 ± 0.584
TyrGln: 1.268 ± 0.393
TyrArg: 2.174 ± 0.762
TyrSer: 0.725 ± 0.346
TyrThr: 0.543 ± 0.321
TyrVal: 2.898 ± 0.678
TyrTrp: 0.362 ± 0.335
TyrTyr: 0.725 ± 0.407
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (5522 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
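A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: the function name `bootstrap_error`, the fixed random seed, the within-protein pair counting, and the use of the standard deviation across replicates as the reported error are not confirmed by this page, which only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide's per mille frequency.

    Whole proteins are resampled with replacement (protein-level bootstrap);
    the frequency of `pair` is recomputed for each replicate, and the
    standard deviation of those values is returned (assumed error measure).
    """
    rng = random.Random(seed)
    values = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        values.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.pstdev(values)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_error = bootstrap_error(["MAAG", "AAL", "GGAV"], "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.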

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski