Amino acid dipepetide frequency for Microbacterium phage Zeta1847

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
24.354AlaAla: 24.354 ± 2.002
1.085AlaCys: 1.085 ± 0.438
11.261AlaAsp: 11.261 ± 0.986
12.007AlaGlu: 12.007 ± 1.05
3.595AlaPhe: 3.595 ± 0.51
8.683AlaGly: 8.683 ± 0.872
3.188AlaHis: 3.188 ± 0.386
6.445AlaIle: 6.445 ± 0.896
4.681AlaLys: 4.681 ± 0.689
13.093AlaLeu: 13.093 ± 1.414
2.646AlaMet: 2.646 ± 0.365
3.053AlaAsn: 3.053 ± 0.453
8.955AlaPro: 8.955 ± 0.724
1.899AlaGln: 1.899 ± 0.369
12.279AlaArg: 12.279 ± 1.12
7.937AlaSer: 7.937 ± 0.815
8.548AlaThr: 8.548 ± 0.646
9.158AlaVal: 9.158 ± 0.739
2.51AlaTrp: 2.51 ± 0.404
3.392AlaTyr: 3.392 ± 0.494
0.0AlaXaa: 0.0 ± 0.0
Cys
0.407CysAla: 0.407 ± 0.183
0.068CysCys: 0.068 ± 0.069
0.678CysAsp: 0.678 ± 0.261
0.271CysGlu: 0.271 ± 0.128
0.204CysPhe: 0.204 ± 0.106
0.95CysGly: 0.95 ± 0.314
0.271CysHis: 0.271 ± 0.145
0.136CysIle: 0.136 ± 0.092
0.475CysLys: 0.475 ± 0.198
0.407CysLeu: 0.407 ± 0.171
0.068CysMet: 0.068 ± 0.072
0.0CysAsn: 0.0 ± 0.0
0.678CysPro: 0.678 ± 0.23
0.407CysGln: 0.407 ± 0.163
0.543CysArg: 0.543 ± 0.189
0.543CysSer: 0.543 ± 0.188
0.407CysThr: 0.407 ± 0.19
0.339CysVal: 0.339 ± 0.163
0.136CysTrp: 0.136 ± 0.099
0.068CysTyr: 0.068 ± 0.06
0.0CysXaa: 0.0 ± 0.0
Asp
10.108AspAla: 10.108 ± 0.79
0.204AspCys: 0.204 ± 0.097
4.07AspAsp: 4.07 ± 0.637
3.935AspGlu: 3.935 ± 0.478
1.221AspPhe: 1.221 ± 0.244
7.327AspGly: 7.327 ± 0.967
1.085AspHis: 1.085 ± 0.216
2.51AspIle: 2.51 ± 0.381
2.374AspLys: 2.374 ± 0.381
6.309AspLeu: 6.309 ± 0.531
0.746AspMet: 0.746 ± 0.205
1.221AspAsn: 1.221 ± 0.237
4.138AspPro: 4.138 ± 0.517
0.136AspGln: 0.136 ± 0.084
4.342AspArg: 4.342 ± 0.491
2.306AspSer: 2.306 ± 0.428
3.935AspThr: 3.935 ± 0.479
4.477AspVal: 4.477 ± 0.644
1.018AspTrp: 1.018 ± 0.272
1.832AspTyr: 1.832 ± 0.55
0.0AspXaa: 0.0 ± 0.0
Glu
10.786GluAla: 10.786 ± 1.08
0.746GluCys: 0.746 ± 0.292
1.899GluAsp: 1.899 ± 0.324
3.731GluGlu: 3.731 ± 0.632
2.306GluPhe: 2.306 ± 0.405
4.477GluGly: 4.477 ± 0.496
1.832GluHis: 1.832 ± 0.389
0.814GluIle: 0.814 ± 0.199
0.407GluLys: 0.407 ± 0.154
11.736GluLeu: 11.736 ± 0.92
0.407GluMet: 0.407 ± 0.159
0.678GluAsn: 0.678 ± 0.189
4.816GluPro: 4.816 ± 1.018
1.018GluGln: 1.018 ± 0.256
5.902GluArg: 5.902 ± 0.789
3.121GluSer: 3.121 ± 0.473
5.02GluThr: 5.02 ± 0.737
4.749GluVal: 4.749 ± 0.536
1.085GluTrp: 1.085 ± 0.266
0.882GluTyr: 0.882 ± 0.269
0.0GluXaa: 0.0 ± 0.0
Phe
3.392PheAla: 3.392 ± 0.392
0.136PheCys: 0.136 ± 0.102
1.764PheAsp: 1.764 ± 0.273
1.832PheGlu: 1.832 ± 0.347
0.204PhePhe: 0.204 ± 0.109
2.239PheGly: 2.239 ± 0.367
0.543PheHis: 0.543 ± 0.184
0.407PheIle: 0.407 ± 0.154
0.95PheLys: 0.95 ± 0.252
1.832PheLeu: 1.832 ± 0.471
0.339PheMet: 0.339 ± 0.126
0.543PheAsn: 0.543 ± 0.175
1.628PhePro: 1.628 ± 0.44
0.475PheGln: 0.475 ± 0.143
2.646PheArg: 2.646 ± 0.451
0.882PheSer: 0.882 ± 0.206
2.781PheThr: 2.781 ± 0.581
1.696PheVal: 1.696 ± 0.299
0.204PheTrp: 0.204 ± 0.112
0.543PheTyr: 0.543 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
9.226GlyAla: 9.226 ± 0.897
0.543GlyCys: 0.543 ± 0.207
5.427GlyAsp: 5.427 ± 0.609
4.884GlyGlu: 4.884 ± 0.56
1.967GlyPhe: 1.967 ± 0.401
4.749GlyGly: 4.749 ± 0.51
1.153GlyHis: 1.153 ± 0.244
4.138GlyIle: 4.138 ± 0.626
3.528GlyLys: 3.528 ± 0.361
8.073GlyLeu: 8.073 ± 0.87
1.56GlyMet: 1.56 ± 0.423
1.764GlyAsn: 1.764 ± 0.285
4.206GlyPro: 4.206 ± 0.64
1.425GlyGln: 1.425 ± 0.318
5.02GlyArg: 5.02 ± 0.612
4.884GlySer: 4.884 ± 0.788
5.902GlyThr: 5.902 ± 1.157
6.105GlyVal: 6.105 ± 0.536
1.56GlyTrp: 1.56 ± 0.355
2.103GlyTyr: 2.103 ± 0.439
0.0GlyXaa: 0.0 ± 0.0
His
2.985HisAla: 2.985 ± 0.396
0.068HisCys: 0.068 ± 0.054
1.832HisAsp: 1.832 ± 0.339
1.018HisGlu: 1.018 ± 0.249
0.407HisPhe: 0.407 ± 0.162
1.764HisGly: 1.764 ± 0.35
0.611HisHis: 0.611 ± 0.198
0.882HisIle: 0.882 ± 0.225
0.678HisLys: 0.678 ± 0.235
2.103HisLeu: 2.103 ± 0.345
0.475HisMet: 0.475 ± 0.197
0.611HisAsn: 0.611 ± 0.202
1.357HisPro: 1.357 ± 0.24
0.136HisGln: 0.136 ± 0.092
2.239HisArg: 2.239 ± 0.358
0.678HisSer: 0.678 ± 0.204
1.085HisThr: 1.085 ± 0.221
1.221HisVal: 1.221 ± 0.287
0.475HisTrp: 0.475 ± 0.19
0.543HisTyr: 0.543 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
5.495IleAla: 5.495 ± 0.483
0.068IleCys: 0.068 ± 0.054
2.781IleAsp: 2.781 ± 0.342
2.781IleGlu: 2.781 ± 0.471
0.95IlePhe: 0.95 ± 0.267
2.646IleGly: 2.646 ± 0.375
0.339IleHis: 0.339 ± 0.145
1.56IleIle: 1.56 ± 0.304
1.153IleLys: 1.153 ± 0.318
3.867IleLeu: 3.867 ± 0.596
0.407IleMet: 0.407 ± 0.162
1.018IleAsn: 1.018 ± 0.231
1.899IlePro: 1.899 ± 0.379
0.746IleGln: 0.746 ± 0.261
3.324IleArg: 3.324 ± 0.397
0.95IleSer: 0.95 ± 0.228
2.306IleThr: 2.306 ± 0.405
3.799IleVal: 3.799 ± 0.606
0.136IleTrp: 0.136 ± 0.088
0.407IleTyr: 0.407 ± 0.168
0.0IleXaa: 0.0 ± 0.0
Lys
5.224LysAla: 5.224 ± 0.681
0.339LysCys: 0.339 ± 0.164
0.814LysAsp: 0.814 ± 0.262
0.407LysGlu: 0.407 ± 0.144
0.746LysPhe: 0.746 ± 0.235
2.171LysGly: 2.171 ± 0.42
0.746LysHis: 0.746 ± 0.226
0.746LysIle: 0.746 ± 0.298
0.204LysLys: 0.204 ± 0.117
3.935LysLeu: 3.935 ± 0.614
0.271LysMet: 0.271 ± 0.126
0.678LysAsn: 0.678 ± 0.208
3.256LysPro: 3.256 ± 0.659
0.95LysGln: 0.95 ± 0.264
3.121LysArg: 3.121 ± 0.568
1.832LysSer: 1.832 ± 0.39
1.967LysThr: 1.967 ± 0.328
2.306LysVal: 2.306 ± 0.375
0.543LysTrp: 0.543 ± 0.191
0.543LysTyr: 0.543 ± 0.202
0.0LysXaa: 0.0 ± 0.0
Leu
14.653LeuAla: 14.653 ± 1.077
0.746LeuCys: 0.746 ± 0.267
6.309LeuAsp: 6.309 ± 0.555
8.141LeuGlu: 8.141 ± 0.849
2.103LeuPhe: 2.103 ± 0.341
7.937LeuGly: 7.937 ± 0.933
1.628LeuHis: 1.628 ± 0.332
2.849LeuIle: 2.849 ± 0.549
3.324LeuLys: 3.324 ± 0.485
7.191LeuLeu: 7.191 ± 0.874
1.289LeuMet: 1.289 ± 0.281
2.306LeuAsn: 2.306 ± 0.392
5.766LeuPro: 5.766 ± 0.762
1.085LeuGln: 1.085 ± 0.318
8.48LeuArg: 8.48 ± 0.967
4.613LeuSer: 4.613 ± 0.616
5.156LeuThr: 5.156 ± 0.727
8.073LeuVal: 8.073 ± 0.661
1.085LeuTrp: 1.085 ± 0.272
1.899LeuTyr: 1.899 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
2.578MetAla: 2.578 ± 0.445
0.0MetCys: 0.0 ± 0.0
0.339MetAsp: 0.339 ± 0.141
0.543MetGlu: 0.543 ± 0.184
0.407MetPhe: 0.407 ± 0.166
1.425MetGly: 1.425 ± 0.273
0.339MetHis: 0.339 ± 0.189
0.0MetIle: 0.0 ± 0.0
0.136MetLys: 0.136 ± 0.113
1.832MetLeu: 1.832 ± 0.547
0.407MetMet: 0.407 ± 0.21
0.068MetAsn: 0.068 ± 0.06
0.611MetPro: 0.611 ± 0.144
0.543MetGln: 0.543 ± 0.156
1.832MetArg: 1.832 ± 0.345
2.442MetSer: 2.442 ± 0.455
1.628MetThr: 1.628 ± 0.432
0.814MetVal: 0.814 ± 0.222
0.0MetTrp: 0.0 ± 0.0
0.407MetTyr: 0.407 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
3.392AsnAla: 3.392 ± 0.471
0.0AsnCys: 0.0 ± 0.0
0.678AsnAsp: 0.678 ± 0.2
0.814AsnGlu: 0.814 ± 0.301
0.407AsnPhe: 0.407 ± 0.181
1.764AsnGly: 1.764 ± 0.475
0.543AsnHis: 0.543 ± 0.155
0.814AsnIle: 0.814 ± 0.208
0.407AsnLys: 0.407 ± 0.174
1.832AsnLeu: 1.832 ± 0.329
0.271AsnMet: 0.271 ± 0.129
0.339AsnAsn: 0.339 ± 0.157
2.103AsnPro: 2.103 ± 0.42
0.204AsnGln: 0.204 ± 0.119
1.357AsnArg: 1.357 ± 0.266
0.678AsnSer: 0.678 ± 0.197
2.171AsnThr: 2.171 ± 0.455
1.899AsnVal: 1.899 ± 0.46
0.204AsnTrp: 0.204 ± 0.105
0.611AsnTyr: 0.611 ± 0.181
0.0AsnXaa: 0.0 ± 0.0
Pro
8.005ProAla: 8.005 ± 0.767
0.814ProCys: 0.814 ± 0.261
5.088ProAsp: 5.088 ± 0.642
4.952ProGlu: 4.952 ± 0.942
1.425ProPhe: 1.425 ± 0.312
4.545ProGly: 4.545 ± 0.613
1.832ProHis: 1.832 ± 0.389
1.56ProIle: 1.56 ± 0.268
2.103ProLys: 2.103 ± 0.491
4.613ProLeu: 4.613 ± 0.523
0.95ProMet: 0.95 ± 0.223
0.95ProAsn: 0.95 ± 0.24
2.103ProPro: 2.103 ± 0.472
1.425ProGln: 1.425 ± 0.473
3.799ProArg: 3.799 ± 0.443
3.188ProSer: 3.188 ± 0.583
4.749ProThr: 4.749 ± 0.657
4.342ProVal: 4.342 ± 0.416
0.882ProTrp: 0.882 ± 0.276
1.018ProTyr: 1.018 ± 0.246
0.0ProXaa: 0.0 ± 0.0
Gln
3.121GlnAla: 3.121 ± 0.489
0.271GlnCys: 0.271 ± 0.152
0.95GlnAsp: 0.95 ± 0.247
0.882GlnGlu: 0.882 ± 0.208
0.339GlnPhe: 0.339 ± 0.103
1.289GlnGly: 1.289 ± 0.349
0.204GlnHis: 0.204 ± 0.112
0.678GlnIle: 0.678 ± 0.209
0.136GlnLys: 0.136 ± 0.084
1.289GlnLeu: 1.289 ± 0.36
0.339GlnMet: 0.339 ± 0.16
0.136GlnAsn: 0.136 ± 0.145
0.611GlnPro: 0.611 ± 0.204
0.611GlnGln: 0.611 ± 0.207
1.899GlnArg: 1.899 ± 0.327
1.018GlnSer: 1.018 ± 0.298
0.882GlnThr: 0.882 ± 0.205
0.95GlnVal: 0.95 ± 0.37
0.271GlnTrp: 0.271 ± 0.143
0.339GlnTyr: 0.339 ± 0.145
0.0GlnXaa: 0.0 ± 0.0
Arg
10.718ArgAla: 10.718 ± 1.041
0.611ArgCys: 0.611 ± 0.21
5.359ArgAsp: 5.359 ± 0.601
6.716ArgGlu: 6.716 ± 0.777
2.442ArgPhe: 2.442 ± 0.386
4.409ArgGly: 4.409 ± 0.376
2.239ArgHis: 2.239 ± 0.418
4.409ArgIle: 4.409 ± 0.422
3.731ArgLys: 3.731 ± 0.573
7.055ArgLeu: 7.055 ± 0.541
2.239ArgMet: 2.239 ± 0.385
1.628ArgAsn: 1.628 ± 0.338
4.681ArgPro: 4.681 ± 0.67
1.221ArgGln: 1.221 ± 0.351
8.073ArgArg: 8.073 ± 0.978
4.002ArgSer: 4.002 ± 0.613
4.613ArgThr: 4.613 ± 0.506
7.327ArgVal: 7.327 ± 0.73
1.425ArgTrp: 1.425 ± 0.304
1.696ArgTyr: 1.696 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
7.123SerAla: 7.123 ± 0.773
0.204SerCys: 0.204 ± 0.115
2.714SerAsp: 2.714 ± 0.392
2.714SerGlu: 2.714 ± 0.399
1.425SerPhe: 1.425 ± 0.368
6.309SerGly: 6.309 ± 0.883
1.085SerHis: 1.085 ± 0.317
2.171SerIle: 2.171 ± 0.345
1.492SerLys: 1.492 ± 0.328
4.477SerLeu: 4.477 ± 0.615
0.95SerMet: 0.95 ± 0.313
1.56SerAsn: 1.56 ± 0.369
2.917SerPro: 2.917 ± 0.452
0.407SerGln: 0.407 ± 0.156
3.595SerArg: 3.595 ± 0.437
2.646SerSer: 2.646 ± 0.416
4.952SerThr: 4.952 ± 0.541
3.121SerVal: 3.121 ± 0.52
1.221SerTrp: 1.221 ± 0.279
1.085SerTyr: 1.085 ± 0.327
0.0SerXaa: 0.0 ± 0.0
Thr
10.176ThrAla: 10.176 ± 0.8
0.407ThrCys: 0.407 ± 0.178
3.663ThrAsp: 3.663 ± 0.475
3.324ThrGlu: 3.324 ± 0.405
2.646ThrPhe: 2.646 ± 0.518
6.173ThrGly: 6.173 ± 0.554
1.492ThrHis: 1.492 ± 0.276
2.239ThrIle: 2.239 ± 0.368
2.239ThrLys: 2.239 ± 0.339
5.834ThrLeu: 5.834 ± 0.558
1.425ThrMet: 1.425 ± 0.266
2.103ThrAsn: 2.103 ± 0.386
4.274ThrPro: 4.274 ± 0.613
0.678ThrGln: 0.678 ± 0.233
6.105ThrArg: 6.105 ± 0.614
4.206ThrSer: 4.206 ± 0.608
5.902ThrThr: 5.902 ± 0.737
5.088ThrVal: 5.088 ± 0.519
0.814ThrTrp: 0.814 ± 0.194
1.153ThrTyr: 1.153 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
10.922ValAla: 10.922 ± 1.002
0.475ValCys: 0.475 ± 0.164
5.02ValAsp: 5.02 ± 0.664
4.749ValGlu: 4.749 ± 0.509
1.425ValPhe: 1.425 ± 0.34
5.902ValGly: 5.902 ± 0.664
1.425ValHis: 1.425 ± 0.281
3.256ValIle: 3.256 ± 0.443
2.171ValLys: 2.171 ± 0.3
6.105ValLeu: 6.105 ± 0.796
1.018ValMet: 1.018 ± 0.243
1.425ValAsn: 1.425 ± 0.286
3.053ValPro: 3.053 ± 0.466
2.035ValGln: 2.035 ± 0.369
5.97ValArg: 5.97 ± 0.626
4.477ValSer: 4.477 ± 0.535
4.816ValThr: 4.816 ± 0.525
4.884ValVal: 4.884 ± 0.587
1.221ValTrp: 1.221 ± 0.291
2.306ValTyr: 2.306 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
2.578TrpAla: 2.578 ± 0.421
0.204TrpCys: 0.204 ± 0.112
0.814TrpAsp: 0.814 ± 0.28
1.56TrpGlu: 1.56 ± 0.321
0.204TrpPhe: 0.204 ± 0.114
0.882TrpGly: 0.882 ± 0.253
0.339TrpHis: 0.339 ± 0.204
0.068TrpIle: 0.068 ± 0.065
0.136TrpLys: 0.136 ± 0.083
1.492TrpLeu: 1.492 ± 0.33
0.204TrpMet: 0.204 ± 0.139
0.271TrpAsn: 0.271 ± 0.127
0.611TrpPro: 0.611 ± 0.227
0.475TrpGln: 0.475 ± 0.156
1.967TrpArg: 1.967 ± 0.3
0.746TrpSer: 0.746 ± 0.261
1.085TrpThr: 1.085 ± 0.353
1.289TrpVal: 1.289 ± 0.29
0.339TrpTrp: 0.339 ± 0.166
0.204TrpTyr: 0.204 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.528TyrAla: 3.528 ± 0.435
0.068TyrCys: 0.068 ± 0.065
1.832TyrAsp: 1.832 ± 0.356
1.153TyrGlu: 1.153 ± 0.254
0.611TyrPhe: 0.611 ± 0.177
2.578TyrGly: 2.578 ± 0.559
0.271TyrHis: 0.271 ± 0.119
0.95TyrIle: 0.95 ± 0.258
0.475TyrLys: 0.475 ± 0.17
1.56TyrLeu: 1.56 ± 0.273
0.271TyrMet: 0.271 ± 0.118
0.136TyrAsn: 0.136 ± 0.086
0.746TyrPro: 0.746 ± 0.222
0.339TyrGln: 0.339 ± 0.161
2.103TyrArg: 2.103 ± 0.397
1.018TyrSer: 1.018 ± 0.245
2.035TyrThr: 2.035 ± 0.316
1.018TyrVal: 1.018 ± 0.235
0.339TyrTrp: 0.339 ± 0.139
0.746TyrTyr: 0.746 ± 0.218
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (14742 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski