Amino acid dipeptide frequency for Gordonia phage Blueberry

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
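As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation. The function name `dipeptide_per_mille`, the use of one-letter residue codes, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping residue pairs are counted inside each protein
    only, and the total number of counted pairs is the denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the phage proteome.
toy_proteins = ["MAAL", "AAG"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

The toy input contains five pairs (MA, AA, AL, AA, AG), so "AA" makes up two fifths of them, that is 400‰.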

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
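The unit conversion and the over/under-representation check can be sketched in a few lines of Python. The helper names are hypothetical, and the baseline of 1000/400 = 2.5‰ assumes that "expected" means the uniform expectation over the 400 standard dipeptides. The page does not state which baseline is used for the colouring.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
# Assumption: this is the baseline behind the red/blue marking.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysPhe.
ala_ala_fraction = per_mille_to_fraction(16.232)
ala_ala_label = classify(16.232)
cys_phe_label = classify(0.057)
```

Under this assumption AlaAla (16.232‰) is labelled over-represented and CysPhe (0.057‰) under-represented.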

For more information see sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell gives frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 16.232 ± 1.352 | 0.795 ± 0.243 | 8.229 ± 0.78 | 7.378 ± 0.6 | 3.689 ± 0.506 | 8.4 ± 0.81 | 2.27 ± 0.359 | 5.165 ± 0.435 | 3.292 ± 0.551 | 10.443 ± 0.812 | 2.384 ± 0.431 | 3.689 ± 0.437 | 5.505 ± 0.551 | 4.03 ± 0.399 | 8.173 ± 0.681 | 5.789 ± 0.519 | 7.321 ± 0.713 | 7.662 ± 0.788 | 1.816 ± 0.265 | 1.646 ± 0.214 | 0.0 ± 0.0
Cys | 0.795 ± 0.223 | 0.284 ± 0.145 | 1.022 ± 0.28 | 0.568 ± 0.202 | 0.057 ± 0.054 | 0.795 ± 0.269 | 0.624 ± 0.215 | 0.227 ± 0.113 | 0.17 ± 0.11 | 0.454 ± 0.15 | 0.284 ± 0.14 | 0.511 ± 0.171 | 0.851 ± 0.234 | 0.397 ± 0.156 | 0.681 ± 0.216 | 0.284 ± 0.147 | 0.511 ± 0.162 | 0.397 ± 0.153 | 0.284 ± 0.138 | 0.057 ± 0.058 | 0.0 ± 0.0
Asp | 7.321 ± 0.694 | 0.284 ± 0.138 | 5.959 ± 0.912 | 4.37 ± 0.565 | 1.759 ± 0.361 | 7.264 ± 0.867 | 1.703 ± 0.39 | 2.894 ± 0.363 | 1.305 ± 0.315 | 6.3 ± 0.59 | 1.419 ± 0.255 | 2.1 ± 0.326 | 4.37 ± 0.618 | 2.327 ± 0.341 | 4.994 ± 0.778 | 3.802 ± 0.4 | 4.313 ± 0.651 | 5.505 ± 0.514 | 1.135 ± 0.208 | 1.419 ± 0.32 | 0.0 ± 0.0
Glu | 6.697 ± 0.789 | 0.511 ± 0.181 | 2.781 ± 0.362 | 3.121 ± 0.595 | 2.497 ± 0.382 | 4.143 ± 0.591 | 1.476 ± 0.282 | 3.121 ± 0.417 | 2.27 ± 0.321 | 4.994 ± 0.814 | 1.135 ± 0.216 | 1.419 ± 0.292 | 3.689 ± 0.695 | 2.213 ± 0.394 | 5.505 ± 0.521 | 2.724 ± 0.36 | 3.008 ± 0.361 | 5.278 ± 0.602 | 1.192 ± 0.306 | 1.816 ± 0.316 | 0.0 ± 0.0
Phe | 3.348 ± 0.425 | 0.454 ± 0.146 | 1.986 ± 0.291 | 1.759 ± 0.29 | 0.795 ± 0.215 | 2.043 ± 0.315 | 0.454 ± 0.171 | 1.022 ± 0.288 | 1.022 ± 0.296 | 1.759 ± 0.392 | 0.568 ± 0.142 | 1.135 ± 0.255 | 1.532 ± 0.245 | 0.624 ± 0.161 | 2.043 ± 0.286 | 1.362 ± 0.284 | 2.213 ± 0.306 | 2.497 ± 0.376 | 0.341 ± 0.138 | 0.624 ± 0.174 | 0.0 ± 0.0
Gly | 8.57 ± 0.914 | 0.454 ± 0.155 | 5.619 ± 0.45 | 4.767 ± 0.569 | 2.838 ± 0.422 | 7.321 ± 0.796 | 1.646 ± 0.314 | 3.689 ± 0.465 | 3.178 ± 0.442 | 6.697 ± 0.974 | 1.816 ± 0.283 | 2.951 ± 0.519 | 4.143 ± 0.616 | 3.065 ± 0.45 | 5.846 ± 0.607 | 4.257 ± 0.501 | 4.484 ± 0.55 | 6.47 ± 0.626 | 2.213 ± 0.321 | 2.724 ± 0.31 | 0.0 ± 0.0
His | 2.44 ± 0.375 | 0.17 ± 0.104 | 1.873 ± 0.264 | 0.795 ± 0.186 | 0.681 ± 0.185 | 1.646 ± 0.282 | 0.681 ± 0.191 | 1.078 ± 0.24 | 0.341 ± 0.143 | 1.986 ± 0.288 | 0.227 ± 0.096 | 0.397 ± 0.152 | 1.759 ± 0.293 | 0.454 ± 0.132 | 1.873 ± 0.393 | 1.305 ± 0.26 | 1.703 ± 0.306 | 1.532 ± 0.33 | 0.284 ± 0.118 | 0.738 ± 0.199 | 0.0 ± 0.0
Ile | 5.959 ± 0.597 | 0.284 ± 0.125 | 4.2 ± 0.601 | 2.611 ± 0.398 | 0.851 ± 0.186 | 4.086 ± 0.572 | 0.908 ± 0.204 | 1.419 ± 0.306 | 1.532 ± 0.377 | 1.873 ± 0.337 | 0.341 ± 0.125 | 1.192 ± 0.251 | 2.554 ± 0.352 | 0.795 ± 0.184 | 4.37 ± 0.477 | 1.93 ± 0.319 | 3.405 ± 0.409 | 4.03 ± 0.484 | 0.284 ± 0.114 | 0.681 ± 0.241 | 0.0 ± 0.0
Lys | 3.235 ± 0.429 | 0.17 ± 0.088 | 1.873 ± 0.368 | 1.305 ± 0.241 | 1.078 ± 0.263 | 2.611 ± 0.382 | 0.454 ± 0.17 | 1.703 ± 0.271 | 1.93 ± 0.377 | 2.611 ± 0.39 | 0.511 ± 0.178 | 1.135 ± 0.264 | 2.327 ± 0.45 | 1.022 ± 0.274 | 1.873 ± 0.376 | 1.646 ± 0.317 | 2.554 ± 0.318 | 2.043 ± 0.326 | 1.192 ± 0.237 | 0.568 ± 0.205 | 0.0 ± 0.0
Leu | 9.762 ± 0.915 | 0.681 ± 0.207 | 5.278 ± 0.587 | 3.973 ± 0.52 | 2.043 ± 0.287 | 6.243 ± 0.668 | 1.532 ± 0.272 | 2.724 ± 0.364 | 1.589 ± 0.273 | 4.711 ± 0.648 | 1.816 ± 0.306 | 2.157 ± 0.332 | 4.767 ± 0.511 | 2.327 ± 0.379 | 5.505 ± 0.61 | 3.973 ± 0.402 | 6.243 ± 0.529 | 6.3 ± 0.558 | 1.873 ± 0.33 | 1.532 ± 0.261 | 0.0 ± 0.0
Met | 3.348 ± 0.537 | 0.284 ± 0.121 | 0.568 ± 0.163 | 1.078 ± 0.199 | 0.624 ± 0.183 | 1.703 ± 0.313 | 0.341 ± 0.141 | 0.795 ± 0.2 | 0.511 ± 0.167 | 1.305 ± 0.346 | 0.511 ± 0.145 | 0.568 ± 0.163 | 1.703 ± 0.295 | 0.511 ± 0.198 | 1.93 ± 0.447 | 1.589 ± 0.304 | 2.781 ± 0.348 | 0.965 ± 0.258 | 0.568 ± 0.184 | 0.341 ± 0.114 | 0.0 ± 0.0
Asn | 3.235 ± 0.397 | 0.227 ± 0.099 | 2.043 ± 0.286 | 0.851 ± 0.203 | 0.624 ± 0.173 | 3.235 ± 0.443 | 0.965 ± 0.228 | 0.965 ± 0.243 | 0.851 ± 0.2 | 2.1 ± 0.355 | 0.851 ± 0.226 | 1.135 ± 0.283 | 3.348 ± 0.382 | 0.965 ± 0.216 | 1.816 ± 0.359 | 1.93 ± 0.307 | 2.44 ± 0.433 | 1.986 ± 0.37 | 0.511 ± 0.152 | 0.851 ± 0.231 | 0.0 ± 0.0
Pro | 5.675 ± 0.674 | 0.851 ± 0.215 | 4.824 ± 0.587 | 5.278 ± 0.607 | 1.362 ± 0.294 | 4.824 ± 0.524 | 1.419 ± 0.319 | 2.724 ± 0.411 | 2.213 ± 0.362 | 3.235 ± 0.366 | 1.589 ± 0.337 | 1.93 ± 0.293 | 3.462 ± 0.577 | 2.27 ± 0.309 | 3.462 ± 0.486 | 3.008 ± 0.402 | 4.2 ± 0.576 | 3.746 ± 0.432 | 1.419 ± 0.252 | 1.305 ± 0.259 | 0.0 ± 0.0
Gln | 3.575 ± 0.498 | 0.227 ± 0.116 | 1.419 ± 0.261 | 1.192 ± 0.339 | 1.022 ± 0.197 | 2.27 ± 0.393 | 0.908 ± 0.203 | 1.589 ± 0.272 | 1.078 ± 0.24 | 3.575 ± 0.434 | 0.908 ± 0.207 | 0.908 ± 0.25 | 2.043 ± 0.445 | 1.873 ± 0.372 | 2.951 ± 0.468 | 1.93 ± 0.371 | 1.646 ± 0.319 | 3.178 ± 0.454 | 1.022 ± 0.255 | 1.078 ± 0.229 | 0.0 ± 0.0
Arg | 7.889 ± 0.678 | 0.965 ± 0.23 | 6.129 ± 0.636 | 4.994 ± 0.527 | 1.646 ± 0.373 | 6.64 ± 0.613 | 1.986 ± 0.369 | 3.689 ± 0.414 | 2.497 ± 0.305 | 5.789 ± 0.516 | 2.157 ± 0.386 | 2.781 ± 0.341 | 3.235 ± 0.475 | 3.065 ± 0.497 | 7.946 ± 0.965 | 3.632 ± 0.373 | 4.086 ± 0.415 | 5.902 ± 0.727 | 1.759 ± 0.395 | 1.362 ± 0.237 | 0.0 ± 0.0
Ser | 5.335 ± 0.548 | 0.227 ± 0.102 | 3.121 ± 0.461 | 3.689 ± 0.502 | 1.476 ± 0.275 | 4.711 ± 0.639 | 0.851 ± 0.245 | 2.497 ± 0.363 | 1.532 ± 0.269 | 2.724 ± 0.481 | 1.476 ± 0.21 | 1.362 ± 0.258 | 3.235 ± 0.376 | 1.703 ± 0.301 | 4.03 ± 0.506 | 2.327 ± 0.383 | 3.859 ± 0.424 | 3.859 ± 0.535 | 1.362 ± 0.243 | 0.965 ± 0.256 | 0.0 ± 0.0
Thr | 7.832 ± 0.735 | 0.511 ± 0.168 | 5.108 ± 0.571 | 4.257 ± 0.458 | 1.759 ± 0.344 | 5.392 ± 0.701 | 1.249 ± 0.258 | 3.178 ± 0.433 | 2.724 ± 0.382 | 5.165 ± 0.511 | 0.908 ± 0.242 | 2.157 ± 0.408 | 4.086 ± 0.447 | 2.724 ± 0.363 | 4.427 ± 0.375 | 3.178 ± 0.367 | 4.767 ± 0.595 | 5.846 ± 0.556 | 1.135 ± 0.301 | 1.646 ± 0.463 | 0.0 ± 0.0
Val | 8.74 ± 0.787 | 1.249 ± 0.315 | 5.846 ± 0.641 | 5.051 ± 0.566 | 1.703 ± 0.322 | 6.356 ± 0.576 | 1.078 ± 0.258 | 3.065 ± 0.4 | 2.327 ± 0.314 | 5.675 ± 0.715 | 2.043 ± 0.414 | 2.043 ± 0.37 | 3.746 ± 0.373 | 2.667 ± 0.417 | 6.867 ± 0.628 | 3.008 ± 0.431 | 5.562 ± 0.799 | 6.754 ± 0.58 | 1.419 ± 0.261 | 2.44 ± 0.324 | 0.0 ± 0.0
Trp | 1.589 ± 0.31 | 0.454 ± 0.148 | 1.192 ± 0.336 | 0.681 ± 0.236 | 0.624 ± 0.201 | 0.965 ± 0.234 | 0.511 ± 0.186 | 0.965 ± 0.21 | 0.681 ± 0.17 | 2.157 ± 0.305 | 0.568 ± 0.177 | 0.738 ± 0.324 | 1.476 ± 0.321 | 0.738 ± 0.182 | 2.043 ± 0.281 | 1.192 ± 0.247 | 1.192 ± 0.236 | 1.759 ± 0.304 | 0.624 ± 0.153 | 0.738 ± 0.179 | 0.0 ± 0.0
Tyr | 2.327 ± 0.429 | 0.17 ± 0.101 | 1.476 ± 0.288 | 1.873 ± 0.385 | 0.454 ± 0.193 | 2.1 ± 0.336 | 0.908 ± 0.207 | 0.795 ± 0.224 | 0.795 ± 0.195 | 1.476 ± 0.286 | 0.454 ± 0.137 | 0.624 ± 0.186 | 0.965 ± 0.239 | 0.624 ± 0.204 | 1.873 ± 0.328 | 1.476 ± 0.27 | 1.759 ± 0.303 | 1.93 ± 0.365 | 0.454 ± 0.136 | 0.511 ± 0.172 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 86 proteins (17621 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
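Protein-level bootstrapping can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the sample standard deviation of the replicate values is taken as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across n_rounds bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the phage proteome.
toy_proteins = ["MAALAA", "MKVAG", "AAGAA", "MLLK"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, so the estimate reflects variation between proteins.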

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski