Amino acid dipeptide frequency for Gordonia phage Boopy

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
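As a rough illustration of how a dipeptide frequency can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice to normalize by the total number of counted dipeptides are assumptions made for this example; the page does not state the exact normalization it uses.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: normalizes by the total number of dipeptides
    counted, which is an assumption, not the database's documented method.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made up for illustration, not taken from the proteome).
toy_proteins = ["MAAL", "MAL"]
toy_freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(toy_freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The order of residues matters here: AlaCys and CysAla are counted as different dipeptides, which matches the separate entries for them in the table.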

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
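The arithmetic behind the expected frequency and the per mille units can be sketched as follows. The function names and the simple greater-than/less-than comparison are assumptions for illustration; the page does not say which threshold it uses for the red and blue marking.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=None):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_uniform_per_mille()
    if value_per_mille > expected:
        return "over"   # shown in red on the original page
    if value_per_mille < expected:
        return "under"  # shown in blue on the original page
    return "as expected"


print(expected_uniform_per_mille())    # 20 * 20 = 400 dipeptides
print(expected_uniform_per_mille(21))  # 21 * 21 = 441 dipeptides with Xaa
print(classify(9.147), classify(0.088))  # AlaAla and CysCys from the table
```

With 20 amino acids the uniform expectation is 1000/400 = 2.5‰, so AlaAla (9.147‰) is above it and CysCys (0.088‰) is below it.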

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.147 ± 0.728 | 0.647 ± 0.145 | 4.941 ± 0.37 | 5.912 ± 0.546 | 2.735 ± 0.276 | 5.706 ± 0.558 | 1.647 ± 0.21 | 5.853 ± 0.599 | 4.882 ± 0.417 | 8.353 ± 0.674 | 2.853 ± 0.252 | 3.382 ± 0.324 | 4.529 ± 0.397 | 3.323 ± 0.296 | 4.97 ± 0.375 | 5.47 ± 0.444 | 5.765 ± 0.423 | 6.47 ± 0.522 | 1.588 ± 0.196 | 2.5 ± 0.28 | 0.0 ± 0.0
Cys | 0.706 ± 0.153 | 0.088 ± 0.051 | 0.5 ± 0.121 | 0.676 ± 0.151 | 0.294 ± 0.093 | 1.147 ± 0.221 | 0.206 ± 0.08 | 0.382 ± 0.124 | 0.471 ± 0.139 | 0.5 ± 0.134 | 0.176 ± 0.075 | 0.353 ± 0.109 | 0.706 ± 0.165 | 0.118 ± 0.061 | 0.559 ± 0.144 | 0.676 ± 0.139 | 0.706 ± 0.126 | 0.794 ± 0.17 | 0.059 ± 0.044 | 0.206 ± 0.072 | 0.0 ± 0.0
Asp | 6.647 ± 0.548 | 0.529 ± 0.13 | 4.529 ± 0.594 | 4.912 ± 0.45 | 2.382 ± 0.244 | 4.323 ± 0.43 | 1.412 ± 0.209 | 3.412 ± 0.453 | 3.088 ± 0.323 | 5.235 ± 0.466 | 1.382 ± 0.208 | 2.471 ± 0.317 | 3.912 ± 0.348 | 1.618 ± 0.212 | 3.029 ± 0.256 | 3.794 ± 0.309 | 3.618 ± 0.36 | 4.823 ± 0.408 | 1.529 ± 0.192 | 2.029 ± 0.267 | 0.0 ± 0.0
Glu | 5.235 ± 0.464 | 0.735 ± 0.147 | 3.823 ± 0.349 | 3.941 ± 0.378 | 2.618 ± 0.256 | 2.912 ± 0.32 | 1.353 ± 0.198 | 3.765 ± 0.329 | 3.735 ± 0.391 | 5.529 ± 0.493 | 1.618 ± 0.267 | 2.471 ± 0.289 | 2.353 ± 0.29 | 2.5 ± 0.256 | 4.412 ± 0.343 | 3.618 ± 0.336 | 3.5 ± 0.365 | 4.088 ± 0.312 | 1.5 ± 0.225 | 2.059 ± 0.278 | 0.0 ± 0.0
Phe | 2.647 ± 0.301 | 0.147 ± 0.067 | 2.382 ± 0.266 | 2.176 ± 0.275 | 0.912 ± 0.208 | 2.5 ± 0.301 | 0.559 ± 0.135 | 1.971 ± 0.228 | 1.382 ± 0.221 | 2.176 ± 0.261 | 1.029 ± 0.203 | 1.088 ± 0.144 | 1.735 ± 0.224 | 0.912 ± 0.187 | 1.912 ± 0.284 | 1.706 ± 0.211 | 2.853 ± 0.305 | 2.176 ± 0.255 | 0.559 ± 0.121 | 1.235 ± 0.192 | 0.0 ± 0.0
Gly | 5.676 ± 0.427 | 0.706 ± 0.173 | 4.0 ± 0.392 | 4.382 ± 0.33 | 2.382 ± 0.323 | 5.235 ± 0.67 | 1.412 ± 0.224 | 3.97 ± 0.443 | 3.676 ± 0.346 | 5.235 ± 0.402 | 2.265 ± 0.271 | 2.588 ± 0.285 | 3.206 ± 0.295 | 2.412 ± 0.286 | 3.97 ± 0.364 | 4.235 ± 0.419 | 4.794 ± 0.456 | 4.853 ± 0.437 | 1.647 ± 0.22 | 2.588 ± 0.291 | 0.0 ± 0.0
His | 1.794 ± 0.296 | 0.147 ± 0.067 | 1.735 ± 0.224 | 1.088 ± 0.146 | 0.765 ± 0.148 | 1.412 ± 0.202 | 0.382 ± 0.107 | 1.118 ± 0.2 | 0.971 ± 0.173 | 1.765 ± 0.241 | 0.471 ± 0.107 | 0.794 ± 0.166 | 1.235 ± 0.18 | 0.441 ± 0.114 | 1.471 ± 0.227 | 1.0 ± 0.178 | 1.206 ± 0.219 | 1.588 ± 0.268 | 0.235 ± 0.073 | 1.147 ± 0.206 | 0.0 ± 0.0
Ile | 5.588 ± 0.455 | 0.941 ± 0.208 | 4.206 ± 0.325 | 3.735 ± 0.31 | 1.882 ± 0.223 | 3.618 ± 0.343 | 1.147 ± 0.214 | 2.676 ± 0.408 | 3.412 ± 0.454 | 4.294 ± 0.339 | 1.059 ± 0.183 | 2.676 ± 0.231 | 2.412 ± 0.306 | 1.912 ± 0.254 | 4.382 ± 0.308 | 3.353 ± 0.304 | 3.353 ± 0.287 | 3.618 ± 0.388 | 1.265 ± 0.182 | 1.265 ± 0.232 | 0.0 ± 0.0
Lys | 4.853 ± 0.463 | 0.588 ± 0.14 | 3.353 ± 0.316 | 2.735 ± 0.283 | 1.941 ± 0.214 | 2.618 ± 0.318 | 1.235 ± 0.202 | 3.088 ± 0.35 | 3.823 ± 0.513 | 3.912 ± 0.348 | 1.353 ± 0.229 | 1.941 ± 0.219 | 3.206 ± 0.393 | 1.794 ± 0.264 | 3.088 ± 0.327 | 3.823 ± 0.435 | 3.912 ± 0.359 | 3.647 ± 0.378 | 1.088 ± 0.166 | 2.059 ± 0.275 | 0.0 ± 0.0
Leu | 7.912 ± 0.511 | 0.647 ± 0.142 | 5.176 ± 0.517 | 5.5 ± 0.491 | 2.206 ± 0.244 | 5.706 ± 0.501 | 1.412 ± 0.201 | 3.912 ± 0.374 | 3.941 ± 0.345 | 6.323 ± 0.408 | 1.823 ± 0.218 | 3.147 ± 0.325 | 3.706 ± 0.29 | 2.882 ± 0.252 | 4.588 ± 0.368 | 5.176 ± 0.343 | 5.735 ± 0.341 | 5.294 ± 0.367 | 1.294 ± 0.202 | 2.118 ± 0.276 | 0.0 ± 0.0
Met | 2.353 ± 0.297 | 0.147 ± 0.073 | 1.235 ± 0.195 | 1.206 ± 0.202 | 0.618 ± 0.138 | 1.559 ± 0.274 | 0.441 ± 0.098 | 1.823 ± 0.235 | 1.176 ± 0.167 | 1.882 ± 0.306 | 0.353 ± 0.106 | 1.088 ± 0.168 | 1.235 ± 0.206 | 0.824 ± 0.189 | 1.412 ± 0.175 | 2.353 ± 0.236 | 3.0 ± 0.251 | 1.235 ± 0.179 | 0.5 ± 0.115 | 0.618 ± 0.152 | 0.0 ± 0.0
Asn | 3.823 ± 0.294 | 0.471 ± 0.116 | 2.706 ± 0.254 | 2.206 ± 0.243 | 1.235 ± 0.178 | 3.47 ± 0.382 | 0.882 ± 0.161 | 2.147 ± 0.24 | 2.382 ± 0.296 | 2.706 ± 0.276 | 1.0 ± 0.149 | 1.941 ± 0.43 | 2.647 ± 0.258 | 1.765 ± 0.218 | 2.382 ± 0.261 | 2.412 ± 0.263 | 2.147 ± 0.282 | 2.588 ± 0.277 | 0.765 ± 0.141 | 1.618 ± 0.193 | 0.0 ± 0.0
Pro | 4.588 ± 0.45 | 0.294 ± 0.112 | 3.323 ± 0.375 | 3.618 ± 0.385 | 1.529 ± 0.205 | 3.735 ± 0.294 | 1.088 ± 0.201 | 2.765 ± 0.297 | 3.382 ± 0.429 | 3.265 ± 0.327 | 0.941 ± 0.176 | 2.441 ± 0.282 | 2.823 ± 0.404 | 1.235 ± 0.187 | 2.471 ± 0.277 | 4.0 ± 0.371 | 3.323 ± 0.33 | 3.529 ± 0.342 | 0.794 ± 0.157 | 1.441 ± 0.217 | 0.0 ± 0.0
Gln | 2.676 ± 0.294 | 0.147 ± 0.067 | 1.823 ± 0.251 | 1.853 ± 0.222 | 0.941 ± 0.155 | 1.823 ± 0.258 | 0.441 ± 0.121 | 2.235 ± 0.297 | 1.618 ± 0.201 | 3.029 ± 0.318 | 1.353 ± 0.19 | 1.588 ± 0.214 | 1.706 ± 0.21 | 1.676 ± 0.207 | 2.353 ± 0.28 | 1.823 ± 0.243 | 2.088 ± 0.243 | 2.382 ± 0.258 | 0.794 ± 0.128 | 1.059 ± 0.168 | 0.0 ± 0.0
Arg | 5.265 ± 0.498 | 0.794 ± 0.2 | 4.059 ± 0.375 | 4.088 ± 0.369 | 2.059 ± 0.209 | 3.5 ± 0.381 | 1.206 ± 0.184 | 3.029 ± 0.331 | 3.794 ± 0.367 | 4.912 ± 0.415 | 1.912 ± 0.197 | 2.353 ± 0.336 | 3.088 ± 0.387 | 2.353 ± 0.275 | 4.588 ± 0.502 | 3.294 ± 0.318 | 3.706 ± 0.313 | 4.0 ± 0.332 | 1.118 ± 0.189 | 2.559 ± 0.337 | 0.0 ± 0.0
Ser | 5.382 ± 0.398 | 0.353 ± 0.113 | 4.265 ± 0.424 | 3.294 ± 0.233 | 1.853 ± 0.281 | 5.676 ± 0.452 | 1.206 ± 0.217 | 3.5 ± 0.325 | 3.0 ± 0.399 | 5.117 ± 0.437 | 1.618 ± 0.195 | 2.765 ± 0.326 | 2.765 ± 0.328 | 1.765 ± 0.223 | 3.97 ± 0.327 | 4.647 ± 0.626 | 4.47 ± 0.456 | 4.412 ± 0.392 | 1.618 ± 0.225 | 1.794 ± 0.244 | 0.0 ± 0.0
Thr | 5.97 ± 0.529 | 0.647 ± 0.155 | 3.882 ± 0.368 | 3.353 ± 0.348 | 2.0 ± 0.272 | 5.676 ± 0.388 | 1.471 ± 0.219 | 4.118 ± 0.347 | 3.529 ± 0.445 | 5.059 ± 0.368 | 1.382 ± 0.16 | 2.5 ± 0.306 | 4.029 ± 0.425 | 2.059 ± 0.243 | 4.294 ± 0.3 | 4.088 ± 0.522 | 3.912 ± 0.402 | 4.529 ± 0.49 | 1.323 ± 0.177 | 2.206 ± 0.244 | 0.0 ± 0.0
Val | 6.294 ± 0.475 | 0.676 ± 0.154 | 5.117 ± 0.376 | 4.5 ± 0.412 | 2.353 ± 0.254 | 4.206 ± 0.381 | 1.794 ± 0.212 | 4.588 ± 0.424 | 3.059 ± 0.346 | 5.206 ± 0.42 | 1.618 ± 0.207 | 3.029 ± 0.255 | 3.0 ± 0.291 | 2.088 ± 0.305 | 4.206 ± 0.445 | 4.265 ± 0.373 | 4.559 ± 0.335 | 5.559 ± 0.403 | 1.118 ± 0.235 | 1.706 ± 0.246 | 0.0 ± 0.0
Trp | 1.471 ± 0.192 | 0.265 ± 0.1 | 1.353 ± 0.22 | 1.029 ± 0.204 | 0.5 ± 0.125 | 1.412 ± 0.197 | 0.676 ± 0.155 | 1.323 ± 0.191 | 1.206 ± 0.195 | 1.529 ± 0.197 | 0.235 ± 0.09 | 1.294 ± 0.234 | 0.735 ± 0.158 | 0.588 ± 0.116 | 1.294 ± 0.209 | 1.471 ± 0.179 | 1.294 ± 0.21 | 1.088 ± 0.209 | 0.5 ± 0.119 | 0.559 ± 0.148 | 0.0 ± 0.0
Tyr | 2.647 ± 0.287 | 0.353 ± 0.092 | 2.265 ± 0.261 | 1.588 ± 0.255 | 0.853 ± 0.147 | 3.0 ± 0.328 | 0.794 ± 0.16 | 1.118 ± 0.178 | 1.529 ± 0.253 | 2.471 ± 0.288 | 0.588 ± 0.128 | 1.382 ± 0.186 | 1.559 ± 0.241 | 1.147 ± 0.153 | 2.323 ± 0.248 | 2.265 ± 0.232 | 2.088 ± 0.268 | 2.265 ± 0.289 | 0.529 ± 0.14 | 1.118 ± 0.206 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 162 proteins (34002 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
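A minimal sketch of a protein-level bootstrap is shown below, assuming that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies. Both points, along with the function names and the toy sequences, are assumptions for illustration; the page only states that bootstrapping was done 100 times at the protein level.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return stdev(estimates)


# Toy sequences in one-letter code (made up, not from the proteome).
toy_proteins = ["MAAL", "MKKL", "MAAAL", "MGL"]
print(pair_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling whole proteins rather than individual residues keeps adjacent pairs intact, so the spread reflects variation between proteins.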

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski