Amino acid dipeptide frequency for Gordonia phage Pupper

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
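As an illustration of what such a calculation involves, here is a minimal sketch in Python. It counts overlapping adjacent pairs within each protein and normalizes by the total number of pairs. The function name, the toy sequences and the choice of normalization are assumptions made for this example; the exact procedure used to produce the table is not described on this page.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    adjacent pairs counted across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (one-letter codes), not taken from the Pupper proteome.
toy = ["MAAC", "AC"]
freqs = dipeptide_per_mille(toy)
```

For the toy input the pairs are MA, AA, AC and AC, so AC accounts for half of all dipeptides.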

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
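The comparison against the uniform expectation can be sketched as follows. This assumes that the threshold is exactly 1/400 (or 1/441 with Xaa) and that values are given in per mille; the function names are hypothetical, and the page's actual coloring rule may use a different cut-off.

```python
STANDARD = 20  # standard amino acids
WITH_XAA = 21  # standard amino acids plus Xaa

def expected_per_mille(alphabet_size):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=STANDARD):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille(STANDARD))  # 2.5
print(classify(10.997))  # AlaAla value from the table
print(classify(0.146))   # CysCys value from the table
```

Under this rule AlaAla (10.997‰) falls on the overrepresented side and CysCys (0.146‰) on the underrepresented side.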

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. On this scale the expected 0.25% corresponds to 2.5‰.
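The unit conversion is simple, but it is easy to slip by a factor of ten, so a small sketch may help. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0

ala_ala = 10.997  # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.010997
percent = per_mille_to_percent(ala_ala)    # roughly 1.0997
```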

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.997AlaAla: 10.997 ± 0.695
0.687AlaCys: 0.687 ± 0.14
6.873AlaAsp: 6.873 ± 0.442
7.164AlaGlu: 7.164 ± 0.478
2.603AlaPhe: 2.603 ± 0.23
7.31AlaGly: 7.31 ± 0.505
2.062AlaHis: 2.062 ± 0.189
4.644AlaIle: 4.644 ± 0.356
3.749AlaLys: 3.749 ± 0.3
7.602AlaLeu: 7.602 ± 0.435
2.562AlaMet: 2.562 ± 0.207
3.249AlaAsn: 3.249 ± 0.262
4.811AlaPro: 4.811 ± 0.438
4.394AlaGln: 4.394 ± 0.347
6.311AlaArg: 6.311 ± 0.44
5.956AlaSer: 5.956 ± 0.418
6.102AlaThr: 6.102 ± 0.463
6.602AlaVal: 6.602 ± 0.415
1.375AlaTrp: 1.375 ± 0.151
2.916AlaTyr: 2.916 ± 0.235
0.0AlaXaa: 0.0 ± 0.0
Cys
0.646CysAla: 0.646 ± 0.107
0.146CysCys: 0.146 ± 0.059
0.604CysAsp: 0.604 ± 0.107
0.687CysGlu: 0.687 ± 0.12
0.271CysPhe: 0.271 ± 0.077
0.896CysGly: 0.896 ± 0.142
0.271CysHis: 0.271 ± 0.076
0.396CysIle: 0.396 ± 0.09
0.167CysLys: 0.167 ± 0.063
0.583CysLeu: 0.583 ± 0.116
0.125CysMet: 0.125 ± 0.055
0.208CysAsn: 0.208 ± 0.064
0.812CysPro: 0.812 ± 0.139
0.292CysGln: 0.292 ± 0.065
0.458CysArg: 0.458 ± 0.093
0.479CysSer: 0.479 ± 0.097
0.417CysThr: 0.417 ± 0.102
0.396CysVal: 0.396 ± 0.08
0.292CysTrp: 0.292 ± 0.07
0.229CysTyr: 0.229 ± 0.068
0.0CysXaa: 0.0 ± 0.0
Asp
6.706AspAla: 6.706 ± 0.4
0.458AspCys: 0.458 ± 0.084
6.415AspAsp: 6.415 ± 0.653
4.873AspGlu: 4.873 ± 0.403
2.624AspPhe: 2.624 ± 0.259
5.457AspGly: 5.457 ± 0.379
1.583AspHis: 1.583 ± 0.209
3.145AspIle: 3.145 ± 0.273
2.416AspLys: 2.416 ± 0.272
5.811AspLeu: 5.811 ± 0.309
1.25AspMet: 1.25 ± 0.184
1.958AspAsn: 1.958 ± 0.199
4.832AspPro: 4.832 ± 0.373
2.687AspGln: 2.687 ± 0.23
4.79AspArg: 4.79 ± 0.364
3.728AspSer: 3.728 ± 0.301
3.811AspThr: 3.811 ± 0.316
4.499AspVal: 4.499 ± 0.311
1.5AspTrp: 1.5 ± 0.208
2.249AspTyr: 2.249 ± 0.231
0.0AspXaa: 0.0 ± 0.0
Glu
6.998GluAla: 6.998 ± 0.479
0.666GluCys: 0.666 ± 0.139
4.103GluAsp: 4.103 ± 0.298
4.436GluGlu: 4.436 ± 0.409
2.333GluPhe: 2.333 ± 0.211
4.561GluGly: 4.561 ± 0.327
1.624GluHis: 1.624 ± 0.206
3.603GluIle: 3.603 ± 0.267
2.707GluLys: 2.707 ± 0.254
5.144GluLeu: 5.144 ± 0.397
1.874GluMet: 1.874 ± 0.264
2.208GluAsn: 2.208 ± 0.232
3.249GluPro: 3.249 ± 0.276
2.77GluGln: 2.77 ± 0.262
4.353GluArg: 4.353 ± 0.426
3.416GluSer: 3.416 ± 0.283
3.707GluThr: 3.707 ± 0.31
4.853GluVal: 4.853 ± 0.315
1.166GluTrp: 1.166 ± 0.163
1.77GluTyr: 1.77 ± 0.23
0.0GluXaa: 0.0 ± 0.0
Phe
2.52PheAla: 2.52 ± 0.226
0.375PheCys: 0.375 ± 0.106
2.437PheAsp: 2.437 ± 0.205
1.895PheGlu: 1.895 ± 0.215
0.875PhePhe: 0.875 ± 0.154
2.583PheGly: 2.583 ± 0.227
0.875PheHis: 0.875 ± 0.15
1.0PheIle: 1.0 ± 0.124
0.916PheLys: 0.916 ± 0.153
2.228PheLeu: 2.228 ± 0.248
0.75PheMet: 0.75 ± 0.132
1.062PheAsn: 1.062 ± 0.141
1.624PhePro: 1.624 ± 0.215
1.375PheGln: 1.375 ± 0.175
2.041PheArg: 2.041 ± 0.183
2.062PheSer: 2.062 ± 0.207
2.187PheThr: 2.187 ± 0.205
2.478PheVal: 2.478 ± 0.223
0.479PheTrp: 0.479 ± 0.094
1.145PheTyr: 1.145 ± 0.136
0.0PheXaa: 0.0 ± 0.0
Gly
6.81GlyAla: 6.81 ± 0.506
0.437GlyCys: 0.437 ± 0.101
5.894GlyAsp: 5.894 ± 0.601
5.394GlyGlu: 5.394 ± 0.336
2.583GlyPhe: 2.583 ± 0.238
8.768GlyGly: 8.768 ± 0.916
1.874GlyHis: 1.874 ± 0.163
3.291GlyIle: 3.291 ± 0.31
3.311GlyLys: 3.311 ± 0.282
6.206GlyLeu: 6.206 ± 0.348
2.083GlyMet: 2.083 ± 0.185
3.082GlyAsn: 3.082 ± 0.299
4.936GlyPro: 4.936 ± 0.624
3.749GlyGln: 3.749 ± 0.303
4.915GlyArg: 4.915 ± 0.307
5.415GlySer: 5.415 ± 0.36
5.248GlyThr: 5.248 ± 0.536
5.644GlyVal: 5.644 ± 0.322
1.666GlyTrp: 1.666 ± 0.17
2.583GlyTyr: 2.583 ± 0.24
0.0GlyXaa: 0.0 ± 0.0
His
2.041HisAla: 2.041 ± 0.218
0.229HisCys: 0.229 ± 0.078
1.583HisAsp: 1.583 ± 0.23
1.458HisGlu: 1.458 ± 0.183
0.791HisPhe: 0.791 ± 0.147
1.416HisGly: 1.416 ± 0.203
0.75HisHis: 0.75 ± 0.145
0.708HisIle: 0.708 ± 0.127
0.646HisLys: 0.646 ± 0.106
2.104HisLeu: 2.104 ± 0.208
0.354HisMet: 0.354 ± 0.085
0.625HisAsn: 0.625 ± 0.11
1.77HisPro: 1.77 ± 0.241
1.0HisGln: 1.0 ± 0.17
1.791HisArg: 1.791 ± 0.203
0.916HisSer: 0.916 ± 0.141
1.187HisThr: 1.187 ± 0.16
1.479HisVal: 1.479 ± 0.205
0.417HisTrp: 0.417 ± 0.1
0.729HisTyr: 0.729 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
3.749IleAla: 3.749 ± 0.255
0.354IleCys: 0.354 ± 0.085
3.436IleAsp: 3.436 ± 0.324
3.103IleGlu: 3.103 ± 0.294
1.229IlePhe: 1.229 ± 0.15
3.499IleGly: 3.499 ± 0.358
0.625IleHis: 0.625 ± 0.113
1.354IleIle: 1.354 ± 0.178
1.77IleLys: 1.77 ± 0.231
2.728IleLeu: 2.728 ± 0.261
0.646IleMet: 0.646 ± 0.108
1.458IleAsn: 1.458 ± 0.203
2.583IlePro: 2.583 ± 0.25
1.52IleGln: 1.52 ± 0.191
2.812IleArg: 2.812 ± 0.267
2.77IleSer: 2.77 ± 0.321
3.082IleThr: 3.082 ± 0.264
3.561IleVal: 3.561 ± 0.326
0.562IleTrp: 0.562 ± 0.116
1.145IleTyr: 1.145 ± 0.148
0.0IleXaa: 0.0 ± 0.0
Lys
3.686LysAla: 3.686 ± 0.325
0.271LysCys: 0.271 ± 0.081
2.166LysAsp: 2.166 ± 0.246
1.958LysGlu: 1.958 ± 0.202
1.104LysPhe: 1.104 ± 0.156
3.041LysGly: 3.041 ± 0.416
0.771LysHis: 0.771 ± 0.164
1.645LysIle: 1.645 ± 0.171
1.395LysLys: 1.395 ± 0.226
3.187LysLeu: 3.187 ± 0.24
0.625LysMet: 0.625 ± 0.106
1.145LysAsn: 1.145 ± 0.162
2.124LysPro: 2.124 ± 0.229
1.541LysGln: 1.541 ± 0.232
2.957LysArg: 2.957 ± 0.306
2.228LysSer: 2.228 ± 0.205
2.208LysThr: 2.208 ± 0.263
2.895LysVal: 2.895 ± 0.213
0.5LysTrp: 0.5 ± 0.111
1.187LysTyr: 1.187 ± 0.126
0.0LysXaa: 0.0 ± 0.0
Leu
8.247LeuAla: 8.247 ± 0.562
0.708LeuCys: 0.708 ± 0.142
5.686LeuAsp: 5.686 ± 0.39
3.978LeuGlu: 3.978 ± 0.317
2.062LeuPhe: 2.062 ± 0.203
5.561LeuGly: 5.561 ± 0.415
1.562LeuHis: 1.562 ± 0.191
2.791LeuIle: 2.791 ± 0.243
3.082LeuLys: 3.082 ± 0.293
5.103LeuLeu: 5.103 ± 0.325
1.999LeuMet: 1.999 ± 0.219
2.666LeuAsn: 2.666 ± 0.281
3.978LeuPro: 3.978 ± 0.293
3.416LeuGln: 3.416 ± 0.282
5.686LeuArg: 5.686 ± 0.379
5.394LeuSer: 5.394 ± 0.294
5.165LeuThr: 5.165 ± 0.412
5.019LeuVal: 5.019 ± 0.373
1.437LeuTrp: 1.437 ± 0.189
2.104LeuTyr: 2.104 ± 0.191
0.0LeuXaa: 0.0 ± 0.0
Met
2.583MetAla: 2.583 ± 0.217
0.229MetCys: 0.229 ± 0.07
1.0MetAsp: 1.0 ± 0.151
1.416MetGlu: 1.416 ± 0.212
0.604MetPhe: 0.604 ± 0.127
1.708MetGly: 1.708 ± 0.192
0.479MetHis: 0.479 ± 0.113
0.875MetIle: 0.875 ± 0.142
0.729MetLys: 0.729 ± 0.126
1.541MetLeu: 1.541 ± 0.177
0.521MetMet: 0.521 ± 0.129
0.541MetAsn: 0.541 ± 0.106
1.312MetPro: 1.312 ± 0.158
0.875MetGln: 0.875 ± 0.161
1.666MetArg: 1.666 ± 0.232
2.458MetSer: 2.458 ± 0.269
2.291MetThr: 2.291 ± 0.175
1.354MetVal: 1.354 ± 0.167
0.312MetTrp: 0.312 ± 0.076
0.521MetTyr: 0.521 ± 0.104
0.0MetXaa: 0.0 ± 0.0
Asn
3.353AsnAla: 3.353 ± 0.296
0.333AsnCys: 0.333 ± 0.092
2.145AsnAsp: 2.145 ± 0.236
1.874AsnGlu: 1.874 ± 0.219
0.916AsnPhe: 0.916 ± 0.154
3.645AsnGly: 3.645 ± 0.378
0.854AsnHis: 0.854 ± 0.124
1.083AsnIle: 1.083 ± 0.185
1.229AsnLys: 1.229 ± 0.158
2.603AsnLeu: 2.603 ± 0.296
0.875AsnMet: 0.875 ± 0.157
1.354AsnAsn: 1.354 ± 0.156
2.395AsnPro: 2.395 ± 0.217
1.208AsnGln: 1.208 ± 0.141
1.937AsnArg: 1.937 ± 0.193
1.749AsnSer: 1.749 ± 0.227
2.187AsnThr: 2.187 ± 0.287
2.104AsnVal: 2.104 ± 0.238
0.791AsnTrp: 0.791 ± 0.137
1.166AsnTyr: 1.166 ± 0.167
0.0AsnXaa: 0.0 ± 0.0
Pro
4.915ProAla: 4.915 ± 0.418
0.437ProCys: 0.437 ± 0.106
4.624ProAsp: 4.624 ± 0.338
4.624ProGlu: 4.624 ± 0.365
1.458ProPhe: 1.458 ± 0.164
5.832ProGly: 5.832 ± 0.444
1.291ProHis: 1.291 ± 0.18
2.145ProIle: 2.145 ± 0.205
2.041ProLys: 2.041 ± 0.287
3.728ProLeu: 3.728 ± 0.251
0.958ProMet: 0.958 ± 0.152
1.833ProAsn: 1.833 ± 0.245
3.187ProPro: 3.187 ± 0.393
2.291ProGln: 2.291 ± 0.229
2.749ProArg: 2.749 ± 0.268
3.582ProSer: 3.582 ± 0.297
3.374ProThr: 3.374 ± 0.251
4.499ProVal: 4.499 ± 0.331
0.791ProTrp: 0.791 ± 0.13
1.687ProTyr: 1.687 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
4.894GlnAla: 4.894 ± 0.364
0.25GlnCys: 0.25 ± 0.069
2.041GlnAsp: 2.041 ± 0.231
2.312GlnGlu: 2.312 ± 0.234
1.354GlnPhe: 1.354 ± 0.193
3.311GlnGly: 3.311 ± 0.474
0.812GlnHis: 0.812 ± 0.14
2.166GlnIle: 2.166 ± 0.195
1.25GlnLys: 1.25 ± 0.171
3.541GlnLeu: 3.541 ± 0.317
1.166GlnMet: 1.166 ± 0.167
1.312GlnAsn: 1.312 ± 0.171
1.874GlnPro: 1.874 ± 0.242
1.979GlnGln: 1.979 ± 0.274
3.02GlnArg: 3.02 ± 0.32
2.333GlnSer: 2.333 ± 0.247
2.041GlnThr: 2.041 ± 0.217
3.666GlnVal: 3.666 ± 0.32
0.916GlnTrp: 0.916 ± 0.127
1.145GlnTyr: 1.145 ± 0.149
0.0GlnXaa: 0.0 ± 0.0
Arg
6.186ArgAla: 6.186 ± 0.495
0.666ArgCys: 0.666 ± 0.139
4.061ArgAsp: 4.061 ± 0.305
5.228ArgGlu: 5.228 ± 0.475
1.687ArgPhe: 1.687 ± 0.166
4.499ArgGly: 4.499 ± 0.344
1.458ArgHis: 1.458 ± 0.195
2.874ArgIle: 2.874 ± 0.261
3.082ArgLys: 3.082 ± 0.379
5.269ArgLeu: 5.269 ± 0.342
1.666ArgMet: 1.666 ± 0.22
2.395ArgAsn: 2.395 ± 0.221
3.457ArgPro: 3.457 ± 0.225
3.187ArgGln: 3.187 ± 0.32
5.832ArgArg: 5.832 ± 0.475
3.77ArgSer: 3.77 ± 0.232
3.978ArgThr: 3.978 ± 0.304
4.478ArgVal: 4.478 ± 0.279
1.166ArgTrp: 1.166 ± 0.148
2.27ArgTyr: 2.27 ± 0.254
0.0ArgXaa: 0.0 ± 0.0
Ser
5.79SerAla: 5.79 ± 0.423
0.604SerCys: 0.604 ± 0.107
4.603SerAsp: 4.603 ± 0.364
3.374SerGlu: 3.374 ± 0.273
2.228SerPhe: 2.228 ± 0.222
6.061SerGly: 6.061 ± 0.432
1.145SerHis: 1.145 ± 0.194
2.562SerIle: 2.562 ± 0.208
1.979SerLys: 1.979 ± 0.2
4.811SerLeu: 4.811 ± 0.303
1.729SerMet: 1.729 ± 0.192
1.729SerAsn: 1.729 ± 0.22
3.082SerPro: 3.082 ± 0.234
2.478SerGln: 2.478 ± 0.269
3.811SerArg: 3.811 ± 0.261
4.186SerSer: 4.186 ± 0.33
4.332SerThr: 4.332 ± 0.331
4.186SerVal: 4.186 ± 0.305
1.208SerTrp: 1.208 ± 0.175
1.979SerTyr: 1.979 ± 0.203
0.0SerXaa: 0.0 ± 0.0
Thr
6.602ThrAla: 6.602 ± 0.617
0.521ThrCys: 0.521 ± 0.11
3.749ThrAsp: 3.749 ± 0.311
3.624ThrGlu: 3.624 ± 0.258
2.02ThrPhe: 2.02 ± 0.218
6.165ThrGly: 6.165 ± 0.542
1.375ThrHis: 1.375 ± 0.174
2.728ThrIle: 2.728 ± 0.231
2.437ThrLys: 2.437 ± 0.17
5.29ThrLeu: 5.29 ± 0.354
1.229ThrMet: 1.229 ± 0.177
2.791ThrAsn: 2.791 ± 0.281
4.207ThrPro: 4.207 ± 0.349
1.999ThrGln: 1.999 ± 0.2
3.541ThrArg: 3.541 ± 0.28
3.624ThrSer: 3.624 ± 0.402
4.499ThrThr: 4.499 ± 0.429
4.749ThrVal: 4.749 ± 0.384
1.083ThrTrp: 1.083 ± 0.15
1.77ThrTyr: 1.77 ± 0.195
0.0ThrXaa: 0.0 ± 0.0
Val
7.102ValAla: 7.102 ± 0.444
0.625ValCys: 0.625 ± 0.124
5.665ValAsp: 5.665 ± 0.309
5.415ValGlu: 5.415 ± 0.429
2.291ValPhe: 2.291 ± 0.243
5.665ValGly: 5.665 ± 0.398
1.458ValHis: 1.458 ± 0.179
2.937ValIle: 2.937 ± 0.276
2.333ValLys: 2.333 ± 0.231
4.749ValLeu: 4.749 ± 0.381
1.416ValMet: 1.416 ± 0.17
2.541ValAsn: 2.541 ± 0.262
3.707ValPro: 3.707 ± 0.278
2.666ValGln: 2.666 ± 0.291
4.686ValArg: 4.686 ± 0.377
4.624ValSer: 4.624 ± 0.326
5.103ValThr: 5.103 ± 0.355
5.373ValVal: 5.373 ± 0.352
1.25ValTrp: 1.25 ± 0.154
1.833ValTyr: 1.833 ± 0.2
0.0ValXaa: 0.0 ± 0.0
Trp
1.562TrpAla: 1.562 ± 0.19
0.146TrpCys: 0.146 ± 0.049
1.395TrpAsp: 1.395 ± 0.183
1.041TrpGlu: 1.041 ± 0.159
0.666TrpPhe: 0.666 ± 0.122
1.27TrpGly: 1.27 ± 0.156
0.458TrpHis: 0.458 ± 0.097
0.791TrpIle: 0.791 ± 0.117
0.875TrpLys: 0.875 ± 0.13
1.125TrpLeu: 1.125 ± 0.148
0.604TrpMet: 0.604 ± 0.122
0.771TrpAsn: 0.771 ± 0.116
0.604TrpPro: 0.604 ± 0.11
0.708TrpGln: 0.708 ± 0.121
1.479TrpArg: 1.479 ± 0.187
1.125TrpSer: 1.125 ± 0.133
1.083TrpThr: 1.083 ± 0.168
1.312TrpVal: 1.312 ± 0.174
0.354TrpTrp: 0.354 ± 0.091
0.417TrpTyr: 0.417 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.583TyrAla: 2.583 ± 0.232
0.229TyrCys: 0.229 ± 0.067
2.312TyrAsp: 2.312 ± 0.245
1.687TyrGlu: 1.687 ± 0.184
1.208TyrPhe: 1.208 ± 0.165
2.666TyrGly: 2.666 ± 0.269
0.666TyrHis: 0.666 ± 0.112
1.25TyrIle: 1.25 ± 0.155
0.521TyrLys: 0.521 ± 0.104
2.291TyrLeu: 2.291 ± 0.198
0.562TyrMet: 0.562 ± 0.108
0.875TyrAsn: 0.875 ± 0.142
1.5TyrPro: 1.5 ± 0.186
1.25TyrGln: 1.25 ± 0.151
2.374TyrArg: 2.374 ± 0.242
2.062TyrSer: 2.062 ± 0.176
1.999TyrThr: 1.999 ± 0.24
2.291TyrVal: 2.291 ± 0.241
0.521TyrTrp: 0.521 ± 0.117
0.875TyrTyr: 0.875 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 233 proteins (48016 amino acids)
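For readers who want to work with the values as listed above, the following sketch parses one table cell of the form "10.997AlaAla: 10.997 ± 0.695" into its parts. The regular expression is an assumption based only on the cell layout visible on this page, and the function name is hypothetical; the downloadable CSV file may be structured differently.

```python
import re

# Assumed cell layout: <value><First><Second>: <value> ± <error>
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}):\s*"
    r"(?P<value>\d+(?:\.\d+)?)\s*±\s*(?P<error>\d+(?:\.\d+)?)$"
)

def parse_cell(line):
    """Return (first, second, value, error), or None for non-cell lines."""
    m = CELL.match(line.strip())
    if m is None:
        return None  # e.g. a row label such as "Ala"
    return (m["first"], m["second"], float(m["value"]), float(m["error"]))

cell = parse_cell("10.997AlaAla: 10.997 ± 0.695")
```

Row labels and header lines do not match the pattern and yield None, so they can be skipped when reading the table line by line.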

Note: The error has been estimated by bootstrapping (×100) at the protein level
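A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is taken as the error. This is an assumed reading of the note above, not the database's actual code; the function names, the fixed seed, the pair-count normalization and the use of the sample standard deviation are all choices made for this example.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (pair-count normalization)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (one-letter codes), not real data.
toy = ["AAAA", "CCCC", "ACAC"]
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.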

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski