Amino acid dipeptide frequency for Vibrio phage Aphrodite1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
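As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), and the choice not to pair residues across protein boundaries are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides (hypothetical helper).

    Pairs are counted inside each protein only, so the last residue of one
    protein is never paired with the first residue of the next.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 pairs, including those that never occur (frequency 0).
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


toy = ["MKLLA", "MKXA"]  # made-up sequences, not taken from the phage proteome
freqs = dipeptide_frequencies(toy)
print(len(freqs), round(freqs["MK"], 3))
```

The toy input contains seven overlapping pairs, two of which are MK, so MK accounts for two sevenths of all pairs.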

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
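To connect the uniform expectation with the per-mille values in the table, here is a minimal sketch. The helper names `permille_to_fraction` and `classify` are made up for this example, and the rule of comparing the mean directly against 1/400 (ignoring the reported error) is an assumption about how the red and blue marking might be derived, not a documented procedure.

```python
# Uniform expectation: each of the 400 standard dipeptides is equally likely.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value):
    """Convert a per-mille value from the table to a plain fraction."""
    return value * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"   # would correspond to red in the original table
    if value_permille < expected:
        return "under"  # would correspond to blue in the original table
    return "as expected"


# Mean values copied from the table in this section (per mille).
examples = {"GluLeu": 8.397, "LeuLeu": 7.713, "CysCys": 0.096, "TrpTrp": 0.123}
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```

Under this rule GluLeu and LeuLeu come out as overrepresented, and CysCys and TrpTrp as underrepresented.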

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.164 ± 0.325
AlaCys: 0.411 ± 0.067
AlaAsp: 3.069 ± 0.214
AlaGlu: 3.74 ± 0.252
AlaPhe: 2.288 ± 0.154
AlaGly: 4.0 ± 0.255
AlaHis: 1.096 ± 0.106
AlaIle: 3.521 ± 0.221
AlaLys: 3.548 ± 0.262
AlaLeu: 5.48 ± 0.35
AlaMet: 1.425 ± 0.161
AlaAsn: 2.712 ± 0.22
AlaPro: 2.027 ± 0.175
AlaGln: 2.0 ± 0.187
AlaArg: 2.726 ± 0.208
AlaSer: 3.589 ± 0.263
AlaThr: 3.452 ± 0.318
AlaVal: 4.274 ± 0.251
AlaTrp: 0.74 ± 0.1
AlaTyr: 2.329 ± 0.212
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.315 ± 0.069
CysCys: 0.096 ± 0.04
CysAsp: 0.342 ± 0.071
CysGlu: 0.521 ± 0.08
CysPhe: 0.37 ± 0.081
CysGly: 0.575 ± 0.096
CysHis: 0.164 ± 0.048
CysIle: 0.425 ± 0.08
CysLys: 0.397 ± 0.074
CysLeu: 0.726 ± 0.108
CysMet: 0.205 ± 0.053
CysAsn: 0.315 ± 0.063
CysPro: 0.411 ± 0.087
CysGln: 0.164 ± 0.041
CysArg: 0.452 ± 0.076
CysSer: 0.493 ± 0.093
CysThr: 0.329 ± 0.062
CysVal: 0.507 ± 0.098
CysTrp: 0.123 ± 0.038
CysTyr: 0.247 ± 0.058
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.466 ± 0.207
AspCys: 0.329 ± 0.067
AspAsp: 3.973 ± 0.236
AspGlu: 4.959 ± 0.261
AspPhe: 2.959 ± 0.173
AspGly: 5.0 ± 0.453
AspHis: 1.575 ± 0.171
AspIle: 4.082 ± 0.229
AspLys: 3.63 ± 0.225
AspLeu: 6.411 ± 0.333
AspMet: 1.795 ± 0.164
AspAsn: 2.836 ± 0.184
AspPro: 3.233 ± 0.203
AspGln: 2.754 ± 0.226
AspArg: 3.301 ± 0.245
AspSer: 3.315 ± 0.208
AspThr: 3.822 ± 0.241
AspVal: 4.575 ± 0.299
AspTrp: 0.795 ± 0.094
AspTyr: 2.726 ± 0.223
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.137 ± 0.284
GluCys: 0.685 ± 0.106
GluAsp: 4.425 ± 0.294
GluGlu: 5.945 ± 0.44
GluPhe: 3.411 ± 0.216
GluGly: 4.562 ± 0.293
GluHis: 1.699 ± 0.176
GluIle: 4.219 ± 0.25
GluLys: 3.343 ± 0.299
GluLeu: 8.397 ± 0.406
GluMet: 1.959 ± 0.143
GluAsn: 3.123 ± 0.172
GluPro: 2.534 ± 0.224
GluGln: 3.096 ± 0.306
GluArg: 3.795 ± 0.262
GluSer: 4.891 ± 0.195
GluThr: 4.877 ± 0.229
GluVal: 5.836 ± 0.35
GluTrp: 1.014 ± 0.117
GluTyr: 2.945 ± 0.209
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.986 ± 0.17
PheCys: 0.315 ± 0.075
PheAsp: 3.603 ± 0.21
PheGlu: 2.808 ± 0.186
PhePhe: 1.685 ± 0.172
PheGly: 2.767 ± 0.19
PheHis: 0.767 ± 0.115
PheIle: 2.507 ± 0.215
PheLys: 2.726 ± 0.193
PheLeu: 2.918 ± 0.198
PheMet: 1.164 ± 0.145
PheAsn: 2.836 ± 0.187
PhePro: 1.589 ± 0.138
PheGln: 1.041 ± 0.118
PheArg: 2.206 ± 0.17
PheSer: 3.137 ± 0.225
PheThr: 3.178 ± 0.227
PheVal: 2.301 ± 0.195
PheTrp: 0.315 ± 0.067
PheTyr: 1.74 ± 0.149
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.206 ± 0.214
GlyCys: 0.384 ± 0.076
GlyAsp: 4.562 ± 0.314
GlyGlu: 5.233 ± 0.354
GlyPhe: 2.356 ± 0.156
GlyGly: 3.849 ± 0.284
GlyHis: 1.164 ± 0.144
GlyIle: 3.986 ± 0.273
GlyLys: 4.589 ± 0.266
GlyLeu: 5.891 ± 0.279
GlyMet: 2.343 ± 0.171
GlyAsn: 3.945 ± 0.357
GlyPro: 1.507 ± 0.165
GlyGln: 2.137 ± 0.204
GlyArg: 3.315 ± 0.255
GlySer: 3.589 ± 0.247
GlyThr: 4.343 ± 0.448
GlyVal: 4.891 ± 0.263
GlyTrp: 0.863 ± 0.114
GlyTyr: 2.493 ± 0.2
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.219 ± 0.113
HisCys: 0.205 ± 0.059
HisAsp: 1.288 ± 0.169
HisGlu: 1.288 ± 0.115
HisPhe: 1.027 ± 0.114
HisGly: 1.219 ± 0.144
HisHis: 0.795 ± 0.113
HisIle: 1.192 ± 0.147
HisLys: 1.027 ± 0.136
HisLeu: 2.411 ± 0.24
HisMet: 0.411 ± 0.061
HisAsn: 0.973 ± 0.115
HisPro: 1.329 ± 0.151
HisGln: 1.096 ± 0.121
HisArg: 1.137 ± 0.135
HisSer: 1.206 ± 0.161
HisThr: 1.397 ± 0.155
HisVal: 1.301 ± 0.131
HisTrp: 0.315 ± 0.076
HisTyr: 1.069 ± 0.114
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.589 ± 0.229
IleCys: 0.274 ± 0.056
IleAsp: 4.452 ± 0.235
IleGlu: 4.754 ± 0.297
IlePhe: 1.356 ± 0.143
IleGly: 3.219 ± 0.213
IleHis: 1.356 ± 0.153
IleIle: 2.918 ± 0.194
IleLys: 4.617 ± 0.33
IleLeu: 3.781 ± 0.249
IleMet: 1.397 ± 0.133
IleAsn: 3.493 ± 0.181
IlePro: 2.918 ± 0.211
IleGln: 2.192 ± 0.169
IleArg: 3.178 ± 0.186
IleSer: 4.082 ± 0.232
IleThr: 4.274 ± 0.264
IleVal: 3.397 ± 0.236
IleTrp: 0.438 ± 0.074
IleTyr: 1.986 ± 0.207
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.959 ± 0.314
LysCys: 0.384 ± 0.071
LysAsp: 4.274 ± 0.281
LysGlu: 5.712 ± 0.351
LysPhe: 2.301 ± 0.223
LysGly: 3.589 ± 0.275
LysHis: 1.438 ± 0.121
LysIle: 2.959 ± 0.24
LysLys: 2.836 ± 0.295
LysLeu: 6.219 ± 0.334
LysMet: 1.192 ± 0.135
LysAsn: 2.671 ± 0.196
LysPro: 2.589 ± 0.189
LysGln: 2.219 ± 0.227
LysArg: 3.041 ± 0.251
LysSer: 3.548 ± 0.282
LysThr: 3.863 ± 0.189
LysVal: 4.712 ± 0.243
LysTrp: 0.658 ± 0.095
LysTyr: 2.0 ± 0.153
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.11 ± 0.303
LeuCys: 0.74 ± 0.095
LeuAsp: 6.973 ± 0.314
LeuGlu: 7.096 ± 0.434
LeuPhe: 3.096 ± 0.209
LeuGly: 5.767 ± 0.294
LeuHis: 1.808 ± 0.162
LeuIle: 5.082 ± 0.327
LeuLys: 6.192 ± 0.402
LeuLeu: 7.713 ± 0.444
LeuMet: 2.521 ± 0.214
LeuAsn: 5.685 ± 0.301
LeuPro: 3.973 ± 0.217
LeuGln: 2.507 ± 0.227
LeuArg: 4.452 ± 0.306
LeuSer: 6.466 ± 0.369
LeuThr: 7.247 ± 0.319
LeuVal: 5.767 ± 0.267
LeuTrp: 0.822 ± 0.121
LeuTyr: 3.356 ± 0.217
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.589 ± 0.162
MetCys: 0.205 ± 0.052
MetAsp: 1.658 ± 0.172
MetGlu: 2.123 ± 0.178
MetPhe: 1.151 ± 0.125
MetGly: 1.753 ± 0.125
MetHis: 0.329 ± 0.085
MetIle: 1.671 ± 0.18
MetLys: 1.534 ± 0.15
MetLeu: 2.343 ± 0.21
MetMet: 0.562 ± 0.086
MetAsn: 1.562 ± 0.158
MetPro: 0.808 ± 0.095
MetGln: 0.822 ± 0.106
MetArg: 1.082 ± 0.108
MetSer: 2.438 ± 0.181
MetThr: 2.082 ± 0.154
MetVal: 1.603 ± 0.132
MetTrp: 0.288 ± 0.062
MetTyr: 0.822 ± 0.107
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.411 ± 0.241
AsnCys: 0.288 ± 0.068
AsnAsp: 3.041 ± 0.209
AsnGlu: 3.452 ± 0.219
AsnPhe: 2.233 ± 0.172
AsnGly: 4.069 ± 0.27
AsnHis: 1.178 ± 0.142
AsnIle: 2.836 ± 0.185
AsnLys: 2.932 ± 0.202
AsnLeu: 4.808 ± 0.323
AsnMet: 1.425 ± 0.146
AsnAsn: 2.808 ± 0.243
AsnPro: 2.795 ± 0.221
AsnGln: 2.329 ± 0.177
AsnArg: 2.754 ± 0.191
AsnSer: 3.014 ± 0.219
AsnThr: 3.123 ± 0.192
AsnVal: 3.781 ± 0.217
AsnTrp: 0.521 ± 0.086
AsnTyr: 1.959 ± 0.158
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.438 ± 0.234
ProCys: 0.205 ± 0.045
ProAsp: 3.069 ± 0.231
ProGlu: 3.795 ± 0.261
ProPhe: 1.74 ± 0.127
ProGly: 2.356 ± 0.219
ProHis: 0.836 ± 0.13
ProIle: 2.343 ± 0.179
ProLys: 2.849 ± 0.191
ProLeu: 3.014 ± 0.192
ProMet: 1.11 ± 0.115
ProAsn: 2.164 ± 0.138
ProPro: 1.288 ± 0.151
ProGln: 0.973 ± 0.137
ProArg: 1.685 ± 0.168
ProSer: 2.808 ± 0.203
ProThr: 3.041 ± 0.201
ProVal: 3.507 ± 0.292
ProTrp: 0.466 ± 0.08
ProTyr: 1.74 ± 0.187
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.206 ± 0.172
GlnCys: 0.301 ± 0.06
GlnAsp: 2.0 ± 0.16
GlnGlu: 2.219 ± 0.169
GlnPhe: 1.466 ± 0.133
GlnGly: 2.274 ± 0.154
GlnHis: 1.055 ± 0.158
GlnIle: 2.301 ± 0.165
GlnLys: 1.452 ± 0.158
GlnLeu: 3.726 ± 0.253
GlnMet: 1.096 ± 0.152
GlnAsn: 1.712 ± 0.144
GlnPro: 1.438 ± 0.169
GlnGln: 1.479 ± 0.189
GlnArg: 1.973 ± 0.188
GlnSer: 2.178 ± 0.173
GlnThr: 2.329 ± 0.197
GlnVal: 2.219 ± 0.159
GlnTrp: 0.438 ± 0.079
GlnTyr: 1.397 ± 0.136
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.986 ± 0.221
ArgCys: 0.521 ± 0.083
ArgAsp: 3.151 ± 0.211
ArgGlu: 3.26 ± 0.281
ArgPhe: 2.493 ± 0.161
ArgGly: 2.562 ± 0.174
ArgHis: 1.11 ± 0.124
ArgIle: 2.986 ± 0.209
ArgLys: 2.877 ± 0.248
ArgLeu: 5.137 ± 0.336
ArgMet: 1.384 ± 0.146
ArgAsn: 2.644 ± 0.195
ArgPro: 1.918 ± 0.176
ArgGln: 2.0 ± 0.173
ArgArg: 2.301 ± 0.208
ArgSer: 2.89 ± 0.226
ArgThr: 3.055 ± 0.206
ArgVal: 3.603 ± 0.238
ArgTrp: 0.74 ± 0.104
ArgTyr: 2.123 ± 0.183
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.343 ± 0.258
SerCys: 0.384 ± 0.077
SerAsp: 3.891 ± 0.227
SerGlu: 4.849 ± 0.274
SerPhe: 3.521 ± 0.223
SerGly: 4.644 ± 0.291
SerHis: 1.37 ± 0.187
SerIle: 3.849 ± 0.224
SerLys: 3.918 ± 0.279
SerLeu: 6.343 ± 0.4
SerMet: 1.658 ± 0.158
SerAsn: 3.288 ± 0.17
SerPro: 2.617 ± 0.22
SerGln: 1.918 ± 0.199
SerArg: 2.534 ± 0.173
SerSer: 3.712 ± 0.232
SerThr: 3.425 ± 0.228
SerVal: 4.836 ± 0.316
SerTrp: 0.712 ± 0.098
SerTyr: 2.712 ± 0.183
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.411 ± 0.286
ThrCys: 0.37 ± 0.074
ThrAsp: 3.904 ± 0.262
ThrGlu: 4.48 ± 0.245
ThrPhe: 3.247 ± 0.278
ThrGly: 4.356 ± 0.346
ThrHis: 1.384 ± 0.152
ThrIle: 4.274 ± 0.266
ThrLys: 3.754 ± 0.256
ThrLeu: 7.082 ± 0.372
ThrMet: 1.479 ± 0.159
ThrAsn: 3.206 ± 0.208
ThrPro: 3.548 ± 0.271
ThrGln: 2.137 ± 0.173
ThrArg: 2.918 ± 0.195
ThrSer: 4.206 ± 0.299
ThrThr: 4.48 ± 0.309
ThrVal: 5.247 ± 0.463
ThrTrp: 1.014 ± 0.172
ThrTyr: 2.754 ± 0.262
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.575 ± 0.211
ValCys: 0.521 ± 0.095
ValAsp: 4.425 ± 0.283
ValGlu: 5.302 ± 0.257
ValPhe: 2.562 ± 0.208
ValGly: 4.877 ± 0.341
ValHis: 1.548 ± 0.164
ValIle: 3.959 ± 0.256
ValLys: 5.493 ± 0.264
ValLeu: 5.343 ± 0.287
ValMet: 1.781 ± 0.148
ValAsn: 4.233 ± 0.254
ValPro: 3.055 ± 0.263
ValGln: 2.247 ± 0.17
ValArg: 3.411 ± 0.228
ValSer: 4.548 ± 0.248
ValThr: 5.658 ± 0.566
ValVal: 4.671 ± 0.462
ValTrp: 0.822 ± 0.124
ValTyr: 3.014 ± 0.25
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.562 ± 0.084
TrpCys: 0.26 ± 0.061
TrpAsp: 0.671 ± 0.086
TrpGlu: 0.904 ± 0.09
TrpPhe: 0.726 ± 0.101
TrpGly: 0.616 ± 0.083
TrpHis: 0.192 ± 0.049
TrpIle: 0.562 ± 0.084
TrpLys: 0.699 ± 0.082
TrpLeu: 1.329 ± 0.149
TrpMet: 0.438 ± 0.075
TrpAsn: 0.562 ± 0.105
TrpPro: 0.164 ± 0.04
TrpGln: 0.26 ± 0.059
TrpArg: 0.479 ± 0.091
TrpSer: 0.89 ± 0.101
TrpThr: 0.548 ± 0.089
TrpVal: 1.26 ± 0.138
TrpTrp: 0.123 ± 0.037
TrpTyr: 0.507 ± 0.083
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.918 ± 0.14
TyrCys: 0.301 ± 0.075
TyrAsp: 2.767 ± 0.211
TyrGlu: 2.452 ± 0.178
TyrPhe: 1.603 ± 0.133
TyrGly: 2.589 ± 0.199
TyrHis: 1.041 ± 0.13
TyrIle: 2.0 ± 0.163
TyrLys: 1.753 ± 0.153
TyrLeu: 3.301 ± 0.264
TyrMet: 1.069 ± 0.123
TyrAsn: 1.986 ± 0.159
TyrPro: 1.685 ± 0.13
TyrGln: 1.89 ± 0.165
TyrArg: 3.0 ± 0.213
TyrSer: 2.521 ± 0.222
TyrThr: 2.726 ± 0.177
TyrVal: 2.754 ± 0.201
TyrTrp: 0.548 ± 0.088
TyrTyr: 1.37 ± 0.137
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 198 proteins (72999 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
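A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the use of the standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions for illustration; the source does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (one-letter code) in a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKLLAEL", "MAELKK", "MLLEELA", "MKAEL"]  # made-up sequences
print(dipeptide_permille(toy, "EL"), bootstrap_error(toy, "EL"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.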

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski