Amino acid dipepetide frequency for Vibrio phage eugene 12A10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.762AlaAla: 3.762 ± 0.437
0.767AlaCys: 0.767 ± 0.142
3.217AlaAsp: 3.217 ± 0.327
4.578AlaGlu: 4.578 ± 0.361
2.079AlaPhe: 2.079 ± 0.248
2.92AlaGly: 2.92 ± 0.309
0.99AlaHis: 0.99 ± 0.129
2.97AlaIle: 2.97 ± 0.27
3.786AlaLys: 3.786 ± 0.347
4.009AlaLeu: 4.009 ± 0.314
1.287AlaMet: 1.287 ± 0.165
2.252AlaAsn: 2.252 ± 0.228
1.163AlaPro: 1.163 ± 0.178
1.633AlaGln: 1.633 ± 0.238
2.104AlaArg: 2.104 ± 0.188
3.539AlaSer: 3.539 ± 0.353
3.267AlaThr: 3.267 ± 0.317
2.995AlaVal: 2.995 ± 0.262
0.817AlaTrp: 0.817 ± 0.137
2.277AlaTyr: 2.277 ± 0.261
0.0AlaXaa: 0.0 ± 0.0
Cys
0.916CysAla: 0.916 ± 0.139
0.322CysCys: 0.322 ± 0.095
0.94CysAsp: 0.94 ± 0.165
1.732CysGlu: 1.732 ± 0.224
0.817CysPhe: 0.817 ± 0.139
1.312CysGly: 1.312 ± 0.204
0.52CysHis: 0.52 ± 0.13
1.287CysIle: 1.287 ± 0.162
1.683CysLys: 1.683 ± 0.259
0.99CysLeu: 0.99 ± 0.173
0.346CysMet: 0.346 ± 0.086
1.015CysAsn: 1.015 ± 0.191
0.841CysPro: 0.841 ± 0.161
0.495CysGln: 0.495 ± 0.104
0.792CysArg: 0.792 ± 0.17
1.534CysSer: 1.534 ± 0.252
1.138CysThr: 1.138 ± 0.188
0.841CysVal: 0.841 ± 0.132
0.272CysTrp: 0.272 ± 0.083
0.643CysTyr: 0.643 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
3.044AspAla: 3.044 ± 0.271
1.411AspCys: 1.411 ± 0.183
2.995AspAsp: 2.995 ± 0.335
4.504AspGlu: 4.504 ± 0.357
3.465AspPhe: 3.465 ± 0.299
5.049AspGly: 5.049 ± 0.4
1.188AspHis: 1.188 ± 0.189
4.059AspIle: 4.059 ± 0.347
5.346AspLys: 5.346 ± 0.388
5.816AspLeu: 5.816 ± 0.377
1.732AspMet: 1.732 ± 0.227
3.292AspAsn: 3.292 ± 0.259
2.054AspPro: 2.054 ± 0.191
1.708AspGln: 1.708 ± 0.213
2.475AspArg: 2.475 ± 0.242
3.539AspSer: 3.539 ± 0.325
3.885AspThr: 3.885 ± 0.288
4.133AspVal: 4.133 ± 0.349
1.51AspTrp: 1.51 ± 0.199
3.143AspTyr: 3.143 ± 0.319
0.0AspXaa: 0.0 ± 0.0
Glu
4.578GluAla: 4.578 ± 0.286
1.262GluCys: 1.262 ± 0.203
7.672GluAsp: 7.672 ± 0.504
7.746GluGlu: 7.746 ± 0.604
3.415GluPhe: 3.415 ± 0.237
5.593GluGly: 5.593 ± 0.339
1.633GluHis: 1.633 ± 0.205
4.529GluIle: 4.529 ± 0.328
4.95GluLys: 4.95 ± 0.387
6.88GluLeu: 6.88 ± 0.485
2.376GluMet: 2.376 ± 0.263
3.267GluAsn: 3.267 ± 0.282
1.534GluPro: 1.534 ± 0.222
3.069GluGln: 3.069 ± 0.271
3.143GluArg: 3.143 ± 0.316
4.306GluSer: 4.306 ± 0.36
3.96GluThr: 3.96 ± 0.277
7.078GluVal: 7.078 ± 0.43
1.312GluTrp: 1.312 ± 0.175
3.663GluTyr: 3.663 ± 0.297
0.0GluXaa: 0.0 ± 0.0
Phe
1.757PheAla: 1.757 ± 0.268
0.99PheCys: 0.99 ± 0.153
3.069PheAsp: 3.069 ± 0.266
3.316PheGlu: 3.316 ± 0.351
1.336PhePhe: 1.336 ± 0.185
2.549PheGly: 2.549 ± 0.22
0.891PheHis: 0.891 ± 0.147
3.242PheIle: 3.242 ± 0.322
3.91PheLys: 3.91 ± 0.314
3.316PheLeu: 3.316 ± 0.339
0.99PheMet: 0.99 ± 0.138
2.549PheAsn: 2.549 ± 0.219
1.51PhePro: 1.51 ± 0.195
1.361PheGln: 1.361 ± 0.203
1.46PheArg: 1.46 ± 0.205
3.267PheSer: 3.267 ± 0.292
3.391PheThr: 3.391 ± 0.335
2.079PheVal: 2.079 ± 0.257
0.619PheTrp: 0.619 ± 0.136
1.782PheTyr: 1.782 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
2.772GlyAla: 2.772 ± 0.297
1.609GlyCys: 1.609 ± 0.193
3.663GlyAsp: 3.663 ± 0.293
4.43GlyGlu: 4.43 ± 0.307
3.044GlyPhe: 3.044 ± 0.216
3.91GlyGly: 3.91 ± 0.446
1.262GlyHis: 1.262 ± 0.163
3.341GlyIle: 3.341 ± 0.322
6.113GlyLys: 6.113 ± 0.365
5.692GlyLeu: 5.692 ± 0.377
1.51GlyMet: 1.51 ± 0.186
3.564GlyAsn: 3.564 ± 0.29
0.272GlyPro: 0.272 ± 0.087
1.955GlyGln: 1.955 ± 0.239
2.97GlyArg: 2.97 ± 0.258
4.479GlySer: 4.479 ± 0.324
3.687GlyThr: 3.687 ± 0.341
4.801GlyVal: 4.801 ± 0.345
1.559GlyTrp: 1.559 ± 0.205
2.673GlyTyr: 2.673 ± 0.316
0.0GlyXaa: 0.0 ± 0.0
His
0.99HisAla: 0.99 ± 0.198
0.495HisCys: 0.495 ± 0.121
1.188HisAsp: 1.188 ± 0.176
1.584HisGlu: 1.584 ± 0.203
1.163HisPhe: 1.163 ± 0.186
1.609HisGly: 1.609 ± 0.192
0.619HisHis: 0.619 ± 0.117
1.213HisIle: 1.213 ± 0.186
1.633HisLys: 1.633 ± 0.176
1.807HisLeu: 1.807 ± 0.203
0.569HisMet: 0.569 ± 0.118
1.336HisAsn: 1.336 ± 0.17
1.163HisPro: 1.163 ± 0.199
0.495HisGln: 0.495 ± 0.116
0.841HisArg: 0.841 ± 0.162
1.262HisSer: 1.262 ± 0.192
1.262HisThr: 1.262 ± 0.178
0.99HisVal: 0.99 ± 0.154
0.346HisTrp: 0.346 ± 0.095
0.94HisTyr: 0.94 ± 0.161
0.0HisXaa: 0.0 ± 0.0
Ile
3.118IleAla: 3.118 ± 0.269
0.792IleCys: 0.792 ± 0.127
4.281IleAsp: 4.281 ± 0.333
4.455IleGlu: 4.455 ± 0.31
2.227IlePhe: 2.227 ± 0.241
3.366IleGly: 3.366 ± 0.3
1.262IleHis: 1.262 ± 0.176
2.97IleIle: 2.97 ± 0.29
5.222IleLys: 5.222 ± 0.342
4.851IleLeu: 4.851 ± 0.359
1.089IleMet: 1.089 ± 0.143
3.069IleAsn: 3.069 ± 0.329
2.623IlePro: 2.623 ± 0.263
1.881IleGln: 1.881 ± 0.228
2.45IleArg: 2.45 ± 0.23
3.514IleSer: 3.514 ± 0.293
3.91IleThr: 3.91 ± 0.356
3.143IleVal: 3.143 ± 0.288
0.99IleTrp: 0.99 ± 0.162
2.104IleTyr: 2.104 ± 0.235
0.0IleXaa: 0.0 ± 0.0
Lys
4.133LysAla: 4.133 ± 0.434
1.114LysCys: 1.114 ± 0.174
5.222LysAsp: 5.222 ± 0.366
6.286LysGlu: 6.286 ± 0.528
3.762LysPhe: 3.762 ± 0.278
5.321LysGly: 5.321 ± 0.355
2.128LysHis: 2.128 ± 0.228
4.281LysIle: 4.281 ± 0.324
5.42LysLys: 5.42 ± 0.473
6.855LysLeu: 6.855 ± 0.38
2.747LysMet: 2.747 ± 0.292
2.995LysAsn: 2.995 ± 0.289
2.203LysPro: 2.203 ± 0.287
3.292LysGln: 3.292 ± 0.287
3.687LysArg: 3.687 ± 0.311
5.024LysSer: 5.024 ± 0.349
4.628LysThr: 4.628 ± 0.354
6.657LysVal: 6.657 ± 0.451
1.534LysTrp: 1.534 ± 0.218
3.118LysTyr: 3.118 ± 0.289
0.0LysXaa: 0.0 ± 0.0
Leu
4.059LeuAla: 4.059 ± 0.341
1.658LeuCys: 1.658 ± 0.187
5.742LeuAsp: 5.742 ± 0.33
8.068LeuGlu: 8.068 ± 0.424
3.588LeuPhe: 3.588 ± 0.275
4.801LeuGly: 4.801 ± 0.317
2.227LeuHis: 2.227 ± 0.249
3.96LeuIle: 3.96 ± 0.338
7.177LeuLys: 7.177 ± 0.431
7.177LeuLeu: 7.177 ± 0.446
2.252LeuMet: 2.252 ± 0.223
3.762LeuAsn: 3.762 ± 0.268
2.995LeuPro: 2.995 ± 0.288
3.217LeuGln: 3.217 ± 0.239
3.663LeuArg: 3.663 ± 0.312
6.336LeuSer: 6.336 ± 0.377
5.321LeuThr: 5.321 ± 0.381
4.851LeuVal: 4.851 ± 0.355
1.089LeuTrp: 1.089 ± 0.13
2.772LeuTyr: 2.772 ± 0.268
0.0LeuXaa: 0.0 ± 0.0
Met
1.683MetAla: 1.683 ± 0.2
0.223MetCys: 0.223 ± 0.074
1.361MetAsp: 1.361 ± 0.181
2.277MetGlu: 2.277 ± 0.254
1.262MetPhe: 1.262 ± 0.171
1.287MetGly: 1.287 ± 0.169
0.297MetHis: 0.297 ± 0.09
1.559MetIle: 1.559 ± 0.184
2.401MetLys: 2.401 ± 0.247
2.475MetLeu: 2.475 ± 0.244
0.866MetMet: 0.866 ± 0.142
1.138MetAsn: 1.138 ± 0.16
0.693MetPro: 0.693 ± 0.13
0.792MetGln: 0.792 ± 0.126
0.916MetArg: 0.916 ± 0.184
2.005MetSer: 2.005 ± 0.232
1.534MetThr: 1.534 ± 0.221
1.262MetVal: 1.262 ± 0.201
0.223MetTrp: 0.223 ± 0.075
1.089MetTyr: 1.089 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
2.252AsnAla: 2.252 ± 0.222
0.866AsnCys: 0.866 ± 0.141
2.203AsnAsp: 2.203 ± 0.252
2.92AsnGlu: 2.92 ± 0.269
2.475AsnPhe: 2.475 ± 0.213
3.267AsnGly: 3.267 ± 0.281
0.891AsnHis: 0.891 ± 0.151
3.391AsnIle: 3.391 ± 0.359
4.331AsnLys: 4.331 ± 0.337
4.628AsnLeu: 4.628 ± 0.305
1.015AsnMet: 1.015 ± 0.145
2.846AsnAsn: 2.846 ± 0.308
2.277AsnPro: 2.277 ± 0.267
1.312AsnGln: 1.312 ± 0.167
2.401AsnArg: 2.401 ± 0.247
3.069AsnSer: 3.069 ± 0.254
3.564AsnThr: 3.564 ± 0.313
2.722AsnVal: 2.722 ± 0.27
0.619AsnTrp: 0.619 ± 0.139
2.351AsnTyr: 2.351 ± 0.213
0.0AsnXaa: 0.0 ± 0.0
Pro
1.435ProAla: 1.435 ± 0.176
0.594ProCys: 0.594 ± 0.161
2.5ProAsp: 2.5 ± 0.288
3.44ProGlu: 3.44 ± 0.322
1.559ProPhe: 1.559 ± 0.166
0.198ProGly: 0.198 ± 0.087
0.817ProHis: 0.817 ± 0.133
1.361ProIle: 1.361 ± 0.203
2.549ProLys: 2.549 ± 0.254
1.881ProLeu: 1.881 ± 0.286
0.421ProMet: 0.421 ± 0.094
1.46ProAsn: 1.46 ± 0.214
0.965ProPro: 0.965 ± 0.169
0.94ProGln: 0.94 ± 0.154
1.064ProArg: 1.064 ± 0.16
2.277ProSer: 2.277 ± 0.266
2.079ProThr: 2.079 ± 0.215
2.178ProVal: 2.178 ± 0.256
0.495ProTrp: 0.495 ± 0.121
1.856ProTyr: 1.856 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
2.153GlnAla: 2.153 ± 0.26
0.52GlnCys: 0.52 ± 0.093
2.104GlnAsp: 2.104 ± 0.27
3.267GlnGlu: 3.267 ± 0.276
1.163GlnPhe: 1.163 ± 0.191
2.104GlnGly: 2.104 ± 0.217
1.015GlnHis: 1.015 ± 0.149
1.609GlnIle: 1.609 ± 0.202
2.277GlnLys: 2.277 ± 0.216
2.846GlnLeu: 2.846 ± 0.297
1.039GlnMet: 1.039 ± 0.165
1.51GlnAsn: 1.51 ± 0.21
1.114GlnPro: 1.114 ± 0.149
1.188GlnGln: 1.188 ± 0.181
1.609GlnArg: 1.609 ± 0.27
1.732GlnSer: 1.732 ± 0.184
1.633GlnThr: 1.633 ± 0.207
2.549GlnVal: 2.549 ± 0.313
0.619GlnTrp: 0.619 ± 0.124
1.559GlnTyr: 1.559 ± 0.21
0.0GlnXaa: 0.0 ± 0.0
Arg
1.93ArgAla: 1.93 ± 0.219
0.965ArgCys: 0.965 ± 0.19
2.227ArgAsp: 2.227 ± 0.243
3.811ArgGlu: 3.811 ± 0.377
1.633ArgPhe: 1.633 ± 0.207
2.673ArgGly: 2.673 ± 0.276
0.891ArgHis: 0.891 ± 0.175
2.871ArgIle: 2.871 ± 0.294
3.564ArgLys: 3.564 ± 0.305
3.316ArgLeu: 3.316 ± 0.355
1.312ArgMet: 1.312 ± 0.213
2.326ArgAsn: 2.326 ± 0.257
1.114ArgPro: 1.114 ± 0.156
1.633ArgGln: 1.633 ± 0.197
1.856ArgArg: 1.856 ± 0.231
3.069ArgSer: 3.069 ± 0.271
1.856ArgThr: 1.856 ± 0.227
2.747ArgVal: 2.747 ± 0.258
0.965ArgTrp: 0.965 ± 0.115
1.633ArgTyr: 1.633 ± 0.2
0.0ArgXaa: 0.0 ± 0.0
Ser
3.465SerAla: 3.465 ± 0.369
1.336SerCys: 1.336 ± 0.212
3.91SerAsp: 3.91 ± 0.311
5.445SerGlu: 5.445 ± 0.417
2.846SerPhe: 2.846 ± 0.286
4.999SerGly: 4.999 ± 0.332
1.089SerHis: 1.089 ± 0.163
3.539SerIle: 3.539 ± 0.337
5.766SerLys: 5.766 ± 0.417
4.974SerLeu: 4.974 ± 0.369
1.435SerMet: 1.435 ± 0.17
3.638SerAsn: 3.638 ± 0.326
1.633SerPro: 1.633 ± 0.2
2.302SerGln: 2.302 ± 0.243
2.722SerArg: 2.722 ± 0.248
4.232SerSer: 4.232 ± 0.368
3.341SerThr: 3.341 ± 0.291
4.653SerVal: 4.653 ± 0.371
0.866SerTrp: 0.866 ± 0.146
3.094SerTyr: 3.094 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
2.747ThrAla: 2.747 ± 0.322
0.916ThrCys: 0.916 ± 0.14
3.391ThrAsp: 3.391 ± 0.301
4.281ThrGlu: 4.281 ± 0.372
2.698ThrPhe: 2.698 ± 0.288
4.702ThrGly: 4.702 ± 0.37
1.188ThrHis: 1.188 ± 0.204
4.257ThrIle: 4.257 ± 0.338
4.702ThrLys: 4.702 ± 0.351
5.915ThrLeu: 5.915 ± 0.37
1.188ThrMet: 1.188 ± 0.175
2.401ThrAsn: 2.401 ± 0.25
2.574ThrPro: 2.574 ± 0.294
1.93ThrGln: 1.93 ± 0.225
2.351ThrArg: 2.351 ± 0.228
3.44ThrSer: 3.44 ± 0.306
3.861ThrThr: 3.861 ± 0.373
4.43ThrVal: 4.43 ± 0.341
0.792ThrTrp: 0.792 ± 0.126
3.118ThrTyr: 3.118 ± 0.289
0.0ThrXaa: 0.0 ± 0.0
Val
3.094ValAla: 3.094 ± 0.317
1.411ValCys: 1.411 ± 0.183
4.578ValAsp: 4.578 ± 0.349
5.816ValGlu: 5.816 ± 0.414
2.549ValPhe: 2.549 ± 0.259
4.083ValGly: 4.083 ± 0.307
1.188ValHis: 1.188 ± 0.201
3.588ValIle: 3.588 ± 0.284
4.974ValLys: 4.974 ± 0.39
5.766ValLeu: 5.766 ± 0.4
1.757ValMet: 1.757 ± 0.254
3.489ValAsn: 3.489 ± 0.299
1.807ValPro: 1.807 ± 0.223
2.029ValGln: 2.029 ± 0.265
2.747ValArg: 2.747 ± 0.221
4.405ValSer: 4.405 ± 0.302
4.232ValThr: 4.232 ± 0.345
5.123ValVal: 5.123 ± 0.367
1.163ValTrp: 1.163 ± 0.172
3.168ValTyr: 3.168 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
0.742TrpAla: 0.742 ± 0.143
0.272TrpCys: 0.272 ± 0.092
1.312TrpAsp: 1.312 ± 0.181
1.336TrpGlu: 1.336 ± 0.179
0.643TrpPhe: 0.643 ± 0.151
0.817TrpGly: 0.817 ± 0.153
0.371TrpHis: 0.371 ± 0.091
0.866TrpIle: 0.866 ± 0.167
1.559TrpLys: 1.559 ± 0.232
1.633TrpLeu: 1.633 ± 0.23
0.643TrpMet: 0.643 ± 0.112
0.866TrpAsn: 0.866 ± 0.155
0.099TrpPro: 0.099 ± 0.047
0.52TrpGln: 0.52 ± 0.123
0.742TrpArg: 0.742 ± 0.118
1.039TrpSer: 1.039 ± 0.179
0.792TrpThr: 0.792 ± 0.136
1.411TrpVal: 1.411 ± 0.165
0.346TrpTrp: 0.346 ± 0.101
0.668TrpTyr: 0.668 ± 0.134
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.782TyrAla: 1.782 ± 0.207
0.916TyrCys: 0.916 ± 0.139
2.772TyrAsp: 2.772 ± 0.292
2.673TyrGlu: 2.673 ± 0.293
1.658TyrPhe: 1.658 ± 0.225
2.747TyrGly: 2.747 ± 0.287
1.039TyrHis: 1.039 ± 0.181
2.425TyrIle: 2.425 ± 0.232
3.069TyrLys: 3.069 ± 0.298
3.885TyrLeu: 3.885 ± 0.29
0.742TyrMet: 0.742 ± 0.125
2.623TyrAsn: 2.623 ± 0.256
1.435TyrPro: 1.435 ± 0.189
1.856TyrGln: 1.856 ± 0.185
2.401TyrArg: 2.401 ± 0.231
3.217TyrSer: 3.217 ± 0.315
3.539TyrThr: 3.539 ± 0.299
2.326TyrVal: 2.326 ± 0.189
0.544TyrTrp: 0.544 ± 0.114
1.93TyrTyr: 1.93 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 250 proteins (40408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski