Amino acid dipepetide frequency for Fusobacterium phage Fnu1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.288AlaCys: 0.288 ± 0.098
1.542AlaAsp: 1.542 ± 0.181
1.856AlaGlu: 1.856 ± 0.258
1.386AlaPhe: 1.386 ± 0.253
1.203AlaGly: 1.203 ± 0.266
0.34AlaHis: 0.34 ± 0.089
4.13AlaIle: 4.13 ± 0.391
3.686AlaLys: 3.686 ± 0.375
3.608AlaLeu: 3.608 ± 0.403
0.941AlaMet: 0.941 ± 0.159
2.248AlaAsn: 2.248 ± 0.206
0.654AlaPro: 0.654 ± 0.127
1.046AlaGln: 1.046 ± 0.216
1.386AlaArg: 1.386 ± 0.258
1.725AlaSer: 1.725 ± 0.229
2.248AlaThr: 2.248 ± 0.338
1.176AlaVal: 1.176 ± 0.238
0.314AlaTrp: 0.314 ± 0.088
1.778AlaTyr: 1.778 ± 0.195
0.0AlaXaa: 0.0 ± 0.0
Cys
0.261CysAla: 0.261 ± 0.068
0.235CysCys: 0.235 ± 0.079
0.627CysAsp: 0.627 ± 0.138
0.68CysGlu: 0.68 ± 0.133
0.523CysPhe: 0.523 ± 0.167
0.784CysGly: 0.784 ± 0.168
0.183CysHis: 0.183 ± 0.072
0.915CysIle: 0.915 ± 0.167
1.464CysLys: 1.464 ± 0.248
1.124CysLeu: 1.124 ± 0.172
0.418CysMet: 0.418 ± 0.116
0.941CysAsn: 0.941 ± 0.145
0.392CysPro: 0.392 ± 0.113
0.366CysGln: 0.366 ± 0.111
0.444CysArg: 0.444 ± 0.114
0.575CysSer: 0.575 ± 0.161
0.627CysThr: 0.627 ± 0.127
0.68CysVal: 0.68 ± 0.139
0.052CysTrp: 0.052 ± 0.038
0.889CysTyr: 0.889 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
0.81AspAla: 0.81 ± 0.122
0.863AspCys: 0.863 ± 0.153
3.006AspAsp: 3.006 ± 0.306
4.34AspGlu: 4.34 ± 0.348
3.555AspPhe: 3.555 ± 0.292
2.039AspGly: 2.039 ± 0.212
0.34AspHis: 0.34 ± 0.081
7.346AspIle: 7.346 ± 0.545
7.424AspLys: 7.424 ± 0.492
6.326AspLeu: 6.326 ± 0.452
2.013AspMet: 2.013 ± 0.226
4.889AspAsn: 4.889 ± 0.393
0.758AspPro: 0.758 ± 0.165
0.418AspGln: 0.418 ± 0.095
2.483AspArg: 2.483 ± 0.232
3.215AspSer: 3.215 ± 0.303
3.764AspThr: 3.764 ± 0.301
3.372AspVal: 3.372 ± 0.298
0.889AspTrp: 0.889 ± 0.157
3.764AspTyr: 3.764 ± 0.364
0.026AspXaa: 0.026 ± 0.025
Glu
1.934GluAla: 1.934 ± 0.256
0.784GluCys: 0.784 ± 0.154
4.47GluAsp: 4.47 ± 0.355
6.509GluGlu: 6.509 ± 0.55
3.817GluPhe: 3.817 ± 0.351
2.483GluGly: 2.483 ± 0.269
0.915GluHis: 0.915 ± 0.2
8.731GluIle: 8.731 ± 0.567
7.581GluLys: 7.581 ± 0.501
9.725GluLeu: 9.725 ± 0.59
1.961GluMet: 1.961 ± 0.247
7.084GluAsn: 7.084 ± 0.537
1.542GluPro: 1.542 ± 0.252
3.268GluGln: 3.268 ± 0.333
2.405GluArg: 2.405 ± 0.216
3.503GluSer: 3.503 ± 0.29
3.372GluThr: 3.372 ± 0.274
4.183GluVal: 4.183 ± 0.299
0.837GluTrp: 0.837 ± 0.17
5.124GluTyr: 5.124 ± 0.334
0.0GluXaa: 0.0 ± 0.0
Phe
1.229PheAla: 1.229 ± 0.194
0.523PheCys: 0.523 ± 0.117
3.215PheAsp: 3.215 ± 0.316
3.032PheGlu: 3.032 ± 0.305
1.333PhePhe: 1.333 ± 0.18
1.987PheGly: 1.987 ± 0.223
0.732PheHis: 0.732 ± 0.135
3.947PheIle: 3.947 ± 0.381
5.071PheLys: 5.071 ± 0.342
4.157PheLeu: 4.157 ± 0.393
1.15PheMet: 1.15 ± 0.156
3.66PheAsn: 3.66 ± 0.24
0.758PhePro: 0.758 ± 0.121
0.837PheGln: 0.837 ± 0.161
1.699PheArg: 1.699 ± 0.229
2.588PheSer: 2.588 ± 0.313
2.562PheThr: 2.562 ± 0.275
2.117PheVal: 2.117 ± 0.237
0.471PheTrp: 0.471 ± 0.128
2.091PheTyr: 2.091 ± 0.221
0.0PheXaa: 0.0 ± 0.0
Gly
1.124GlyAla: 1.124 ± 0.173
0.706GlyCys: 0.706 ± 0.152
2.379GlyAsp: 2.379 ± 0.291
2.588GlyGlu: 2.588 ± 0.231
1.725GlyPhe: 1.725 ± 0.202
1.83GlyGly: 1.83 ± 0.23
0.471GlyHis: 0.471 ± 0.109
3.503GlyIle: 3.503 ± 0.352
4.836GlyLys: 4.836 ± 0.382
4.104GlyLeu: 4.104 ± 0.456
1.072GlyMet: 1.072 ± 0.205
3.425GlyAsn: 3.425 ± 0.308
0.026GlyPro: 0.026 ± 0.024
1.778GlyGln: 1.778 ± 0.268
1.856GlyArg: 1.856 ± 0.242
2.457GlySer: 2.457 ± 0.274
2.876GlyThr: 2.876 ± 0.277
3.163GlyVal: 3.163 ± 0.251
0.471GlyTrp: 0.471 ± 0.097
2.823GlyTyr: 2.823 ± 0.27
0.0GlyXaa: 0.0 ± 0.0
His
0.235HisAla: 0.235 ± 0.07
0.209HisCys: 0.209 ± 0.074
0.288HisAsp: 0.288 ± 0.08
0.34HisGlu: 0.34 ± 0.095
0.941HisPhe: 0.941 ± 0.158
0.105HisGly: 0.105 ± 0.05
0.157HisHis: 0.157 ± 0.063
1.673HisIle: 1.673 ± 0.237
1.542HisLys: 1.542 ± 0.226
1.046HisLeu: 1.046 ± 0.162
0.078HisMet: 0.078 ± 0.043
0.941HisAsn: 0.941 ± 0.146
0.314HisPro: 0.314 ± 0.088
0.444HisGln: 0.444 ± 0.106
0.523HisArg: 0.523 ± 0.117
1.098HisSer: 1.098 ± 0.176
0.915HisThr: 0.915 ± 0.168
0.0HisVal: 0.0 ± 0.0
0.157HisTrp: 0.157 ± 0.067
0.784HisTyr: 0.784 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
2.876IleAla: 2.876 ± 0.291
1.281IleCys: 1.281 ± 0.19
6.379IleAsp: 6.379 ± 0.419
7.346IleGlu: 7.346 ± 0.496
3.921IlePhe: 3.921 ± 0.36
4.287IleGly: 4.287 ± 0.365
1.203IleHis: 1.203 ± 0.198
7.686IleIle: 7.686 ± 0.672
9.62IleLys: 9.62 ± 0.498
8.757IleLeu: 8.757 ± 0.597
1.908IleMet: 1.908 ± 0.246
7.424IleAsn: 7.424 ± 0.493
2.562IlePro: 2.562 ± 0.256
3.608IleGln: 3.608 ± 0.291
3.346IleArg: 3.346 ± 0.317
5.019IleSer: 5.019 ± 0.43
4.889IleThr: 4.889 ± 0.396
4.078IleVal: 4.078 ± 0.362
0.575IleTrp: 0.575 ± 0.124
5.176IleTyr: 5.176 ± 0.437
0.026IleXaa: 0.026 ± 0.025
Lys
4.915LysAla: 4.915 ± 0.455
1.124LysCys: 1.124 ± 0.182
8.235LysAsp: 8.235 ± 0.418
11.973LysGlu: 11.973 ± 0.681
3.921LysPhe: 3.921 ± 0.346
5.098LysGly: 5.098 ± 0.342
1.281LysHis: 1.281 ± 0.172
8.627LysIle: 8.627 ± 0.518
8.888LysLys: 8.888 ± 0.609
10.378LysLeu: 10.378 ± 0.634
2.353LysMet: 2.353 ± 0.247
6.666LysAsn: 6.666 ± 0.457
1.856LysPro: 1.856 ± 0.208
3.895LysGln: 3.895 ± 0.397
3.608LysArg: 3.608 ± 0.323
4.81LysSer: 4.81 ± 0.357
6.614LysThr: 6.614 ± 0.447
5.359LysVal: 5.359 ± 0.374
1.098LysTrp: 1.098 ± 0.154
6.509LysTyr: 6.509 ± 0.515
0.0LysXaa: 0.0 ± 0.0
Leu
3.555LeuAla: 3.555 ± 0.313
1.124LeuCys: 1.124 ± 0.187
7.45LeuAsp: 7.45 ± 0.427
9.437LeuGlu: 9.437 ± 0.541
3.66LeuPhe: 3.66 ± 0.317
5.071LeuGly: 5.071 ± 0.495
1.229LeuHis: 1.229 ± 0.192
7.111LeuIle: 7.111 ± 0.422
10.378LeuLys: 10.378 ± 0.62
7.895LeuLeu: 7.895 ± 0.479
2.457LeuMet: 2.457 ± 0.27
7.581LeuAsn: 7.581 ± 0.463
2.117LeuPro: 2.117 ± 0.251
3.817LeuGln: 3.817 ± 0.424
2.98LeuArg: 2.98 ± 0.277
5.83LeuSer: 5.83 ± 0.464
4.836LeuThr: 4.836 ± 0.391
5.071LeuVal: 5.071 ± 0.407
0.732LeuTrp: 0.732 ± 0.132
4.784LeuTyr: 4.784 ± 0.399
0.0LeuXaa: 0.0 ± 0.0
Met
1.333MetAla: 1.333 ± 0.203
0.288MetCys: 0.288 ± 0.08
0.784MetAsp: 0.784 ± 0.139
1.934MetGlu: 1.934 ± 0.242
1.15MetPhe: 1.15 ± 0.184
0.68MetGly: 0.68 ± 0.147
0.235MetHis: 0.235 ± 0.08
1.699MetIle: 1.699 ± 0.208
3.189MetLys: 3.189 ± 0.296
2.536MetLeu: 2.536 ± 0.274
0.366MetMet: 0.366 ± 0.111
1.856MetAsn: 1.856 ± 0.229
0.444MetPro: 0.444 ± 0.098
0.915MetGln: 0.915 ± 0.145
1.02MetArg: 1.02 ± 0.161
1.778MetSer: 1.778 ± 0.357
1.281MetThr: 1.281 ± 0.201
1.046MetVal: 1.046 ± 0.158
0.235MetTrp: 0.235 ± 0.065
1.203MetTyr: 1.203 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
2.666AsnAla: 2.666 ± 0.261
0.863AsnCys: 0.863 ± 0.144
2.849AsnAsp: 2.849 ± 0.256
5.333AsnGlu: 5.333 ± 0.365
2.954AsnPhe: 2.954 ± 0.267
3.634AsnGly: 3.634 ± 0.369
0.732AsnHis: 0.732 ± 0.131
8.601AsnIle: 8.601 ± 0.484
8.888AsnLys: 8.888 ± 0.499
7.058AsnLeu: 7.058 ± 0.561
1.83AsnMet: 1.83 ± 0.198
6.169AsnAsn: 6.169 ± 0.637
1.49AsnPro: 1.49 ± 0.208
2.379AsnGln: 2.379 ± 0.278
2.954AsnArg: 2.954 ± 0.312
4.706AsnSer: 4.706 ± 0.367
4.418AsnThr: 4.418 ± 0.362
3.346AsnVal: 3.346 ± 0.328
0.889AsnTrp: 0.889 ± 0.115
4.889AsnTyr: 4.889 ± 0.376
0.0AsnXaa: 0.0 ± 0.0
Pro
0.575ProAla: 0.575 ± 0.11
0.261ProCys: 0.261 ± 0.091
0.575ProAsp: 0.575 ± 0.122
0.889ProGlu: 0.889 ± 0.132
1.203ProPhe: 1.203 ± 0.168
0.052ProGly: 0.052 ± 0.034
0.157ProHis: 0.157 ± 0.069
1.882ProIle: 1.882 ± 0.24
2.3ProLys: 2.3 ± 0.218
1.673ProLeu: 1.673 ± 0.216
0.627ProMet: 0.627 ± 0.125
1.882ProAsn: 1.882 ± 0.211
0.314ProPro: 0.314 ± 0.091
0.837ProGln: 0.837 ± 0.144
0.784ProArg: 0.784 ± 0.133
1.176ProSer: 1.176 ± 0.211
1.307ProThr: 1.307 ± 0.186
0.732ProVal: 0.732 ± 0.155
0.0ProTrp: 0.0 ± 0.0
1.673ProTyr: 1.673 ± 0.204
0.026ProXaa: 0.026 ± 0.025
Gln
1.386GlnAla: 1.386 ± 0.208
0.549GlnCys: 0.549 ± 0.137
2.013GlnAsp: 2.013 ± 0.204
3.163GlnGlu: 3.163 ± 0.305
1.072GlnPhe: 1.072 ± 0.174
2.065GlnGly: 2.065 ± 0.246
0.471GlnHis: 0.471 ± 0.123
2.902GlnIle: 2.902 ± 0.33
3.451GlnLys: 3.451 ± 0.3
3.189GlnLeu: 3.189 ± 0.382
0.837GlnMet: 0.837 ± 0.156
2.17GlnAsn: 2.17 ± 0.217
0.863GlnPro: 0.863 ± 0.143
1.359GlnGln: 1.359 ± 0.21
1.15GlnArg: 1.15 ± 0.188
1.908GlnSer: 1.908 ± 0.21
2.144GlnThr: 2.144 ± 0.317
1.673GlnVal: 1.673 ± 0.223
0.471GlnTrp: 0.471 ± 0.131
1.751GlnTyr: 1.751 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
1.072ArgAla: 1.072 ± 0.204
0.68ArgCys: 0.68 ± 0.155
2.588ArgAsp: 2.588 ± 0.28
3.974ArgGlu: 3.974 ± 0.363
1.359ArgPhe: 1.359 ± 0.188
1.856ArgGly: 1.856 ± 0.232
0.654ArgHis: 0.654 ± 0.155
2.849ArgIle: 2.849 ± 0.274
3.608ArgLys: 3.608 ± 0.346
3.294ArgLeu: 3.294 ± 0.324
0.837ArgMet: 0.837 ± 0.149
2.3ArgAsn: 2.3 ± 0.291
0.706ArgPro: 0.706 ± 0.133
1.359ArgGln: 1.359 ± 0.194
0.967ArgArg: 0.967 ± 0.183
1.046ArgSer: 1.046 ± 0.183
1.725ArgThr: 1.725 ± 0.229
2.327ArgVal: 2.327 ± 0.205
0.314ArgTrp: 0.314 ± 0.11
2.117ArgTyr: 2.117 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
1.647SerAla: 1.647 ± 0.28
0.366SerCys: 0.366 ± 0.098
3.425SerAsp: 3.425 ± 0.403
3.738SerGlu: 3.738 ± 0.309
2.196SerPhe: 2.196 ± 0.243
2.3SerGly: 2.3 ± 0.263
0.471SerHis: 0.471 ± 0.1
4.993SerIle: 4.993 ± 0.433
7.032SerLys: 7.032 ± 0.392
5.098SerLeu: 5.098 ± 0.469
1.49SerMet: 1.49 ± 0.258
4.026SerAsn: 4.026 ± 0.37
0.523SerPro: 0.523 ± 0.131
1.699SerGln: 1.699 ± 0.226
1.83SerArg: 1.83 ± 0.203
2.849SerSer: 2.849 ± 0.368
2.745SerThr: 2.745 ± 0.337
3.137SerVal: 3.137 ± 0.315
0.68SerTrp: 0.68 ± 0.157
3.215SerTyr: 3.215 ± 0.259
0.026SerXaa: 0.026 ± 0.025
Thr
1.621ThrAla: 1.621 ± 0.357
0.444ThrCys: 0.444 ± 0.122
3.581ThrAsp: 3.581 ± 0.283
4.209ThrGlu: 4.209 ± 0.387
2.64ThrPhe: 2.64 ± 0.291
2.483ThrGly: 2.483 ± 0.398
0.837ThrHis: 0.837 ± 0.158
5.019ThrIle: 5.019 ± 0.357
5.647ThrLys: 5.647 ± 0.327
5.542ThrLeu: 5.542 ± 0.387
0.837ThrMet: 0.837 ± 0.135
3.843ThrAsn: 3.843 ± 0.366
1.229ThrPro: 1.229 ± 0.211
2.64ThrGln: 2.64 ± 0.425
2.222ThrArg: 2.222 ± 0.242
2.823ThrSer: 2.823 ± 0.307
2.849ThrThr: 2.849 ± 0.33
3.189ThrVal: 3.189 ± 0.285
0.471ThrTrp: 0.471 ± 0.113
2.849ThrTyr: 2.849 ± 0.235
0.026ThrXaa: 0.026 ± 0.025
Val
2.091ValAla: 2.091 ± 0.273
0.627ValCys: 0.627 ± 0.132
3.738ValAsp: 3.738 ± 0.316
4.052ValGlu: 4.052 ± 0.286
2.562ValPhe: 2.562 ± 0.273
2.64ValGly: 2.64 ± 0.322
0.575ValHis: 0.575 ± 0.126
4.0ValIle: 4.0 ± 0.366
5.254ValLys: 5.254 ± 0.364
5.359ValLeu: 5.359 ± 0.362
1.516ValMet: 1.516 ± 0.186
3.791ValAsn: 3.791 ± 0.309
1.359ValPro: 1.359 ± 0.209
1.438ValGln: 1.438 ± 0.187
1.673ValArg: 1.673 ± 0.167
3.006ValSer: 3.006 ± 0.346
1.908ValThr: 1.908 ± 0.268
3.111ValVal: 3.111 ± 0.29
0.575ValTrp: 0.575 ± 0.15
3.294ValTyr: 3.294 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
0.261TrpAla: 0.261 ± 0.077
0.261TrpCys: 0.261 ± 0.09
0.627TrpAsp: 0.627 ± 0.131
0.784TrpGlu: 0.784 ± 0.155
0.444TrpPhe: 0.444 ± 0.103
0.523TrpGly: 0.523 ± 0.118
0.078TrpHis: 0.078 ± 0.049
0.654TrpIle: 0.654 ± 0.113
0.837TrpLys: 0.837 ± 0.169
1.072TrpLeu: 1.072 ± 0.155
0.131TrpMet: 0.131 ± 0.059
0.889TrpAsn: 0.889 ± 0.174
0.0TrpPro: 0.0 ± 0.0
0.497TrpGln: 0.497 ± 0.103
0.392TrpArg: 0.392 ± 0.102
0.392TrpSer: 0.392 ± 0.101
0.497TrpThr: 0.497 ± 0.121
0.863TrpVal: 0.863 ± 0.18
0.105TrpTrp: 0.105 ± 0.051
0.706TrpTyr: 0.706 ± 0.133
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.987TyrAla: 1.987 ± 0.248
0.706TyrCys: 0.706 ± 0.152
3.843TyrAsp: 3.843 ± 0.416
4.052TyrGlu: 4.052 ± 0.348
2.797TyrPhe: 2.797 ± 0.244
2.065TyrGly: 2.065 ± 0.173
0.732TyrHis: 0.732 ± 0.147
5.673TyrIle: 5.673 ± 0.418
6.509TyrLys: 6.509 ± 0.352
5.176TyrLeu: 5.176 ± 0.37
1.124TyrMet: 1.124 ± 0.157
4.862TyrAsn: 4.862 ± 0.415
1.02TyrPro: 1.02 ± 0.174
1.987TyrGln: 1.987 ± 0.235
1.908TyrArg: 1.908 ± 0.215
2.98TyrSer: 2.98 ± 0.313
3.398TyrThr: 3.398 ± 0.349
3.921TyrVal: 3.921 ± 0.34
0.68TyrTrp: 0.68 ± 0.129
3.529TyrTyr: 3.529 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.026XaaAsp: 0.026 ± 0.025
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.026XaaGly: 0.026 ± 0.025
0.0XaaHis: 0.0 ± 0.0
0.026XaaIle: 0.026 ± 0.025
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.026XaaThr: 0.026 ± 0.025
0.026XaaVal: 0.026 ± 0.025
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.026XaaXaa: 0.026 ± 0.025
Statistics based on 181 proteins (38254 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski