Amino acid dipeptide frequency for Yersinia phage phiR1-37

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
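As a rough illustration of the calculation, the sketch below counts overlapping dipeptides within each protein and reports them in per mille. The function names, the one-letter alphabet, the pooling of unknown residues as X, and the normalisation by the total number of dipeptides are assumptions made for this example; the database may compute its values differently.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides, never spanning two different proteins."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_permille(proteins):
    """Return the frequency of each of the 441 possible dipeptides in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not taken from the phiR1-37 proteome.
toy = ["MKKLLA", "MKXA"]
freqs = dipeptide_permille(toy)
print(freqs["MK"])  # MK occurs in 2 of the 8 dipeptides -> 250.0
```

Counting per protein keeps the last residue of one protein from pairing with the first residue of the next.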

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
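The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function names are hypothetical, and it assumes that the threshold is exactly the uniform expectation (1/400 of all dipeptides), which may not match the rule used for the colouring on the page.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


expected = expected_permille(20)  # 2.5 per mille, i.e. 0.25%

# Three values copied from the table for this proteome, in per mille.
sample = {"AlaAla": 3.692, "AlaCys": 0.429, "LysLeu": 7.066}
for name, value in sample.items():
    print(name, classify(value, expected))
```

With the 21-letter alphabet that includes Xaa, `expected_permille(21)` gives roughly 2.27‰ instead.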

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
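For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and used only for illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 3.692 per mille in the table.
print(f"{permille_to_fraction(3.692):.6f}")  # 0.003692
print(f"{permille_to_percent(3.692):.4f}")   # 0.3692
```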

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.692 ± 0.348
AlaCys: 0.429 ± 0.073
AlaAsp: 3.079 ± 0.182
AlaGlu: 3.091 ± 0.209
AlaPhe: 2.49 ± 0.21
AlaGly: 2.38 ± 0.256
AlaHis: 0.662 ± 0.084
AlaIle: 4.085 ± 0.223
AlaLys: 5.115 ± 0.313
AlaLeu: 4.416 ± 0.216
AlaMet: 1.423 ± 0.15
AlaAsn: 3.84 ± 0.245
AlaPro: 1.435 ± 0.141
AlaGln: 1.202 ± 0.152
AlaArg: 2.024 ± 0.157
AlaSer: 3.496 ± 0.25
AlaThr: 3.263 ± 0.238
AlaVal: 3.3 ± 0.2
AlaTrp: 0.245 ± 0.058
AlaTyr: 2.061 ± 0.166
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.478 ± 0.087
CysCys: 0.061 ± 0.026
CysAsp: 0.515 ± 0.082
CysGlu: 0.454 ± 0.084
CysPhe: 0.662 ± 0.096
CysGly: 0.527 ± 0.084
CysHis: 0.147 ± 0.042
CysIle: 0.601 ± 0.092
CysLys: 0.699 ± 0.112
CysLeu: 0.785 ± 0.102
CysMet: 0.343 ± 0.068
CysAsn: 0.761 ± 0.111
CysPro: 0.454 ± 0.086
CysGln: 0.233 ± 0.061
CysArg: 0.319 ± 0.069
CysSer: 0.54 ± 0.087
CysThr: 0.527 ± 0.081
CysVal: 0.368 ± 0.062
CysTrp: 0.098 ± 0.035
CysTyr: 0.319 ± 0.066
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.809 ± 0.173
AspCys: 0.454 ± 0.078
AspAsp: 4.085 ± 0.323
AspGlu: 4.563 ± 0.265
AspPhe: 4.183 ± 0.227
AspGly: 3.324 ± 0.259
AspHis: 0.896 ± 0.105
AspIle: 5.606 ± 0.311
AspLys: 5.876 ± 0.259
AspLeu: 5.594 ± 0.304
AspMet: 1.57 ± 0.136
AspAsn: 4.318 ± 0.24
AspPro: 2.061 ± 0.155
AspGln: 1.448 ± 0.128
AspArg: 2.184 ± 0.145
AspSer: 4.846 ± 0.335
AspThr: 3.251 ± 0.216
AspVal: 3.692 ± 0.221
AspTrp: 0.491 ± 0.086
AspTyr: 3.116 ± 0.242
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.447 ± 0.25
GluCys: 0.687 ± 0.086
GluAsp: 4.355 ± 0.317
GluGlu: 6.036 ± 0.741
GluPhe: 3.545 ± 0.257
GluGly: 3.251 ± 0.21
GluHis: 0.871 ± 0.096
GluIle: 5.226 ± 0.277
GluLys: 5.263 ± 0.332
GluLeu: 5.913 ± 0.305
GluMet: 2.036 ± 0.158
GluAsn: 4.245 ± 0.224
GluPro: 1.116 ± 0.126
GluGln: 1.57 ± 0.139
GluArg: 2.38 ± 0.166
GluSer: 4.723 ± 0.277
GluThr: 3.717 ± 0.235
GluVal: 4.22 ± 0.261
GluTrp: 0.368 ± 0.067
GluTyr: 3.091 ± 0.234
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.65 ± 0.195
PheCys: 0.491 ± 0.076
PheAsp: 3.84 ± 0.209
PheGlu: 2.797 ± 0.198
PhePhe: 2.809 ± 0.189
PheGly: 2.576 ± 0.216
PheHis: 0.981 ± 0.106
PheIle: 4.931 ± 0.244
PheLys: 4.723 ± 0.229
PheLeu: 4.379 ± 0.211
PheMet: 1.264 ± 0.128
PheAsn: 4.22 ± 0.212
PhePro: 1.497 ± 0.141
PheGln: 1.129 ± 0.12
PheArg: 2.061 ± 0.17
PheSer: 4.257 ± 0.214
PheThr: 3.435 ± 0.216
PheVal: 2.981 ± 0.185
PheTrp: 0.221 ± 0.064
PheTyr: 2.662 ± 0.223
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.417 ± 0.265
GlyCys: 0.589 ± 0.093
GlyAsp: 2.785 ± 0.215
GlyGlu: 2.711 ± 0.205
GlyPhe: 3.018 ± 0.25
GlyGly: 3.324 ± 0.379
GlyHis: 0.601 ± 0.101
GlyIle: 4.318 ± 0.291
GlyLys: 5.128 ± 0.364
GlyLeu: 3.926 ± 0.288
GlyMet: 1.288 ± 0.128
GlyAsn: 3.521 ± 0.265
GlyPro: 0.38 ± 0.074
GlyGln: 0.908 ± 0.11
GlyArg: 2.368 ± 0.219
GlySer: 3.729 ± 0.302
GlyThr: 2.871 ± 0.263
GlyVal: 3.055 ± 0.203
GlyTrp: 0.442 ± 0.086
GlyTyr: 2.184 ± 0.143
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.724 ± 0.097
HisCys: 0.258 ± 0.061
HisAsp: 0.896 ± 0.1
HisGlu: 0.92 ± 0.109
HisPhe: 1.018 ± 0.14
HisGly: 0.761 ± 0.129
HisHis: 0.331 ± 0.068
HisIle: 1.533 ± 0.156
HisLys: 1.288 ± 0.119
HisLeu: 1.411 ± 0.128
HisMet: 0.368 ± 0.061
HisAsn: 1.055 ± 0.131
HisPro: 0.859 ± 0.114
HisGln: 0.478 ± 0.081
HisArg: 0.736 ± 0.107
HisSer: 1.349 ± 0.136
HisThr: 0.748 ± 0.102
HisVal: 0.773 ± 0.112
HisTrp: 0.086 ± 0.032
HisTyr: 0.92 ± 0.114
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.938 ± 0.236
IleCys: 0.883 ± 0.1
IleAsp: 5.729 ± 0.284
IleGlu: 5.52 ± 0.298
IlePhe: 3.57 ± 0.203
IleGly: 3.57 ± 0.294
IleHis: 1.705 ± 0.155
IleIle: 5.864 ± 0.32
IleLys: 6.747 ± 0.289
IleLeu: 6.293 ± 0.273
IleMet: 1.975 ± 0.159
IleAsn: 6.097 ± 0.297
IlePro: 3.484 ± 0.214
IleGln: 2.257 ± 0.193
IleArg: 3.901 ± 0.222
IleSer: 6.992 ± 0.299
IleThr: 4.76 ± 0.236
IleVal: 3.999 ± 0.219
IleTrp: 0.38 ± 0.067
IleTyr: 3.226 ± 0.211
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.674 ± 0.422
LysCys: 0.81 ± 0.116
LysAsp: 5.815 ± 0.229
LysGlu: 6.649 ± 0.379
LysPhe: 4.024 ± 0.284
LysGly: 4.281 ± 0.393
LysHis: 1.227 ± 0.119
LysIle: 6.624 ± 0.302
LysLys: 6.919 ± 0.376
LysLeu: 7.066 ± 0.32
LysMet: 2.245 ± 0.165
LysAsn: 4.833 ± 0.235
LysPro: 2.601 ± 0.17
LysGln: 2.073 ± 0.163
LysArg: 3.582 ± 0.2
LysSer: 6.318 ± 0.281
LysThr: 4.698 ± 0.319
LysVal: 5.594 ± 0.251
LysTrp: 0.736 ± 0.095
LysTyr: 3.938 ± 0.247
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.711 ± 0.225
LeuCys: 0.92 ± 0.122
LeuAsp: 5.422 ± 0.281
LeuGlu: 5.925 ± 0.284
LeuPhe: 4.11 ± 0.227
LeuGly: 4.11 ± 0.372
LeuHis: 1.435 ± 0.131
LeuIle: 5.618 ± 0.298
LeuLys: 7.005 ± 0.293
LeuLeu: 6.637 ± 0.291
LeuMet: 2.355 ± 0.154
LeuAsn: 6.269 ± 0.259
LeuPro: 3.055 ± 0.187
LeuGln: 2.319 ± 0.186
LeuArg: 3.386 ± 0.181
LeuSer: 7.262 ± 0.33
LeuThr: 5.336 ± 0.253
LeuVal: 5.079 ± 0.245
LeuTrp: 0.54 ± 0.082
LeuTyr: 3.202 ± 0.209
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.779 ± 0.138
MetCys: 0.135 ± 0.046
MetAsp: 1.533 ± 0.136
MetGlu: 1.582 ± 0.152
MetPhe: 1.472 ± 0.147
MetGly: 1.067 ± 0.122
MetHis: 0.429 ± 0.07
MetIle: 1.901 ± 0.145
MetLys: 2.429 ± 0.169
MetLeu: 2.392 ± 0.181
MetMet: 0.81 ± 0.106
MetAsn: 1.914 ± 0.166
MetPro: 0.699 ± 0.093
MetGln: 0.699 ± 0.099
MetArg: 0.994 ± 0.122
MetSer: 2.282 ± 0.175
MetThr: 1.3 ± 0.138
MetVal: 1.582 ± 0.131
MetTrp: 0.086 ± 0.032
MetTyr: 1.129 ± 0.141
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.582 ± 0.245
AsnCys: 0.491 ± 0.076
AsnAsp: 4.355 ± 0.238
AsnGlu: 4.048 ± 0.218
AsnPhe: 3.705 ± 0.223
AsnGly: 3.84 ± 0.247
AsnHis: 1.19 ± 0.144
AsnIle: 5.839 ± 0.307
AsnLys: 5.668 ± 0.286
AsnLeu: 6.072 ± 0.296
AsnMet: 1.816 ± 0.147
AsnAsn: 4.723 ± 0.297
AsnPro: 2.539 ± 0.187
AsnGln: 1.742 ± 0.171
AsnArg: 2.944 ± 0.192
AsnSer: 5.263 ± 0.265
AsnThr: 4.588 ± 0.262
AsnVal: 3.521 ± 0.188
AsnTrp: 0.454 ± 0.078
AsnTyr: 3.067 ± 0.22
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.398 ± 0.136
ProCys: 0.294 ± 0.059
ProAsp: 2.147 ± 0.169
ProGlu: 2.233 ± 0.182
ProPhe: 1.681 ± 0.145
ProGly: 1.092 ± 0.119
ProHis: 0.38 ± 0.063
ProIle: 2.723 ± 0.181
ProLys: 2.49 ± 0.188
ProLeu: 2.76 ± 0.209
ProMet: 0.577 ± 0.081
ProAsn: 2.11 ± 0.145
ProPro: 0.908 ± 0.079
ProGln: 0.54 ± 0.088
ProArg: 0.957 ± 0.102
ProSer: 2.429 ± 0.166
ProThr: 1.865 ± 0.143
ProVal: 2.515 ± 0.176
ProTrp: 0.135 ± 0.045
ProTyr: 1.619 ± 0.156
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.092 ± 0.116
GlnCys: 0.184 ± 0.047
GlnAsp: 1.558 ± 0.13
GlnGlu: 1.754 ± 0.158
GlnPhe: 1.484 ± 0.133
GlnGly: 1.19 ± 0.138
GlnHis: 0.38 ± 0.07
GlnIle: 1.926 ± 0.148
GlnLys: 1.742 ± 0.15
GlnLeu: 2.49 ± 0.188
GlnMet: 0.662 ± 0.105
GlnAsn: 1.521 ± 0.124
GlnPro: 0.626 ± 0.092
GlnGln: 0.81 ± 0.097
GlnArg: 1.018 ± 0.129
GlnSer: 1.668 ± 0.127
GlnThr: 1.497 ± 0.168
GlnVal: 1.742 ± 0.16
GlnTrp: 0.209 ± 0.05
GlnTyr: 1.129 ± 0.116
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.705 ± 0.146
ArgCys: 0.417 ± 0.077
ArgAsp: 2.269 ± 0.141
ArgGlu: 2.269 ± 0.168
ArgPhe: 2.196 ± 0.164
ArgGly: 1.754 ± 0.154
ArgHis: 0.662 ± 0.099
ArgIle: 3.962 ± 0.205
ArgLys: 3.668 ± 0.237
ArgLeu: 3.717 ± 0.223
ArgMet: 0.981 ± 0.115
ArgAsn: 2.907 ± 0.204
ArgPro: 1.19 ± 0.11
ArgGln: 1.043 ± 0.115
ArgArg: 1.693 ± 0.167
ArgSer: 2.429 ± 0.172
ArgThr: 2.171 ± 0.164
ArgVal: 2.306 ± 0.177
ArgTrp: 0.405 ± 0.06
ArgTyr: 1.901 ± 0.187
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.827 ± 0.277
SerCys: 0.527 ± 0.08
SerAsp: 5.03 ± 0.279
SerGlu: 4.441 ± 0.254
SerPhe: 4.465 ± 0.224
SerGly: 4.232 ± 0.291
SerHis: 1.362 ± 0.113
SerIle: 6.011 ± 0.267
SerLys: 5.999 ± 0.28
SerLeu: 6.686 ± 0.309
SerMet: 2.171 ± 0.172
SerAsn: 5.496 ± 0.277
SerPro: 2.22 ± 0.154
SerGln: 1.889 ± 0.138
SerArg: 2.49 ± 0.188
SerSer: 6.735 ± 0.49
SerThr: 4.588 ± 0.281
SerVal: 4.649 ± 0.231
SerTrp: 0.442 ± 0.071
SerTyr: 3.717 ± 0.2
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.165 ± 0.255
ThrCys: 0.417 ± 0.073
ThrAsp: 3.545 ± 0.23
ThrGlu: 3.84 ± 0.218
ThrPhe: 3.312 ± 0.206
ThrGly: 3.226 ± 0.291
ThrHis: 1.104 ± 0.137
ThrIle: 4.931 ± 0.208
ThrLys: 4.735 ± 0.262
ThrLeu: 4.981 ± 0.261
ThrMet: 1.178 ± 0.119
ThrAsn: 3.975 ± 0.234
ThrPro: 1.914 ± 0.181
ThrGln: 1.46 ± 0.149
ThrArg: 2.024 ± 0.151
ThrSer: 4.245 ± 0.251
ThrThr: 3.778 ± 0.306
ThrVal: 3.852 ± 0.258
ThrTrp: 0.564 ± 0.081
ThrTyr: 2.969 ± 0.203
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.337 ± 0.212
ValCys: 0.429 ± 0.083
ValAsp: 3.95 ± 0.241
ValGlu: 4.232 ± 0.286
ValPhe: 3.275 ± 0.208
ValGly: 2.748 ± 0.219
ValHis: 0.883 ± 0.101
ValIle: 4.858 ± 0.244
ValLys: 4.919 ± 0.22
ValLeu: 4.87 ± 0.248
ValMet: 1.644 ± 0.161
ValAsn: 4.048 ± 0.194
ValPro: 2.147 ± 0.141
ValGln: 1.398 ± 0.145
ValArg: 2.233 ± 0.145
ValSer: 3.987 ± 0.189
ValThr: 3.729 ± 0.22
ValVal: 3.889 ± 0.217
ValTrp: 0.343 ± 0.063
ValTyr: 2.772 ± 0.195
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.343 ± 0.068
TrpCys: 0.049 ± 0.023
TrpAsp: 0.491 ± 0.081
TrpGlu: 0.393 ± 0.066
TrpPhe: 0.429 ± 0.077
TrpGly: 0.233 ± 0.057
TrpHis: 0.123 ± 0.044
TrpIle: 0.589 ± 0.076
TrpLys: 0.478 ± 0.07
TrpLeu: 0.589 ± 0.108
TrpMet: 0.258 ± 0.06
TrpAsn: 0.454 ± 0.077
TrpPro: 0.135 ± 0.051
TrpGln: 0.184 ± 0.046
TrpArg: 0.258 ± 0.048
TrpSer: 0.515 ± 0.079
TrpThr: 0.294 ± 0.066
TrpVal: 0.417 ± 0.078
TrpTrp: 0.037 ± 0.022
TrpTyr: 0.282 ± 0.06
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.963 ± 0.133
TyrCys: 0.368 ± 0.066
TyrAsp: 2.969 ± 0.201
TyrGlu: 2.601 ± 0.176
TyrPhe: 2.453 ± 0.208
TyrGly: 2.073 ± 0.199
TyrHis: 1.141 ± 0.123
TyrIle: 3.742 ± 0.225
TyrLys: 3.815 ± 0.258
TyrLeu: 3.717 ± 0.211
TyrMet: 1.239 ± 0.12
TyrAsn: 3.14 ± 0.211
TyrPro: 1.533 ± 0.133
TyrGln: 1.3 ± 0.117
TyrArg: 2.098 ± 0.164
TyrSer: 3.926 ± 0.219
TyrThr: 2.797 ± 0.211
TyrVal: 2.159 ± 0.177
TyrTrp: 0.27 ± 0.067
TyrTyr: 2.098 ± 0.209
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 367 proteins (81518 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
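A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, and the use of the sample standard deviation as the error are assumptions for illustration; the exact procedure behind the published values is not described here.

```python
import random
import statistics


def frequency_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide given in one-letter code."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the phiR1-37 proteome.
toy = ["MKKLLA", "MKTAYIAK", "AKKA", "MLLK"]
print(frequency_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.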

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski