Amino acid dipepetide frequency for Edwardsiella phage PEi20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.65AlaAla: 5.65 ± 0.367
0.492AlaCys: 0.492 ± 0.105
3.9AlaAsp: 3.9 ± 0.239
4.885AlaGlu: 4.885 ± 0.402
2.77AlaPhe: 2.77 ± 0.212
5.395AlaGly: 5.395 ± 0.415
1.294AlaHis: 1.294 ± 0.172
4.411AlaIle: 4.411 ± 0.255
5.231AlaLys: 5.231 ± 0.294
6.252AlaLeu: 6.252 ± 0.375
1.987AlaMet: 1.987 ± 0.19
3.536AlaAsn: 3.536 ± 0.364
2.661AlaPro: 2.661 ± 0.226
3.007AlaGln: 3.007 ± 0.257
3.427AlaArg: 3.427 ± 0.25
4.356AlaSer: 4.356 ± 0.314
4.538AlaThr: 4.538 ± 0.425
5.523AlaVal: 5.523 ± 0.331
0.948AlaTrp: 0.948 ± 0.13
2.57AlaTyr: 2.57 ± 0.208
0.0AlaXaa: 0.0 ± 0.0
Cys
0.747CysAla: 0.747 ± 0.111
0.146CysCys: 0.146 ± 0.05
0.857CysAsp: 0.857 ± 0.11
0.711CysGlu: 0.711 ± 0.128
0.346CysPhe: 0.346 ± 0.082
0.711CysGly: 0.711 ± 0.135
0.2CysHis: 0.2 ± 0.051
0.529CysIle: 0.529 ± 0.099
0.766CysLys: 0.766 ± 0.125
0.656CysLeu: 0.656 ± 0.119
0.383CysMet: 0.383 ± 0.082
0.456CysAsn: 0.456 ± 0.088
0.51CysPro: 0.51 ± 0.081
0.437CysGln: 0.437 ± 0.097
0.529CysArg: 0.529 ± 0.107
0.93CysSer: 0.93 ± 0.141
0.583CysThr: 0.583 ± 0.105
0.711CysVal: 0.711 ± 0.1
0.128CysTrp: 0.128 ± 0.048
0.383CysTyr: 0.383 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
4.429AspAla: 4.429 ± 0.305
0.638AspCys: 0.638 ± 0.101
4.666AspAsp: 4.666 ± 0.304
4.666AspGlu: 4.666 ± 0.322
3.299AspPhe: 3.299 ± 0.255
4.739AspGly: 4.739 ± 0.335
1.002AspHis: 1.002 ± 0.119
4.684AspIle: 4.684 ± 0.347
4.393AspLys: 4.393 ± 0.353
4.666AspLeu: 4.666 ± 0.304
1.987AspMet: 1.987 ± 0.2
2.752AspAsn: 2.752 ± 0.201
2.57AspPro: 2.57 ± 0.241
1.932AspGln: 1.932 ± 0.187
2.643AspArg: 2.643 ± 0.193
3.773AspSer: 3.773 ± 0.247
3.117AspThr: 3.117 ± 0.223
4.393AspVal: 4.393 ± 0.303
1.294AspTrp: 1.294 ± 0.17
3.044AspTyr: 3.044 ± 0.288
0.0AspXaa: 0.0 ± 0.0
Glu
5.668GluAla: 5.668 ± 0.382
0.838GluCys: 0.838 ± 0.143
4.265GluAsp: 4.265 ± 0.325
5.085GluGlu: 5.085 ± 0.409
3.518GluPhe: 3.518 ± 0.222
4.174GluGly: 4.174 ± 0.261
1.276GluHis: 1.276 ± 0.153
4.666GluIle: 4.666 ± 0.315
4.192GluLys: 4.192 ± 0.255
6.89GluLeu: 6.89 ± 0.36
2.588GluMet: 2.588 ± 0.221
3.062GluAsn: 3.062 ± 0.224
1.659GluPro: 1.659 ± 0.184
2.625GluGln: 2.625 ± 0.22
3.281GluArg: 3.281 ± 0.243
3.7GluSer: 3.7 ± 0.263
3.973GluThr: 3.973 ± 0.252
5.814GluVal: 5.814 ± 0.307
0.948GluTrp: 0.948 ± 0.128
3.335GluTyr: 3.335 ± 0.242
0.0GluXaa: 0.0 ± 0.0
Phe
3.244PheAla: 3.244 ± 0.24
0.474PheCys: 0.474 ± 0.106
3.08PheAsp: 3.08 ± 0.262
3.536PheGlu: 3.536 ± 0.273
1.367PhePhe: 1.367 ± 0.182
3.19PheGly: 3.19 ± 0.232
0.766PheHis: 0.766 ± 0.113
3.007PheIle: 3.007 ± 0.255
3.846PheLys: 3.846 ± 0.279
2.424PheLeu: 2.424 ± 0.189
1.385PheMet: 1.385 ± 0.156
2.643PheAsn: 2.643 ± 0.214
1.239PhePro: 1.239 ± 0.149
1.422PheGln: 1.422 ± 0.171
1.841PheArg: 1.841 ± 0.174
2.497PheSer: 2.497 ± 0.204
2.625PheThr: 2.625 ± 0.22
2.989PheVal: 2.989 ± 0.224
0.911PheTrp: 0.911 ± 0.138
1.604PheTyr: 1.604 ± 0.138
0.0PheXaa: 0.0 ± 0.0
Gly
3.992GlyAla: 3.992 ± 0.371
0.674GlyCys: 0.674 ± 0.117
4.447GlyAsp: 4.447 ± 0.338
4.283GlyGlu: 4.283 ± 0.289
2.971GlyPhe: 2.971 ± 0.253
3.973GlyGly: 3.973 ± 0.381
1.258GlyHis: 1.258 ± 0.154
4.356GlyIle: 4.356 ± 0.248
4.429GlyLys: 4.429 ± 0.34
5.012GlyLeu: 5.012 ± 0.289
1.95GlyMet: 1.95 ± 0.186
3.19GlyAsn: 3.19 ± 0.334
1.968GlyPro: 1.968 ± 0.17
2.133GlyGln: 2.133 ± 0.205
2.843GlyArg: 2.843 ± 0.238
3.973GlySer: 3.973 ± 0.25
4.812GlyThr: 4.812 ± 0.415
4.32GlyVal: 4.32 ± 0.297
1.203GlyTrp: 1.203 ± 0.139
2.898GlyTyr: 2.898 ± 0.216
0.0GlyXaa: 0.0 ± 0.0
His
1.276HisAla: 1.276 ± 0.164
0.328HisCys: 0.328 ± 0.094
1.312HisAsp: 1.312 ± 0.137
1.13HisGlu: 1.13 ± 0.178
0.948HisPhe: 0.948 ± 0.142
1.203HisGly: 1.203 ± 0.146
0.51HisHis: 0.51 ± 0.116
1.531HisIle: 1.531 ± 0.179
1.385HisLys: 1.385 ± 0.147
1.44HisLeu: 1.44 ± 0.156
0.492HisMet: 0.492 ± 0.08
0.857HisAsn: 0.857 ± 0.13
0.984HisPro: 0.984 ± 0.156
0.492HisGln: 0.492 ± 0.087
0.784HisArg: 0.784 ± 0.132
1.021HisSer: 1.021 ± 0.15
0.911HisThr: 0.911 ± 0.11
1.331HisVal: 1.331 ± 0.161
0.273HisTrp: 0.273 ± 0.066
0.747HisTyr: 0.747 ± 0.116
0.0HisXaa: 0.0 ± 0.0
Ile
5.012IleAla: 5.012 ± 0.39
0.729IleCys: 0.729 ± 0.105
5.213IleAsp: 5.213 ± 0.281
5.231IleGlu: 5.231 ± 0.278
2.26IlePhe: 2.26 ± 0.207
3.536IleGly: 3.536 ± 0.24
1.422IleHis: 1.422 ± 0.164
4.411IleIle: 4.411 ± 0.324
5.468IleLys: 5.468 ± 0.269
3.755IleLeu: 3.755 ± 0.222
2.151IleMet: 2.151 ± 0.21
3.591IleAsn: 3.591 ± 0.253
2.442IlePro: 2.442 ± 0.215
2.388IleGln: 2.388 ± 0.191
3.445IleArg: 3.445 ± 0.263
3.645IleSer: 3.645 ± 0.273
4.338IleThr: 4.338 ± 0.264
4.065IleVal: 4.065 ± 0.236
0.529IleTrp: 0.529 ± 0.101
2.333IleTyr: 2.333 ± 0.19
0.0IleXaa: 0.0 ± 0.0
Lys
6.197LysAla: 6.197 ± 0.37
0.547LysCys: 0.547 ± 0.101
4.283LysAsp: 4.283 ± 0.324
5.213LysGlu: 5.213 ± 0.384
3.244LysPhe: 3.244 ± 0.256
4.301LysGly: 4.301 ± 0.24
1.458LysHis: 1.458 ± 0.153
4.593LysIle: 4.593 ± 0.307
4.374LysLys: 4.374 ± 0.314
5.668LysLeu: 5.668 ± 0.325
2.789LysMet: 2.789 ± 0.224
3.591LysAsn: 3.591 ± 0.241
2.388LysPro: 2.388 ± 0.207
2.442LysGln: 2.442 ± 0.237
3.354LysArg: 3.354 ± 0.26
3.791LysSer: 3.791 ± 0.252
4.174LysThr: 4.174 ± 0.276
4.921LysVal: 4.921 ± 0.317
1.094LysTrp: 1.094 ± 0.114
2.88LysTyr: 2.88 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
5.632LeuAla: 5.632 ± 0.337
0.784LeuCys: 0.784 ± 0.119
4.739LeuAsp: 4.739 ± 0.292
5.432LeuGlu: 5.432 ± 0.352
2.843LeuPhe: 2.843 ± 0.278
3.973LeuGly: 3.973 ± 0.252
1.349LeuHis: 1.349 ± 0.149
4.502LeuIle: 4.502 ± 0.251
5.978LeuLys: 5.978 ± 0.297
4.63LeuLeu: 4.63 ± 0.319
2.297LeuMet: 2.297 ± 0.236
4.557LeuAsn: 4.557 ± 0.251
3.317LeuPro: 3.317 ± 0.225
2.716LeuGln: 2.716 ± 0.2
3.463LeuArg: 3.463 ± 0.223
4.21LeuSer: 4.21 ± 0.32
4.794LeuThr: 4.794 ± 0.364
4.63LeuVal: 4.63 ± 0.319
0.82LeuTrp: 0.82 ± 0.106
2.77LeuTyr: 2.77 ± 0.248
0.0LeuXaa: 0.0 ± 0.0
Met
2.588MetAla: 2.588 ± 0.244
0.292MetCys: 0.292 ± 0.071
1.786MetAsp: 1.786 ± 0.183
1.695MetGlu: 1.695 ± 0.182
1.586MetPhe: 1.586 ± 0.165
1.732MetGly: 1.732 ± 0.195
0.674MetHis: 0.674 ± 0.121
2.041MetIle: 2.041 ± 0.197
2.898MetLys: 2.898 ± 0.226
2.078MetLeu: 2.078 ± 0.215
0.838MetMet: 0.838 ± 0.117
1.567MetAsn: 1.567 ± 0.162
0.966MetPro: 0.966 ± 0.115
1.331MetGln: 1.331 ± 0.172
1.367MetArg: 1.367 ± 0.158
2.278MetSer: 2.278 ± 0.207
2.06MetThr: 2.06 ± 0.161
1.95MetVal: 1.95 ± 0.186
0.365MetTrp: 0.365 ± 0.083
0.948MetTyr: 0.948 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
3.645AsnAla: 3.645 ± 0.292
0.547AsnCys: 0.547 ± 0.098
2.807AsnAsp: 2.807 ± 0.236
3.955AsnGlu: 3.955 ± 0.232
2.369AsnPhe: 2.369 ± 0.193
4.229AsnGly: 4.229 ± 0.332
0.93AsnHis: 0.93 ± 0.138
3.335AsnIle: 3.335 ± 0.189
3.153AsnLys: 3.153 ± 0.237
3.645AsnLeu: 3.645 ± 0.229
1.586AsnMet: 1.586 ± 0.183
2.752AsnAsn: 2.752 ± 0.232
2.151AsnPro: 2.151 ± 0.227
1.95AsnGln: 1.95 ± 0.212
1.95AsnArg: 1.95 ± 0.179
2.825AsnSer: 2.825 ± 0.215
2.789AsnThr: 2.789 ± 0.202
3.39AsnVal: 3.39 ± 0.22
0.638AsnTrp: 0.638 ± 0.105
1.823AsnTyr: 1.823 ± 0.178
0.0AsnXaa: 0.0 ± 0.0
Pro
2.406ProAla: 2.406 ± 0.221
0.437ProCys: 0.437 ± 0.092
2.661ProAsp: 2.661 ± 0.218
3.062ProGlu: 3.062 ± 0.235
1.75ProPhe: 1.75 ± 0.159
2.57ProGly: 2.57 ± 0.196
0.601ProHis: 0.601 ± 0.098
2.315ProIle: 2.315 ± 0.21
2.497ProLys: 2.497 ± 0.196
2.242ProLeu: 2.242 ± 0.207
0.911ProMet: 0.911 ± 0.122
1.804ProAsn: 1.804 ± 0.158
1.094ProPro: 1.094 ± 0.161
1.148ProGln: 1.148 ± 0.149
1.331ProArg: 1.331 ± 0.156
2.588ProSer: 2.588 ± 0.217
2.133ProThr: 2.133 ± 0.19
2.643ProVal: 2.643 ± 0.217
0.766ProTrp: 0.766 ± 0.131
1.495ProTyr: 1.495 ± 0.178
0.0ProXaa: 0.0 ± 0.0
Gln
2.934GlnAla: 2.934 ± 0.247
0.419GlnCys: 0.419 ± 0.085
1.95GlnAsp: 1.95 ± 0.211
2.096GlnGlu: 2.096 ± 0.182
1.932GlnPhe: 1.932 ± 0.187
2.187GlnGly: 2.187 ± 0.21
0.711GlnHis: 0.711 ± 0.119
2.679GlnIle: 2.679 ± 0.221
2.552GlnLys: 2.552 ± 0.227
2.953GlnLeu: 2.953 ± 0.214
1.185GlnMet: 1.185 ± 0.146
1.513GlnAsn: 1.513 ± 0.158
1.367GlnPro: 1.367 ± 0.177
0.966GlnGln: 0.966 ± 0.153
2.133GlnArg: 2.133 ± 0.212
1.859GlnSer: 1.859 ± 0.175
1.914GlnThr: 1.914 ± 0.178
2.388GlnVal: 2.388 ± 0.219
0.82GlnTrp: 0.82 ± 0.118
1.586GlnTyr: 1.586 ± 0.158
0.0GlnXaa: 0.0 ± 0.0
Arg
3.208ArgAla: 3.208 ± 0.277
0.62ArgCys: 0.62 ± 0.11
3.153ArgAsp: 3.153 ± 0.294
3.518ArgGlu: 3.518 ± 0.255
2.133ArgPhe: 2.133 ± 0.194
2.679ArgGly: 2.679 ± 0.208
0.802ArgHis: 0.802 ± 0.129
3.244ArgIle: 3.244 ± 0.241
3.026ArgLys: 3.026 ± 0.246
3.572ArgLeu: 3.572 ± 0.273
1.203ArgMet: 1.203 ± 0.152
2.479ArgAsn: 2.479 ± 0.201
1.64ArgPro: 1.64 ± 0.163
2.041ArgGln: 2.041 ± 0.192
1.896ArgArg: 1.896 ± 0.212
2.552ArgSer: 2.552 ± 0.183
2.297ArgThr: 2.297 ± 0.212
3.226ArgVal: 3.226 ± 0.235
0.802ArgTrp: 0.802 ± 0.106
1.695ArgTyr: 1.695 ± 0.17
0.0ArgXaa: 0.0 ± 0.0
Ser
3.627SerAla: 3.627 ± 0.288
0.674SerCys: 0.674 ± 0.125
3.755SerAsp: 3.755 ± 0.264
4.01SerGlu: 4.01 ± 0.24
2.789SerPhe: 2.789 ± 0.227
4.247SerGly: 4.247 ± 0.316
0.948SerHis: 0.948 ± 0.142
3.591SerIle: 3.591 ± 0.277
4.283SerLys: 4.283 ± 0.314
4.52SerLeu: 4.52 ± 0.257
1.896SerMet: 1.896 ± 0.179
2.552SerAsn: 2.552 ± 0.23
2.041SerPro: 2.041 ± 0.224
2.041SerGln: 2.041 ± 0.206
2.989SerArg: 2.989 ± 0.277
3.773SerSer: 3.773 ± 0.302
3.664SerThr: 3.664 ± 0.304
3.882SerVal: 3.882 ± 0.268
0.838SerTrp: 0.838 ± 0.122
2.552SerTyr: 2.552 ± 0.202
0.0SerXaa: 0.0 ± 0.0
Thr
4.065ThrAla: 4.065 ± 0.324
0.51ThrCys: 0.51 ± 0.081
3.427ThrAsp: 3.427 ± 0.26
4.083ThrGlu: 4.083 ± 0.323
2.679ThrPhe: 2.679 ± 0.205
4.447ThrGly: 4.447 ± 0.307
1.021ThrHis: 1.021 ± 0.136
4.283ThrIle: 4.283 ± 0.293
3.828ThrLys: 3.828 ± 0.286
4.721ThrLeu: 4.721 ± 0.339
1.385ThrMet: 1.385 ± 0.156
2.77ThrAsn: 2.77 ± 0.229
2.843ThrPro: 2.843 ± 0.241
1.914ThrGln: 1.914 ± 0.209
2.752ThrArg: 2.752 ± 0.22
3.445ThrSer: 3.445 ± 0.287
3.335ThrThr: 3.335 ± 0.285
4.156ThrVal: 4.156 ± 0.267
0.857ThrTrp: 0.857 ± 0.119
2.242ThrTyr: 2.242 ± 0.18
0.0ThrXaa: 0.0 ± 0.0
Val
4.775ValAla: 4.775 ± 0.276
0.966ValCys: 0.966 ± 0.146
4.411ValAsp: 4.411 ± 0.25
5.596ValGlu: 5.596 ± 0.33
2.588ValPhe: 2.588 ± 0.253
4.265ValGly: 4.265 ± 0.27
1.385ValHis: 1.385 ± 0.155
4.156ValIle: 4.156 ± 0.302
4.885ValLys: 4.885 ± 0.326
4.484ValLeu: 4.484 ± 0.285
2.005ValMet: 2.005 ± 0.207
3.518ValAsn: 3.518 ± 0.217
2.26ValPro: 2.26 ± 0.187
2.789ValGln: 2.789 ± 0.228
3.445ValArg: 3.445 ± 0.261
4.338ValSer: 4.338 ± 0.293
4.01ValThr: 4.01 ± 0.309
5.103ValVal: 5.103 ± 0.354
0.984ValTrp: 0.984 ± 0.157
3.062ValTyr: 3.062 ± 0.23
0.0ValXaa: 0.0 ± 0.0
Trp
1.112TrpAla: 1.112 ± 0.158
0.164TrpCys: 0.164 ± 0.058
0.966TrpAsp: 0.966 ± 0.132
0.984TrpGlu: 0.984 ± 0.13
0.747TrpPhe: 0.747 ± 0.108
0.601TrpGly: 0.601 ± 0.123
0.492TrpHis: 0.492 ± 0.096
0.838TrpIle: 0.838 ± 0.138
1.331TrpLys: 1.331 ± 0.154
1.002TrpLeu: 1.002 ± 0.117
0.729TrpMet: 0.729 ± 0.129
0.857TrpAsn: 0.857 ± 0.112
0.51TrpPro: 0.51 ± 0.1
0.51TrpGln: 0.51 ± 0.097
0.583TrpArg: 0.583 ± 0.11
0.838TrpSer: 0.838 ± 0.132
0.784TrpThr: 0.784 ± 0.133
1.021TrpVal: 1.021 ± 0.13
0.292TrpTrp: 0.292 ± 0.07
0.729TrpTyr: 0.729 ± 0.118
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.625TyrAla: 2.625 ± 0.238
0.437TyrCys: 0.437 ± 0.088
3.007TyrAsp: 3.007 ± 0.246
2.388TyrGlu: 2.388 ± 0.214
1.968TyrPhe: 1.968 ± 0.197
2.533TyrGly: 2.533 ± 0.227
0.857TyrHis: 0.857 ± 0.129
2.789TyrIle: 2.789 ± 0.246
2.789TyrLys: 2.789 ± 0.234
2.825TyrLeu: 2.825 ± 0.261
1.276TyrMet: 1.276 ± 0.151
2.424TyrAsn: 2.424 ± 0.191
1.841TyrPro: 1.841 ± 0.2
1.859TyrGln: 1.859 ± 0.194
1.732TyrArg: 1.732 ± 0.156
2.278TyrSer: 2.278 ± 0.195
1.914TyrThr: 1.914 ± 0.216
2.57TyrVal: 2.57 ± 0.208
0.583TyrTrp: 0.583 ± 0.112
1.64TyrTyr: 1.64 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 301 proteins (54866 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski