Amino acid dipeptide frequency for Escherichia phage vB_EcoM_Schickermooser

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
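The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_frequencies` and `classify` are hypothetical, the sketch counts overlapping pairs within each protein and normalises by the total number of pairs (the exact normalisation used by the database is not stated here), and it assumes that over- and underrepresentation are judged against the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping adjacent pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected_permille:
        return "over"    # would be marked red
    if freq_permille < expected_permille:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKLLA", "MAAC"])
    print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(4.94)` (AlaAla) gives "over" and `classify(0.962)` (AlaCys) gives "under".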

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
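As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 4.94  # AlaAla value taken from the table, in per mille
    print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(permille_to_percent(ala_ala))   # the same value in percent
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.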

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.94AlaAla: 4.94 ± 0.415
0.962AlaCys: 0.962 ± 0.151
3.475AlaAsp: 3.475 ± 0.253
4.962AlaGlu: 4.962 ± 0.391
2.841AlaPhe: 2.841 ± 0.273
4.852AlaGly: 4.852 ± 0.35
1.311AlaHis: 1.311 ± 0.162
4.109AlaIle: 4.109 ± 0.304
5.049AlaLys: 5.049 ± 0.328
6.033AlaLeu: 6.033 ± 0.35
2.12AlaMet: 2.12 ± 0.23
3.825AlaAsn: 3.825 ± 0.295
2.033AlaPro: 2.033 ± 0.224
1.967AlaGln: 1.967 ± 0.191
2.732AlaArg: 2.732 ± 0.261
3.912AlaSer: 3.912 ± 0.346
4.612AlaThr: 4.612 ± 0.365
4.743AlaVal: 4.743 ± 0.325
1.224AlaTrp: 1.224 ± 0.156
3.497AlaTyr: 3.497 ± 0.272
0.0AlaXaa: 0.0 ± 0.0
Cys
0.984CysAla: 0.984 ± 0.137
0.306CysCys: 0.306 ± 0.084
0.787CysAsp: 0.787 ± 0.142
0.94CysGlu: 0.94 ± 0.16
1.137CysPhe: 1.137 ± 0.176
0.874CysGly: 0.874 ± 0.152
0.284CysHis: 0.284 ± 0.085
0.809CysIle: 0.809 ± 0.139
1.049CysLys: 1.049 ± 0.161
1.115CysLeu: 1.115 ± 0.166
0.634CysMet: 0.634 ± 0.132
0.568CysAsn: 0.568 ± 0.125
0.525CysPro: 0.525 ± 0.119
0.59CysGln: 0.59 ± 0.124
0.721CysArg: 0.721 ± 0.155
0.809CysSer: 0.809 ± 0.105
0.612CysThr: 0.612 ± 0.129
1.115CysVal: 1.115 ± 0.169
0.306CysTrp: 0.306 ± 0.102
0.568CysTyr: 0.568 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
4.131AspAla: 4.131 ± 0.313
0.809AspCys: 0.809 ± 0.154
3.563AspAsp: 3.563 ± 0.296
4.087AspGlu: 4.087 ± 0.298
3.038AspPhe: 3.038 ± 0.257
4.656AspGly: 4.656 ± 0.358
1.464AspHis: 1.464 ± 0.209
4.109AspIle: 4.109 ± 0.307
3.978AspLys: 3.978 ± 0.284
5.552AspLeu: 5.552 ± 0.372
1.683AspMet: 1.683 ± 0.208
3.432AspAsn: 3.432 ± 0.246
2.448AspPro: 2.448 ± 0.239
2.098AspGln: 2.098 ± 0.183
2.339AspArg: 2.339 ± 0.257
3.322AspSer: 3.322 ± 0.256
3.344AspThr: 3.344 ± 0.256
4.35AspVal: 4.35 ± 0.302
1.115AspTrp: 1.115 ± 0.156
3.716AspTyr: 3.716 ± 0.282
0.0AspXaa: 0.0 ± 0.0
Glu
5.727GluAla: 5.727 ± 0.355
0.94GluCys: 0.94 ± 0.138
5.836GluAsp: 5.836 ± 0.417
8.415GluGlu: 8.415 ± 0.69
2.841GluPhe: 2.841 ± 0.204
4.743GluGly: 4.743 ± 0.326
1.246GluHis: 1.246 ± 0.174
4.656GluIle: 4.656 ± 0.349
4.896GluLys: 4.896 ± 0.355
5.18GluLeu: 5.18 ± 0.41
2.601GluMet: 2.601 ± 0.215
3.891GluAsn: 3.891 ± 0.322
1.814GluPro: 1.814 ± 0.241
2.754GluGln: 2.754 ± 0.278
3.06GluArg: 3.06 ± 0.271
3.497GluSer: 3.497 ± 0.257
4.065GluThr: 4.065 ± 0.296
4.896GluVal: 4.896 ± 0.337
1.727GluTrp: 1.727 ± 0.206
3.279GluTyr: 3.279 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
3.082PheAla: 3.082 ± 0.259
0.612PheCys: 0.612 ± 0.117
3.147PheAsp: 3.147 ± 0.228
3.279PheGlu: 3.279 ± 0.281
2.076PhePhe: 2.076 ± 0.227
2.798PheGly: 2.798 ± 0.292
0.721PheHis: 0.721 ± 0.108
2.47PheIle: 2.47 ± 0.204
3.169PheLys: 3.169 ± 0.277
3.213PheLeu: 3.213 ± 0.263
0.984PheMet: 0.984 ± 0.153
2.601PheAsn: 2.601 ± 0.244
1.661PhePro: 1.661 ± 0.2
1.333PheGln: 1.333 ± 0.182
1.727PheArg: 1.727 ± 0.218
2.667PheSer: 2.667 ± 0.251
2.667PheThr: 2.667 ± 0.259
3.257PheVal: 3.257 ± 0.321
0.656PheTrp: 0.656 ± 0.097
2.12PheTyr: 2.12 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
3.847GlyAla: 3.847 ± 0.3
1.224GlyCys: 1.224 ± 0.198
4.634GlyAsp: 4.634 ± 0.381
4.809GlyGlu: 4.809 ± 0.305
3.169GlyPhe: 3.169 ± 0.274
4.218GlyGly: 4.218 ± 0.339
1.311GlyHis: 1.311 ± 0.194
4.044GlyIle: 4.044 ± 0.258
4.852GlyLys: 4.852 ± 0.329
5.202GlyLeu: 5.202 ± 0.316
1.814GlyMet: 1.814 ± 0.231
3.388GlyAsn: 3.388 ± 0.315
0.962GlyPro: 0.962 ± 0.199
1.814GlyGln: 1.814 ± 0.199
2.535GlyArg: 2.535 ± 0.262
3.694GlySer: 3.694 ± 0.294
3.956GlyThr: 3.956 ± 0.508
5.049GlyVal: 5.049 ± 0.341
1.224GlyTrp: 1.224 ± 0.165
3.606GlyTyr: 3.606 ± 0.258
0.0GlyXaa: 0.0 ± 0.0
His
1.071HisAla: 1.071 ± 0.134
0.393HisCys: 0.393 ± 0.095
1.027HisAsp: 1.027 ± 0.137
1.071HisGlu: 1.071 ± 0.167
0.984HisPhe: 0.984 ± 0.152
1.377HisGly: 1.377 ± 0.192
0.306HisHis: 0.306 ± 0.074
1.224HisIle: 1.224 ± 0.169
1.202HisLys: 1.202 ± 0.168
1.53HisLeu: 1.53 ± 0.178
0.634HisMet: 0.634 ± 0.136
1.027HisAsn: 1.027 ± 0.158
0.809HisPro: 0.809 ± 0.127
0.525HisGln: 0.525 ± 0.119
0.852HisArg: 0.852 ± 0.144
0.918HisSer: 0.918 ± 0.13
0.918HisThr: 0.918 ± 0.161
1.464HisVal: 1.464 ± 0.176
0.35HisTrp: 0.35 ± 0.091
1.18HisTyr: 1.18 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.612IleAla: 4.612 ± 0.376
1.093IleCys: 1.093 ± 0.19
4.0IleAsp: 4.0 ± 0.283
4.568IleGlu: 4.568 ± 0.297
2.382IlePhe: 2.382 ± 0.246
3.606IleGly: 3.606 ± 0.272
0.962IleHis: 0.962 ± 0.138
3.912IleIle: 3.912 ± 0.331
4.284IleLys: 4.284 ± 0.285
4.896IleLeu: 4.896 ± 0.349
1.639IleMet: 1.639 ± 0.187
3.978IleAsn: 3.978 ± 0.288
2.885IlePro: 2.885 ± 0.328
2.033IleGln: 2.033 ± 0.206
2.776IleArg: 2.776 ± 0.252
3.344IleSer: 3.344 ± 0.263
4.437IleThr: 4.437 ± 0.341
4.524IleVal: 4.524 ± 0.315
0.546IleTrp: 0.546 ± 0.096
2.404IleTyr: 2.404 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
5.377LysAla: 5.377 ± 0.399
0.634LysCys: 0.634 ± 0.117
4.218LysAsp: 4.218 ± 0.344
6.033LysGlu: 6.033 ± 0.459
2.361LysPhe: 2.361 ± 0.222
4.415LysGly: 4.415 ± 0.393
1.617LysHis: 1.617 ± 0.205
4.218LysIle: 4.218 ± 0.294
4.393LysLys: 4.393 ± 0.328
4.524LysLeu: 4.524 ± 0.367
2.929LysMet: 2.929 ± 0.273
3.41LysAsn: 3.41 ± 0.243
2.12LysPro: 2.12 ± 0.251
2.623LysGln: 2.623 ± 0.248
2.82LysArg: 2.82 ± 0.252
3.563LysSer: 3.563 ± 0.303
4.044LysThr: 4.044 ± 0.33
4.721LysVal: 4.721 ± 0.35
0.962LysTrp: 0.962 ± 0.143
2.645LysTyr: 2.645 ± 0.279
0.0LysXaa: 0.0 ± 0.0
Leu
6.273LeuAla: 6.273 ± 0.395
1.158LeuCys: 1.158 ± 0.17
5.443LeuAsp: 5.443 ± 0.341
6.251LeuGlu: 6.251 ± 0.36
3.191LeuPhe: 3.191 ± 0.255
4.699LeuGly: 4.699 ± 0.31
1.399LeuHis: 1.399 ± 0.185
4.437LeuIle: 4.437 ± 0.359
5.792LeuLys: 5.792 ± 0.319
5.902LeuLeu: 5.902 ± 0.366
2.339LeuMet: 2.339 ± 0.214
4.109LeuAsn: 4.109 ± 0.299
3.257LeuPro: 3.257 ± 0.303
2.361LeuGln: 2.361 ± 0.238
3.41LeuArg: 3.41 ± 0.282
5.399LeuSer: 5.399 ± 0.335
4.94LeuThr: 4.94 ± 0.357
4.962LeuVal: 4.962 ± 0.312
0.984LeuTrp: 0.984 ± 0.169
3.104LeuTyr: 3.104 ± 0.253
0.0LeuXaa: 0.0 ± 0.0
Met
2.208MetAla: 2.208 ± 0.221
0.328MetCys: 0.328 ± 0.079
1.661MetAsp: 1.661 ± 0.196
1.552MetGlu: 1.552 ± 0.23
1.093MetPhe: 1.093 ± 0.144
1.705MetGly: 1.705 ± 0.181
0.699MetHis: 0.699 ± 0.138
2.12MetIle: 2.12 ± 0.225
2.229MetLys: 2.229 ± 0.245
2.208MetLeu: 2.208 ± 0.243
0.699MetMet: 0.699 ± 0.124
0.874MetAsn: 0.874 ± 0.162
0.809MetPro: 0.809 ± 0.138
1.202MetGln: 1.202 ± 0.198
1.508MetArg: 1.508 ± 0.185
2.339MetSer: 2.339 ± 0.269
1.945MetThr: 1.945 ± 0.244
1.967MetVal: 1.967 ± 0.202
0.284MetTrp: 0.284 ± 0.092
1.049MetTyr: 1.049 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
3.563AsnAla: 3.563 ± 0.254
0.568AsnCys: 0.568 ± 0.111
2.667AsnAsp: 2.667 ± 0.221
3.038AsnGlu: 3.038 ± 0.264
2.098AsnPhe: 2.098 ± 0.207
4.459AsnGly: 4.459 ± 0.361
0.874AsnHis: 0.874 ± 0.156
4.0AsnIle: 4.0 ± 0.36
3.803AsnLys: 3.803 ± 0.262
4.24AsnLeu: 4.24 ± 0.341
1.486AsnMet: 1.486 ± 0.165
3.213AsnAsn: 3.213 ± 0.34
2.951AsnPro: 2.951 ± 0.272
1.421AsnGln: 1.421 ± 0.202
2.098AsnArg: 2.098 ± 0.208
3.388AsnSer: 3.388 ± 0.289
3.453AsnThr: 3.453 ± 0.311
3.279AsnVal: 3.279 ± 0.31
0.94AsnTrp: 0.94 ± 0.156
2.448AsnTyr: 2.448 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
2.055ProAla: 2.055 ± 0.204
0.765ProCys: 0.765 ± 0.127
2.295ProAsp: 2.295 ± 0.225
3.169ProGlu: 3.169 ± 0.253
1.989ProPhe: 1.989 ± 0.249
1.945ProGly: 1.945 ± 0.259
0.678ProHis: 0.678 ± 0.12
1.792ProIle: 1.792 ± 0.199
2.055ProLys: 2.055 ± 0.202
2.382ProLeu: 2.382 ± 0.23
0.699ProMet: 0.699 ± 0.141
2.251ProAsn: 2.251 ± 0.274
1.005ProPro: 1.005 ± 0.153
1.005ProGln: 1.005 ± 0.165
1.268ProArg: 1.268 ± 0.184
2.361ProSer: 2.361 ± 0.224
2.47ProThr: 2.47 ± 0.249
2.667ProVal: 2.667 ± 0.223
0.546ProTrp: 0.546 ± 0.139
1.705ProTyr: 1.705 ± 0.206
0.0ProXaa: 0.0 ± 0.0
Gln
2.448GlnAla: 2.448 ± 0.21
0.459GlnCys: 0.459 ± 0.099
1.967GlnAsp: 1.967 ± 0.229
2.973GlnGlu: 2.973 ± 0.216
1.224GlnPhe: 1.224 ± 0.14
1.814GlnGly: 1.814 ± 0.225
0.612GlnHis: 0.612 ± 0.117
2.667GlnIle: 2.667 ± 0.22
1.443GlnLys: 1.443 ± 0.176
2.186GlnLeu: 2.186 ± 0.209
0.984GlnMet: 0.984 ± 0.152
1.727GlnAsn: 1.727 ± 0.21
1.115GlnPro: 1.115 ± 0.134
0.984GlnGln: 0.984 ± 0.149
1.158GlnArg: 1.158 ± 0.162
1.464GlnSer: 1.464 ± 0.159
1.792GlnThr: 1.792 ± 0.189
2.011GlnVal: 2.011 ± 0.235
0.59GlnTrp: 0.59 ± 0.115
1.421GlnTyr: 1.421 ± 0.175
0.0GlnXaa: 0.0 ± 0.0
Arg
2.426ArgAla: 2.426 ± 0.22
0.765ArgCys: 0.765 ± 0.129
2.929ArgAsp: 2.929 ± 0.265
2.841ArgGlu: 2.841 ± 0.362
1.705ArgPhe: 1.705 ± 0.188
2.667ArgGly: 2.667 ± 0.258
0.568ArgHis: 0.568 ± 0.131
2.71ArgIle: 2.71 ± 0.234
3.104ArgLys: 3.104 ± 0.245
3.475ArgLeu: 3.475 ± 0.304
1.464ArgMet: 1.464 ± 0.216
2.426ArgAsn: 2.426 ± 0.24
1.137ArgPro: 1.137 ± 0.175
1.464ArgGln: 1.464 ± 0.181
1.945ArgArg: 1.945 ± 0.256
2.841ArgSer: 2.841 ± 0.227
1.77ArgThr: 1.77 ± 0.187
2.514ArgVal: 2.514 ± 0.18
0.437ArgTrp: 0.437 ± 0.107
1.311ArgTyr: 1.311 ± 0.179
0.0ArgXaa: 0.0 ± 0.0
Ser
4.065SerAla: 4.065 ± 0.305
0.918SerCys: 0.918 ± 0.157
3.453SerAsp: 3.453 ± 0.277
3.694SerGlu: 3.694 ± 0.261
3.06SerPhe: 3.06 ± 0.268
4.065SerGly: 4.065 ± 0.356
1.115SerHis: 1.115 ± 0.161
3.388SerIle: 3.388 ± 0.278
3.978SerLys: 3.978 ± 0.296
5.268SerLeu: 5.268 ± 0.366
1.29SerMet: 1.29 ± 0.142
3.344SerAsn: 3.344 ± 0.284
2.295SerPro: 2.295 ± 0.233
1.705SerGln: 1.705 ± 0.219
2.601SerArg: 2.601 ± 0.276
3.497SerSer: 3.497 ± 0.345
3.3SerThr: 3.3 ± 0.324
4.087SerVal: 4.087 ± 0.272
0.831SerTrp: 0.831 ± 0.124
2.535SerTyr: 2.535 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
3.978ThrAla: 3.978 ± 0.386
0.743ThrCys: 0.743 ± 0.138
2.994ThrAsp: 2.994 ± 0.235
4.262ThrGlu: 4.262 ± 0.291
3.147ThrPhe: 3.147 ± 0.316
4.568ThrGly: 4.568 ± 0.403
1.049ThrHis: 1.049 ± 0.151
3.847ThrIle: 3.847 ± 0.307
4.131ThrLys: 4.131 ± 0.316
5.443ThrLeu: 5.443 ± 0.397
1.093ThrMet: 1.093 ± 0.139
2.907ThrAsn: 2.907 ± 0.261
3.06ThrPro: 3.06 ± 0.242
1.617ThrGln: 1.617 ± 0.18
1.858ThrArg: 1.858 ± 0.189
3.453ThrSer: 3.453 ± 0.315
3.978ThrThr: 3.978 ± 0.38
4.459ThrVal: 4.459 ± 0.374
0.984ThrTrp: 0.984 ± 0.154
2.535ThrTyr: 2.535 ± 0.224
0.0ThrXaa: 0.0 ± 0.0
Val
4.481ValAla: 4.481 ± 0.336
1.158ValCys: 1.158 ± 0.171
4.546ValAsp: 4.546 ± 0.331
5.574ValGlu: 5.574 ± 0.376
3.126ValPhe: 3.126 ± 0.309
4.415ValGly: 4.415 ± 0.441
1.18ValHis: 1.18 ± 0.169
4.809ValIle: 4.809 ± 0.294
4.918ValLys: 4.918 ± 0.372
5.617ValLeu: 5.617 ± 0.376
1.77ValMet: 1.77 ± 0.161
3.453ValAsn: 3.453 ± 0.278
1.923ValPro: 1.923 ± 0.195
1.661ValGln: 1.661 ± 0.202
2.47ValArg: 2.47 ± 0.218
4.24ValSer: 4.24 ± 0.357
4.24ValThr: 4.24 ± 0.307
6.186ValVal: 6.186 ± 0.551
0.874ValTrp: 0.874 ± 0.129
3.366ValTyr: 3.366 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.787TrpAla: 0.787 ± 0.125
0.219TrpCys: 0.219 ± 0.057
1.377TrpAsp: 1.377 ± 0.203
1.29TrpGlu: 1.29 ± 0.187
0.984TrpPhe: 0.984 ± 0.169
0.721TrpGly: 0.721 ± 0.129
0.393TrpHis: 0.393 ± 0.085
1.049TrpIle: 1.049 ± 0.144
0.874TrpLys: 0.874 ± 0.162
1.53TrpLeu: 1.53 ± 0.169
0.503TrpMet: 0.503 ± 0.11
0.918TrpAsn: 0.918 ± 0.163
0.306TrpPro: 0.306 ± 0.091
0.35TrpGln: 0.35 ± 0.102
0.656TrpArg: 0.656 ± 0.12
0.831TrpSer: 0.831 ± 0.144
0.809TrpThr: 0.809 ± 0.128
0.962TrpVal: 0.962 ± 0.136
0.35TrpTrp: 0.35 ± 0.084
0.634TrpTyr: 0.634 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.147TyrAla: 3.147 ± 0.24
0.678TyrCys: 0.678 ± 0.131
3.213TyrAsp: 3.213 ± 0.297
3.257TyrGlu: 3.257 ± 0.291
1.814TyrPhe: 1.814 ± 0.201
2.754TyrGly: 2.754 ± 0.246
1.071TyrHis: 1.071 ± 0.132
2.514TyrIle: 2.514 ± 0.226
2.426TyrLys: 2.426 ± 0.246
3.912TyrLeu: 3.912 ± 0.276
1.093TyrMet: 1.093 ± 0.176
2.645TyrAsn: 2.645 ± 0.256
1.989TyrPro: 1.989 ± 0.199
1.53TyrGln: 1.53 ± 0.169
1.967TyrArg: 1.967 ± 0.2
2.951TyrSer: 2.951 ± 0.265
2.754TyrThr: 2.754 ± 0.232
2.798TyrVal: 2.798 ± 0.258
0.612TyrTrp: 0.612 ± 0.13
2.011TyrTyr: 2.011 ± 0.177
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 284 proteins (45752 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
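A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's implementation: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation of the replicate estimates.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    protein (not the single residue) is the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the error is the standard deviation over replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLAAG", "MAALKE", "MGLLKAA", "MEEKLA"]
    print(dipeptide_permille(toy_proteome, "AA"),
          bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.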

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski