Amino acid dipepetide frequency for Cellulophaga phage phi4:1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.401AlaAla: 2.401 ± 0.263
0.507AlaCys: 0.507 ± 0.103
2.688AlaAsp: 2.688 ± 0.246
4.009AlaGlu: 4.009 ± 0.375
2.291AlaPhe: 2.291 ± 0.231
3.062AlaGly: 3.062 ± 0.319
0.573AlaHis: 0.573 ± 0.108
4.098AlaIle: 4.098 ± 0.321
4.604AlaLys: 4.604 ± 0.435
4.979AlaLeu: 4.979 ± 0.394
0.903AlaMet: 0.903 ± 0.125
3.062AlaAsn: 3.062 ± 0.3
1.63AlaPro: 1.63 ± 0.229
2.203AlaGln: 2.203 ± 0.238
1.806AlaArg: 1.806 ± 0.224
3.15AlaSer: 3.15 ± 0.293
3.15AlaThr: 3.15 ± 0.408
2.489AlaVal: 2.489 ± 0.242
0.485AlaTrp: 0.485 ± 0.105
2.335AlaTyr: 2.335 ± 0.27
0.0AlaXaa: 0.0 ± 0.0
Cys
0.308CysAla: 0.308 ± 0.082
0.154CysCys: 0.154 ± 0.059
1.013CysAsp: 1.013 ± 0.169
0.595CysGlu: 0.595 ± 0.127
0.33CysPhe: 0.33 ± 0.083
0.661CysGly: 0.661 ± 0.137
0.154CysHis: 0.154 ± 0.06
0.595CysIle: 0.595 ± 0.138
0.991CysLys: 0.991 ± 0.175
0.881CysLeu: 0.881 ± 0.162
0.176CysMet: 0.176 ± 0.062
0.749CysAsn: 0.749 ± 0.141
0.308CysPro: 0.308 ± 0.094
0.397CysGln: 0.397 ± 0.087
0.529CysArg: 0.529 ± 0.12
0.661CysSer: 0.661 ± 0.149
0.969CysThr: 0.969 ± 0.178
0.507CysVal: 0.507 ± 0.108
0.154CysTrp: 0.154 ± 0.053
0.242CysTyr: 0.242 ± 0.074
0.0CysXaa: 0.0 ± 0.0
Asp
3.415AspAla: 3.415 ± 0.315
0.991AspCys: 0.991 ± 0.18
2.754AspAsp: 2.754 ± 0.276
3.811AspGlu: 3.811 ± 0.347
3.282AspPhe: 3.282 ± 0.242
3.481AspGly: 3.481 ± 0.378
0.551AspHis: 0.551 ± 0.113
5.243AspIle: 5.243 ± 0.32
6.08AspLys: 6.08 ± 0.353
5.265AspLeu: 5.265 ± 0.376
1.498AspMet: 1.498 ± 0.199
4.164AspAsn: 4.164 ± 0.418
1.939AspPro: 1.939 ± 0.237
1.234AspGln: 1.234 ± 0.209
1.983AspArg: 1.983 ± 0.205
4.714AspSer: 4.714 ± 0.354
3.547AspThr: 3.547 ± 0.266
3.062AspVal: 3.062 ± 0.236
1.079AspTrp: 1.079 ± 0.157
3.172AspTyr: 3.172 ± 0.245
0.0AspXaa: 0.0 ± 0.0
Glu
4.56GluAla: 4.56 ± 0.39
0.639GluCys: 0.639 ± 0.131
6.014GluAsp: 6.014 ± 0.38
8.415GluGlu: 8.415 ± 0.572
3.15GluPhe: 3.15 ± 0.284
5.221GluGly: 5.221 ± 0.334
0.925GluHis: 0.925 ± 0.145
5.816GluIle: 5.816 ± 0.404
6.697GluLys: 6.697 ± 0.599
6.895GluLeu: 6.895 ± 0.586
1.939GluMet: 1.939 ± 0.223
5.287GluAsn: 5.287 ± 0.338
1.388GluPro: 1.388 ± 0.191
2.622GluGln: 2.622 ± 0.246
2.577GluArg: 2.577 ± 0.324
4.648GluSer: 4.648 ± 0.355
4.538GluThr: 4.538 ± 0.351
5.882GluVal: 5.882 ± 0.344
0.859GluTrp: 0.859 ± 0.127
3.877GluTyr: 3.877 ± 0.361
0.0GluXaa: 0.0 ± 0.0
Phe
1.895PheAla: 1.895 ± 0.248
0.595PheCys: 0.595 ± 0.121
3.128PheAsp: 3.128 ± 0.233
3.437PheGlu: 3.437 ± 0.253
2.005PhePhe: 2.005 ± 0.193
2.754PheGly: 2.754 ± 0.245
0.595PheHis: 0.595 ± 0.107
2.996PheIle: 2.996 ± 0.287
4.384PheLys: 4.384 ± 0.376
3.811PheLeu: 3.811 ± 0.286
1.124PheMet: 1.124 ± 0.172
3.238PheAsn: 3.238 ± 0.257
1.234PhePro: 1.234 ± 0.158
1.168PheGln: 1.168 ± 0.199
1.63PheArg: 1.63 ± 0.216
3.172PheSer: 3.172 ± 0.271
3.327PheThr: 3.327 ± 0.312
2.071PheVal: 2.071 ± 0.205
0.441PheTrp: 0.441 ± 0.086
2.357PheTyr: 2.357 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
2.952GlyAla: 2.952 ± 0.34
0.683GlyCys: 0.683 ± 0.147
3.547GlyAsp: 3.547 ± 0.313
4.538GlyGlu: 4.538 ± 0.424
3.282GlyPhe: 3.282 ± 0.279
3.811GlyGly: 3.811 ± 0.371
1.146GlyHis: 1.146 ± 0.246
4.714GlyIle: 4.714 ± 0.309
5.463GlyLys: 5.463 ± 0.403
4.252GlyLeu: 4.252 ± 0.331
0.969GlyMet: 0.969 ± 0.116
3.481GlyAsn: 3.481 ± 0.314
0.859GlyPro: 0.859 ± 0.137
1.454GlyGln: 1.454 ± 0.21
2.203GlyArg: 2.203 ± 0.227
4.538GlySer: 4.538 ± 0.467
4.296GlyThr: 4.296 ± 0.47
4.252GlyVal: 4.252 ± 0.363
0.639GlyTrp: 0.639 ± 0.138
2.952GlyTyr: 2.952 ± 0.25
0.0GlyXaa: 0.0 ± 0.0
His
0.507HisAla: 0.507 ± 0.12
0.264HisCys: 0.264 ± 0.076
0.617HisAsp: 0.617 ± 0.148
0.793HisGlu: 0.793 ± 0.154
0.661HisPhe: 0.661 ± 0.119
0.837HisGly: 0.837 ± 0.164
0.22HisHis: 0.22 ± 0.079
1.079HisIle: 1.079 ± 0.163
0.969HisLys: 0.969 ± 0.167
1.63HisLeu: 1.63 ± 0.183
0.397HisMet: 0.397 ± 0.111
0.925HisAsn: 0.925 ± 0.164
0.573HisPro: 0.573 ± 0.116
0.441HisGln: 0.441 ± 0.11
0.771HisArg: 0.771 ± 0.115
1.101HisSer: 1.101 ± 0.191
0.793HisThr: 0.793 ± 0.101
0.485HisVal: 0.485 ± 0.102
0.264HisTrp: 0.264 ± 0.085
0.793HisTyr: 0.793 ± 0.145
0.0HisXaa: 0.0 ± 0.0
Ile
4.274IleAla: 4.274 ± 0.33
0.705IleCys: 0.705 ± 0.13
5.133IleAsp: 5.133 ± 0.316
6.631IleGlu: 6.631 ± 0.375
3.194IlePhe: 3.194 ± 0.249
4.406IleGly: 4.406 ± 0.336
0.859IleHis: 0.859 ± 0.155
4.538IleIle: 4.538 ± 0.302
7.358IleLys: 7.358 ± 0.455
6.168IleLeu: 6.168 ± 0.421
1.124IleMet: 1.124 ± 0.155
4.714IleAsn: 4.714 ± 0.325
2.489IlePro: 2.489 ± 0.243
2.181IleGln: 2.181 ± 0.229
2.379IleArg: 2.379 ± 0.198
5.265IleSer: 5.265 ± 0.362
4.758IleThr: 4.758 ± 0.348
3.899IleVal: 3.899 ± 0.341
0.837IleTrp: 0.837 ± 0.148
2.357IleTyr: 2.357 ± 0.285
0.0IleXaa: 0.0 ± 0.0
Lys
4.847LysAla: 4.847 ± 0.363
0.661LysCys: 0.661 ± 0.138
5.926LysAsp: 5.926 ± 0.414
9.429LysGlu: 9.429 ± 0.691
3.459LysPhe: 3.459 ± 0.275
5.309LysGly: 5.309 ± 0.392
1.542LysHis: 1.542 ± 0.208
6.741LysIle: 6.741 ± 0.476
8.173LysLys: 8.173 ± 0.591
6.807LysLeu: 6.807 ± 0.453
2.467LysMet: 2.467 ± 0.269
6.014LysAsn: 6.014 ± 0.394
2.754LysPro: 2.754 ± 0.271
2.577LysGln: 2.577 ± 0.253
3.437LysArg: 3.437 ± 0.323
4.935LysSer: 4.935 ± 0.292
5.816LysThr: 5.816 ± 0.398
5.64LysVal: 5.64 ± 0.357
0.925LysTrp: 0.925 ± 0.163
4.23LysTyr: 4.23 ± 0.366
0.0LysXaa: 0.0 ± 0.0
Leu
4.23LeuAla: 4.23 ± 0.296
0.529LeuCys: 0.529 ± 0.124
5.089LeuAsp: 5.089 ± 0.274
7.27LeuGlu: 7.27 ± 0.474
3.194LeuPhe: 3.194 ± 0.345
5.463LeuGly: 5.463 ± 0.38
1.212LeuHis: 1.212 ± 0.162
5.772LeuIle: 5.772 ± 0.415
8.746LeuLys: 8.746 ± 0.519
5.684LeuLeu: 5.684 ± 0.384
1.652LeuMet: 1.652 ± 0.22
5.552LeuAsn: 5.552 ± 0.414
2.952LeuPro: 2.952 ± 0.275
2.754LeuGln: 2.754 ± 0.239
3.084LeuArg: 3.084 ± 0.251
6.367LeuSer: 6.367 ± 0.414
5.441LeuThr: 5.441 ± 0.369
4.098LeuVal: 4.098 ± 0.308
0.595LeuTrp: 0.595 ± 0.122
3.062LeuTyr: 3.062 ± 0.283
0.0LeuXaa: 0.0 ± 0.0
Met
1.366MetAla: 1.366 ± 0.14
0.176MetCys: 0.176 ± 0.059
1.344MetAsp: 1.344 ± 0.176
1.498MetGlu: 1.498 ± 0.188
0.793MetPhe: 0.793 ± 0.137
0.991MetGly: 0.991 ± 0.139
0.286MetHis: 0.286 ± 0.079
1.035MetIle: 1.035 ± 0.209
2.159MetLys: 2.159 ± 0.254
1.784MetLeu: 1.784 ± 0.251
0.176MetMet: 0.176 ± 0.059
1.101MetAsn: 1.101 ± 0.157
0.639MetPro: 0.639 ± 0.108
0.771MetGln: 0.771 ± 0.118
0.639MetArg: 0.639 ± 0.103
1.542MetSer: 1.542 ± 0.196
1.3MetThr: 1.3 ± 0.181
1.146MetVal: 1.146 ± 0.162
0.132MetTrp: 0.132 ± 0.053
0.793MetTyr: 0.793 ± 0.131
0.0MetXaa: 0.0 ± 0.0
Asn
2.864AsnAla: 2.864 ± 0.27
0.947AsnCys: 0.947 ± 0.179
2.952AsnAsp: 2.952 ± 0.258
3.745AsnGlu: 3.745 ± 0.306
3.26AsnPhe: 3.26 ± 0.335
3.943AsnGly: 3.943 ± 0.327
1.079AsnHis: 1.079 ± 0.137
5.089AsnIle: 5.089 ± 0.413
5.948AsnLys: 5.948 ± 0.549
5.221AsnLeu: 5.221 ± 0.34
1.124AsnMet: 1.124 ± 0.122
4.847AsnAsn: 4.847 ± 0.512
2.335AsnPro: 2.335 ± 0.189
2.137AsnGln: 2.137 ± 0.238
2.247AsnArg: 2.247 ± 0.254
4.913AsnSer: 4.913 ± 0.421
4.626AsnThr: 4.626 ± 0.406
3.701AsnVal: 3.701 ± 0.425
1.146AsnTrp: 1.146 ± 0.16
3.635AsnTyr: 3.635 ± 0.354
0.0AsnXaa: 0.0 ± 0.0
Pro
1.234ProAla: 1.234 ± 0.183
0.242ProCys: 0.242 ± 0.079
1.851ProAsp: 1.851 ± 0.168
2.864ProGlu: 2.864 ± 0.282
1.432ProPhe: 1.432 ± 0.202
0.991ProGly: 0.991 ± 0.183
0.441ProHis: 0.441 ± 0.093
2.181ProIle: 2.181 ± 0.229
2.644ProLys: 2.644 ± 0.258
2.577ProLeu: 2.577 ± 0.266
0.529ProMet: 0.529 ± 0.106
2.159ProAsn: 2.159 ± 0.211
0.573ProPro: 0.573 ± 0.149
0.925ProGln: 0.925 ± 0.148
0.661ProArg: 0.661 ± 0.115
2.093ProSer: 2.093 ± 0.227
2.159ProThr: 2.159 ± 0.224
2.093ProVal: 2.093 ± 0.238
0.308ProTrp: 0.308 ± 0.084
1.212ProTyr: 1.212 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
1.983GlnAla: 1.983 ± 0.239
0.154GlnCys: 0.154 ± 0.058
1.74GlnAsp: 1.74 ± 0.201
2.577GlnGlu: 2.577 ± 0.26
0.925GlnPhe: 0.925 ± 0.159
2.115GlnGly: 2.115 ± 0.226
0.463GlnHis: 0.463 ± 0.102
2.225GlnIle: 2.225 ± 0.268
2.622GlnLys: 2.622 ± 0.328
2.644GlnLeu: 2.644 ± 0.25
0.617GlnMet: 0.617 ± 0.118
1.851GlnAsn: 1.851 ± 0.192
0.661GlnPro: 0.661 ± 0.172
1.035GlnGln: 1.035 ± 0.15
1.124GlnArg: 1.124 ± 0.197
2.203GlnSer: 2.203 ± 0.204
1.939GlnThr: 1.939 ± 0.264
1.873GlnVal: 1.873 ± 0.208
0.485GlnTrp: 0.485 ± 0.088
1.476GlnTyr: 1.476 ± 0.176
0.0GlnXaa: 0.0 ± 0.0
Arg
1.454ArgAla: 1.454 ± 0.211
0.286ArgCys: 0.286 ± 0.079
2.379ArgAsp: 2.379 ± 0.259
2.996ArgGlu: 2.996 ± 0.262
1.542ArgPhe: 1.542 ± 0.173
1.917ArgGly: 1.917 ± 0.2
0.551ArgHis: 0.551 ± 0.088
2.401ArgIle: 2.401 ± 0.207
3.172ArgLys: 3.172 ± 0.336
2.908ArgLeu: 2.908 ± 0.243
0.815ArgMet: 0.815 ± 0.177
2.335ArgAsn: 2.335 ± 0.222
1.057ArgPro: 1.057 ± 0.146
1.3ArgGln: 1.3 ± 0.203
1.256ArgArg: 1.256 ± 0.165
1.762ArgSer: 1.762 ± 0.261
2.181ArgThr: 2.181 ± 0.304
2.555ArgVal: 2.555 ± 0.229
0.397ArgTrp: 0.397 ± 0.114
1.895ArgTyr: 1.895 ± 0.218
0.0ArgXaa: 0.0 ± 0.0
Ser
3.216SerAla: 3.216 ± 0.383
0.859SerCys: 0.859 ± 0.16
3.811SerAsp: 3.811 ± 0.323
5.023SerGlu: 5.023 ± 0.355
3.657SerPhe: 3.657 ± 0.259
4.736SerGly: 4.736 ± 0.535
0.837SerHis: 0.837 ± 0.147
5.463SerIle: 5.463 ± 0.387
7.028SerLys: 7.028 ± 0.444
5.86SerLeu: 5.86 ± 0.382
1.212SerMet: 1.212 ± 0.181
4.362SerAsn: 4.362 ± 0.426
1.961SerPro: 1.961 ± 0.174
2.181SerGln: 2.181 ± 0.264
2.379SerArg: 2.379 ± 0.215
4.78SerSer: 4.78 ± 0.454
4.758SerThr: 4.758 ± 0.448
3.459SerVal: 3.459 ± 0.246
0.771SerTrp: 0.771 ± 0.12
2.798SerTyr: 2.798 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
3.415ThrAla: 3.415 ± 0.314
0.573ThrCys: 0.573 ± 0.133
3.613ThrAsp: 3.613 ± 0.42
5.001ThrGlu: 5.001 ± 0.325
3.26ThrPhe: 3.26 ± 0.295
4.076ThrGly: 4.076 ± 0.426
0.991ThrHis: 0.991 ± 0.157
5.133ThrIle: 5.133 ± 0.406
5.089ThrLys: 5.089 ± 0.391
5.552ThrLeu: 5.552 ± 0.399
0.837ThrMet: 0.837 ± 0.133
4.296ThrAsn: 4.296 ± 0.382
2.379ThrPro: 2.379 ± 0.247
2.115ThrGln: 2.115 ± 0.296
1.917ThrArg: 1.917 ± 0.178
4.869ThrSer: 4.869 ± 0.488
4.45ThrThr: 4.45 ± 0.359
4.098ThrVal: 4.098 ± 0.317
0.551ThrTrp: 0.551 ± 0.113
2.974ThrTyr: 2.974 ± 0.263
0.0ThrXaa: 0.0 ± 0.0
Val
3.172ValAla: 3.172 ± 0.296
0.507ValCys: 0.507 ± 0.105
4.318ValAsp: 4.318 ± 0.305
5.023ValGlu: 5.023 ± 0.331
2.666ValPhe: 2.666 ± 0.275
3.503ValGly: 3.503 ± 0.339
0.727ValHis: 0.727 ± 0.17
4.428ValIle: 4.428 ± 0.325
4.56ValLys: 4.56 ± 0.302
4.692ValLeu: 4.692 ± 0.371
0.881ValMet: 0.881 ± 0.152
3.657ValAsn: 3.657 ± 0.308
1.63ValPro: 1.63 ± 0.196
1.63ValGln: 1.63 ± 0.169
2.027ValArg: 2.027 ± 0.188
4.098ValSer: 4.098 ± 0.257
3.525ValThr: 3.525 ± 0.353
4.098ValVal: 4.098 ± 0.434
0.33ValTrp: 0.33 ± 0.082
2.357ValTyr: 2.357 ± 0.233
0.0ValXaa: 0.0 ± 0.0
Trp
0.639TrpAla: 0.639 ± 0.103
0.154TrpCys: 0.154 ± 0.054
0.969TrpAsp: 0.969 ± 0.134
1.035TrpGlu: 1.035 ± 0.162
0.551TrpPhe: 0.551 ± 0.142
0.551TrpGly: 0.551 ± 0.112
0.242TrpHis: 0.242 ± 0.068
0.727TrpIle: 0.727 ± 0.134
0.749TrpLys: 0.749 ± 0.14
1.079TrpLeu: 1.079 ± 0.181
0.352TrpMet: 0.352 ± 0.092
0.727TrpAsn: 0.727 ± 0.137
0.132TrpPro: 0.132 ± 0.056
0.308TrpGln: 0.308 ± 0.08
0.485TrpArg: 0.485 ± 0.105
0.661TrpSer: 0.661 ± 0.129
0.661TrpThr: 0.661 ± 0.129
0.507TrpVal: 0.507 ± 0.097
0.132TrpTrp: 0.132 ± 0.048
0.529TrpTyr: 0.529 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.696TyrAla: 1.696 ± 0.176
0.749TyrCys: 0.749 ± 0.141
2.401TyrAsp: 2.401 ± 0.266
3.26TyrGlu: 3.26 ± 0.292
2.423TyrPhe: 2.423 ± 0.243
2.203TyrGly: 2.203 ± 0.215
0.727TyrHis: 0.727 ± 0.143
2.996TyrIle: 2.996 ± 0.283
4.252TyrLys: 4.252 ± 0.286
4.053TyrLeu: 4.053 ± 0.306
0.859TyrMet: 0.859 ± 0.122
3.238TyrAsn: 3.238 ± 0.263
1.674TyrPro: 1.674 ± 0.211
1.278TyrGln: 1.278 ± 0.181
2.027TyrArg: 2.027 ± 0.277
3.679TyrSer: 3.679 ± 0.476
2.952TyrThr: 2.952 ± 0.289
1.917TyrVal: 1.917 ± 0.193
0.661TyrTrp: 0.661 ± 0.128
1.895TyrTyr: 1.895 ± 0.257
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 197 proteins (45394 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski