Amino acid dipeptide frequency for Pantoea phage vB_PagS_AAS21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
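To make the quantity concrete, here is a minimal Python sketch of how such frequencies could be counted. It is an illustration under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the mapping of every non-standard letter to `X`, and the normalization by the number of dipeptides counted are all choices made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAK", "AAU"]
freqs = dipeptide_per_mille(toy)
```

In the toy input, `U` (selenocysteine) is treated as Xaa, so the second sequence contributes the pairs `AA` and `AX`.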

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown residues). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
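The expectation and the colour coding described above can be sketched as follows. This assumes that "expected" means the uniform baseline of one over the number of possible dipeptides; the page does not spell out the exact threshold used for colouring, and the function names are hypothetical.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Uniform expectation in per mille: 1000 / alphabet_size**2."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the (assumed) uniform baseline."""
    if observed_per_mille > expected_per_mille:
        return "over"    # shown in red in the table
    if observed_per_mille < expected_per_mille:
        return "under"   # shown in blue in the table
    return "as expected"

baseline = expected_uniform_per_mille(20)   # 400 dipeptides
ala_ala = classify(8.161, baseline)         # AlaAla value from the table
trp_trp = classify(0.114, baseline)         # TrpTrp value from the table
```

With 21 symbols (Xaa included) the baseline drops to 1000/441, roughly 2.27‰.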

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
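As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are made up for this illustration.

```python
import math

def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Per mille -> percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 8.161 per mille in the table.
fraction = per_mille_to_fraction(8.161)
percent = per_mille_to_percent(8.161)
assert math.isclose(fraction, 0.008161)
assert math.isclose(percent, 0.8161)
```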

For more information, see the sequence space article on Wikipedia.

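Rows give the first residue and columns the second; each entry below shows the cell value followed by the dipeptide and its value ± error.
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa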
Ala
8.161AlaAla: 8.161 ± 1.117
0.713AlaCys: 0.713 ± 0.135
3.966AlaAsp: 3.966 ± 0.395
5.593AlaGlu: 5.593 ± 0.533
2.511AlaPhe: 2.511 ± 0.218
4.651AlaGly: 4.651 ± 0.405
1.084AlaHis: 1.084 ± 0.175
5.393AlaIle: 5.393 ± 0.429
6.534AlaLys: 6.534 ± 0.657
6.706AlaLeu: 6.706 ± 0.374
2.197AlaMet: 2.197 ± 0.226
3.881AlaAsn: 3.881 ± 0.432
2.226AlaPro: 2.226 ± 0.283
2.625AlaGln: 2.625 ± 0.3
3.367AlaArg: 3.367 ± 0.261
5.079AlaSer: 5.079 ± 0.585
4.851AlaThr: 4.851 ± 0.537
4.822AlaVal: 4.822 ± 0.443
0.999AlaTrp: 0.999 ± 0.13
2.882AlaTyr: 2.882 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
0.514CysAla: 0.514 ± 0.122
0.171CysCys: 0.171 ± 0.072
0.656CysAsp: 0.656 ± 0.125
0.628CysGlu: 0.628 ± 0.13
0.571CysPhe: 0.571 ± 0.116
0.942CysGly: 0.942 ± 0.187
0.228CysHis: 0.228 ± 0.072
0.599CysIle: 0.599 ± 0.131
0.571CysLys: 0.571 ± 0.134
1.141CysLeu: 1.141 ± 0.186
0.371CysMet: 0.371 ± 0.124
0.457CysAsn: 0.457 ± 0.09
0.399CysPro: 0.399 ± 0.129
0.485CysGln: 0.485 ± 0.147
0.599CysArg: 0.599 ± 0.136
0.685CysSer: 0.685 ± 0.158
0.656CysThr: 0.656 ± 0.131
0.628CysVal: 0.628 ± 0.143
0.171CysTrp: 0.171 ± 0.07
0.457CysTyr: 0.457 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
5.136AspAla: 5.136 ± 0.395
0.542AspCys: 0.542 ± 0.121
2.711AspAsp: 2.711 ± 0.317
3.995AspGlu: 3.995 ± 0.364
2.682AspPhe: 2.682 ± 0.269
4.423AspGly: 4.423 ± 0.444
0.885AspHis: 0.885 ± 0.149
4.994AspIle: 4.994 ± 0.339
3.424AspLys: 3.424 ± 0.382
5.393AspLeu: 5.393 ± 0.431
1.855AspMet: 1.855 ± 0.215
2.397AspAsn: 2.397 ± 0.256
2.226AspPro: 2.226 ± 0.271
1.712AspGln: 1.712 ± 0.235
2.311AspArg: 2.311 ± 0.267
3.367AspSer: 3.367 ± 0.32
3.082AspThr: 3.082 ± 0.336
3.909AspVal: 3.909 ± 0.301
0.713AspTrp: 0.713 ± 0.146
2.996AspTyr: 2.996 ± 0.341
0.0AspXaa: 0.0 ± 0.0
Glu
5.136GluAla: 5.136 ± 0.35
0.713GluCys: 0.713 ± 0.152
3.453GluAsp: 3.453 ± 0.389
5.051GluGlu: 5.051 ± 0.462
3.139GluPhe: 3.139 ± 0.289
3.281GluGly: 3.281 ± 0.27
1.427GluHis: 1.427 ± 0.211
4.508GluIle: 4.508 ± 0.399
3.995GluLys: 3.995 ± 0.319
6.791GluLeu: 6.791 ± 0.559
2.254GluMet: 2.254 ± 0.259
2.939GluAsn: 2.939 ± 0.278
1.626GluPro: 1.626 ± 0.207
2.568GluGln: 2.568 ± 0.276
2.654GluArg: 2.654 ± 0.274
3.852GluSer: 3.852 ± 0.354
4.708GluThr: 4.708 ± 0.632
4.851GluVal: 4.851 ± 0.406
0.77GluTrp: 0.77 ± 0.139
3.339GluTyr: 3.339 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.568PheAla: 2.568 ± 0.23
0.457PheCys: 0.457 ± 0.131
3.053PheAsp: 3.053 ± 0.293
2.54PheGlu: 2.54 ± 0.28
1.826PhePhe: 1.826 ± 0.216
2.768PheGly: 2.768 ± 0.283
0.97PheHis: 0.97 ± 0.178
3.11PheIle: 3.11 ± 0.267
2.768PheLys: 2.768 ± 0.293
3.339PheLeu: 3.339 ± 0.359
0.77PheMet: 0.77 ± 0.148
2.739PheAsn: 2.739 ± 0.276
1.798PhePro: 1.798 ± 0.213
1.227PheGln: 1.227 ± 0.168
1.912PheArg: 1.912 ± 0.27
3.196PheSer: 3.196 ± 0.239
2.112PheThr: 2.112 ± 0.229
2.425PheVal: 2.425 ± 0.259
0.485PheTrp: 0.485 ± 0.097
1.569PheTyr: 1.569 ± 0.271
0.0PheXaa: 0.0 ± 0.0
Gly
4.737GlyAla: 4.737 ± 0.504
0.656GlyCys: 0.656 ± 0.138
3.367GlyAsp: 3.367 ± 0.334
3.881GlyGlu: 3.881 ± 0.334
2.853GlyPhe: 2.853 ± 0.253
3.938GlyGly: 3.938 ± 0.529
1.227GlyHis: 1.227 ± 0.205
4.023GlyIle: 4.023 ± 0.319
4.937GlyLys: 4.937 ± 0.461
4.794GlyLeu: 4.794 ± 0.365
1.798GlyMet: 1.798 ± 0.192
3.31GlyAsn: 3.31 ± 0.378
1.37GlyPro: 1.37 ± 0.272
1.997GlyGln: 1.997 ± 0.25
2.911GlyArg: 2.911 ± 0.271
4.851GlySer: 4.851 ± 0.552
3.995GlyThr: 3.995 ± 0.414
4.48GlyVal: 4.48 ± 0.398
0.97GlyTrp: 0.97 ± 0.182
3.082GlyTyr: 3.082 ± 0.309
0.0GlyXaa: 0.0 ± 0.0
His
1.284HisAla: 1.284 ± 0.201
0.2HisCys: 0.2 ± 0.08
0.942HisAsp: 0.942 ± 0.181
1.141HisGlu: 1.141 ± 0.2
0.685HisPhe: 0.685 ± 0.148
0.97HisGly: 0.97 ± 0.176
0.514HisHis: 0.514 ± 0.146
1.141HisIle: 1.141 ± 0.178
1.084HisLys: 1.084 ± 0.197
1.512HisLeu: 1.512 ± 0.233
0.542HisMet: 0.542 ± 0.109
0.656HisAsn: 0.656 ± 0.135
0.942HisPro: 0.942 ± 0.192
0.885HisGln: 0.885 ± 0.156
0.799HisArg: 0.799 ± 0.149
1.484HisSer: 1.484 ± 0.217
0.885HisThr: 0.885 ± 0.161
0.999HisVal: 0.999 ± 0.164
0.171HisTrp: 0.171 ± 0.07
0.856HisTyr: 0.856 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.423IleAla: 4.423 ± 0.35
0.571IleCys: 0.571 ± 0.123
4.252IleAsp: 4.252 ± 0.425
4.109IleGlu: 4.109 ± 0.339
2.54IlePhe: 2.54 ± 0.329
4.08IleGly: 4.08 ± 0.389
1.512IleHis: 1.512 ± 0.187
4.223IleIle: 4.223 ± 0.385
4.309IleLys: 4.309 ± 0.379
6.363IleLeu: 6.363 ± 0.487
1.883IleMet: 1.883 ± 0.203
4.08IleAsn: 4.08 ± 0.356
2.54IlePro: 2.54 ± 0.256
2.368IleGln: 2.368 ± 0.229
2.911IleArg: 2.911 ± 0.238
4.052IleSer: 4.052 ± 0.362
4.508IleThr: 4.508 ± 0.329
3.738IleVal: 3.738 ± 0.352
0.913IleTrp: 0.913 ± 0.161
2.625IleTyr: 2.625 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
5.25LysAla: 5.25 ± 0.413
0.599LysCys: 0.599 ± 0.13
5.108LysAsp: 5.108 ± 0.429
5.365LysGlu: 5.365 ± 0.49
2.996LysPhe: 2.996 ± 0.357
3.082LysGly: 3.082 ± 0.305
1.056LysHis: 1.056 ± 0.189
4.08LysIle: 4.08 ± 0.392
3.681LysLys: 3.681 ± 0.354
6.192LysLeu: 6.192 ± 0.517
2.14LysMet: 2.14 ± 0.252
3.51LysAsn: 3.51 ± 0.262
2.197LysPro: 2.197 ± 0.251
3.025LysGln: 3.025 ± 0.327
2.882LysArg: 2.882 ± 0.278
4.109LysSer: 4.109 ± 0.348
4.195LysThr: 4.195 ± 0.452
4.708LysVal: 4.708 ± 0.459
0.742LysTrp: 0.742 ± 0.156
2.54LysTyr: 2.54 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
7.02LeuAla: 7.02 ± 0.426
1.084LeuCys: 1.084 ± 0.163
5.964LeuAsp: 5.964 ± 0.484
6.563LeuGlu: 6.563 ± 0.511
3.339LeuPhe: 3.339 ± 0.329
4.794LeuGly: 4.794 ± 0.361
1.598LeuHis: 1.598 ± 0.267
5.479LeuIle: 5.479 ± 0.444
5.45LeuLys: 5.45 ± 0.476
6.42LeuLeu: 6.42 ± 0.461
2.14LeuMet: 2.14 ± 0.244
5.136LeuAsn: 5.136 ± 0.451
3.624LeuPro: 3.624 ± 0.369
2.996LeuGln: 2.996 ± 0.315
4.109LeuArg: 4.109 ± 0.318
5.193LeuSer: 5.193 ± 0.337
4.765LeuThr: 4.765 ± 0.446
5.964LeuVal: 5.964 ± 0.499
0.742LeuTrp: 0.742 ± 0.159
2.825LeuTyr: 2.825 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
1.741MetAla: 1.741 ± 0.223
0.342MetCys: 0.342 ± 0.106
1.484MetAsp: 1.484 ± 0.208
1.626MetGlu: 1.626 ± 0.211
1.084MetPhe: 1.084 ± 0.179
1.598MetGly: 1.598 ± 0.173
0.685MetHis: 0.685 ± 0.128
1.598MetIle: 1.598 ± 0.252
1.997MetLys: 1.997 ± 0.25
2.397MetLeu: 2.397 ± 0.293
0.628MetMet: 0.628 ± 0.137
1.427MetAsn: 1.427 ± 0.192
0.856MetPro: 0.856 ± 0.161
1.113MetGln: 1.113 ± 0.208
1.427MetArg: 1.427 ± 0.191
2.026MetSer: 2.026 ± 0.292
1.855MetThr: 1.855 ± 0.252
1.341MetVal: 1.341 ± 0.229
0.314MetTrp: 0.314 ± 0.092
1.198MetTyr: 1.198 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
3.681AsnAla: 3.681 ± 0.523
0.542AsnCys: 0.542 ± 0.137
2.968AsnAsp: 2.968 ± 0.34
2.911AsnGlu: 2.911 ± 0.259
1.769AsnPhe: 1.769 ± 0.211
4.508AsnGly: 4.508 ± 0.406
0.628AsnHis: 0.628 ± 0.125
3.167AsnIle: 3.167 ± 0.294
3.795AsnLys: 3.795 ± 0.489
4.423AsnLeu: 4.423 ± 0.298
1.455AsnMet: 1.455 ± 0.203
3.025AsnAsn: 3.025 ± 0.292
2.311AsnPro: 2.311 ± 0.276
1.427AsnGln: 1.427 ± 0.197
2.682AsnArg: 2.682 ± 0.211
3.995AsnSer: 3.995 ± 0.294
3.595AsnThr: 3.595 ± 0.404
3.481AsnVal: 3.481 ± 0.299
0.885AsnTrp: 0.885 ± 0.205
2.226AsnTyr: 2.226 ± 0.227
0.0AsnXaa: 0.0 ± 0.0
Pro
2.625ProAla: 2.625 ± 0.317
0.342ProCys: 0.342 ± 0.101
2.055ProAsp: 2.055 ± 0.24
2.511ProGlu: 2.511 ± 0.235
1.141ProPhe: 1.141 ± 0.183
2.169ProGly: 2.169 ± 0.308
0.399ProHis: 0.399 ± 0.126
2.14ProIle: 2.14 ± 0.289
2.283ProLys: 2.283 ± 0.232
2.568ProLeu: 2.568 ± 0.283
0.656ProMet: 0.656 ± 0.128
2.14ProAsn: 2.14 ± 0.245
1.256ProPro: 1.256 ± 0.249
1.141ProGln: 1.141 ± 0.152
1.512ProArg: 1.512 ± 0.193
2.026ProSer: 2.026 ± 0.25
2.597ProThr: 2.597 ± 0.292
2.796ProVal: 2.796 ± 0.279
0.371ProTrp: 0.371 ± 0.097
1.655ProTyr: 1.655 ± 0.225
0.0ProXaa: 0.0 ± 0.0
Gln
3.082GlnAla: 3.082 ± 0.344
0.371GlnCys: 0.371 ± 0.114
1.969GlnAsp: 1.969 ± 0.23
2.568GlnGlu: 2.568 ± 0.262
1.541GlnPhe: 1.541 ± 0.23
1.883GlnGly: 1.883 ± 0.218
0.799GlnHis: 0.799 ± 0.157
2.739GlnIle: 2.739 ± 0.328
2.283GlnLys: 2.283 ± 0.292
3.167GlnLeu: 3.167 ± 0.277
0.885GlnMet: 0.885 ± 0.165
1.541GlnAsn: 1.541 ± 0.211
0.942GlnPro: 0.942 ± 0.209
1.684GlnGln: 1.684 ± 0.271
1.626GlnArg: 1.626 ± 0.167
2.197GlnSer: 2.197 ± 0.315
1.969GlnThr: 1.969 ± 0.312
2.283GlnVal: 2.283 ± 0.267
0.656GlnTrp: 0.656 ± 0.149
1.598GlnTyr: 1.598 ± 0.22
0.0GlnXaa: 0.0 ± 0.0
Arg
3.453ArgAla: 3.453 ± 0.275
0.599ArgCys: 0.599 ± 0.139
2.853ArgAsp: 2.853 ± 0.226
3.082ArgGlu: 3.082 ± 0.292
1.883ArgPhe: 1.883 ± 0.231
3.538ArgGly: 3.538 ± 0.282
0.713ArgHis: 0.713 ± 0.131
3.196ArgIle: 3.196 ± 0.327
3.367ArgLys: 3.367 ± 0.355
3.995ArgLeu: 3.995 ± 0.335
1.17ArgMet: 1.17 ± 0.184
2.425ArgAsn: 2.425 ± 0.278
0.999ArgPro: 0.999 ± 0.178
1.598ArgGln: 1.598 ± 0.236
2.311ArgArg: 2.311 ± 0.273
2.853ArgSer: 2.853 ± 0.346
2.568ArgThr: 2.568 ± 0.258
3.025ArgVal: 3.025 ± 0.278
0.514ArgTrp: 0.514 ± 0.123
1.741ArgTyr: 1.741 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
4.908SerAla: 4.908 ± 0.45
0.913SerCys: 0.913 ± 0.156
3.281SerAsp: 3.281 ± 0.258
4.908SerGlu: 4.908 ± 0.713
3.224SerPhe: 3.224 ± 0.317
4.423SerGly: 4.423 ± 0.336
0.713SerHis: 0.713 ± 0.139
3.852SerIle: 3.852 ± 0.327
4.794SerLys: 4.794 ± 0.388
5.25SerLeu: 5.25 ± 0.403
1.655SerMet: 1.655 ± 0.222
3.909SerAsn: 3.909 ± 0.278
1.769SerPro: 1.769 ± 0.214
2.397SerGln: 2.397 ± 0.311
3.31SerArg: 3.31 ± 0.293
4.28SerSer: 4.28 ± 0.401
4.052SerThr: 4.052 ± 0.347
4.594SerVal: 4.594 ± 0.411
0.913SerTrp: 0.913 ± 0.175
2.34SerTyr: 2.34 ± 0.28
0.0SerXaa: 0.0 ± 0.0
Thr
5.022ThrAla: 5.022 ± 0.525
0.685ThrCys: 0.685 ± 0.129
3.253ThrAsp: 3.253 ± 0.293
2.968ThrGlu: 2.968 ± 0.278
2.682ThrPhe: 2.682 ± 0.278
4.794ThrGly: 4.794 ± 0.335
1.227ThrHis: 1.227 ± 0.175
4.309ThrIle: 4.309 ± 0.318
4.138ThrLys: 4.138 ± 0.34
5.136ThrLeu: 5.136 ± 0.321
1.37ThrMet: 1.37 ± 0.219
3.224ThrAsn: 3.224 ± 0.499
2.939ThrPro: 2.939 ± 0.302
2.055ThrGln: 2.055 ± 0.271
2.796ThrArg: 2.796 ± 0.264
3.966ThrSer: 3.966 ± 0.435
3.624ThrThr: 3.624 ± 0.382
4.68ThrVal: 4.68 ± 0.373
0.542ThrTrp: 0.542 ± 0.1
2.055ThrTyr: 2.055 ± 0.244
0.0ThrXaa: 0.0 ± 0.0
Val
5.935ValAla: 5.935 ± 0.457
0.628ValCys: 0.628 ± 0.144
3.966ValAsp: 3.966 ± 0.401
4.651ValGlu: 4.651 ± 0.398
2.968ValPhe: 2.968 ± 0.303
3.824ValGly: 3.824 ± 0.362
1.056ValHis: 1.056 ± 0.209
4.109ValIle: 4.109 ± 0.303
4.023ValLys: 4.023 ± 0.425
5.108ValLeu: 5.108 ± 0.387
1.427ValMet: 1.427 ± 0.217
3.624ValAsn: 3.624 ± 0.258
2.654ValPro: 2.654 ± 0.221
2.226ValGln: 2.226 ± 0.22
3.167ValArg: 3.167 ± 0.316
4.651ValSer: 4.651 ± 0.483
4.166ValThr: 4.166 ± 0.291
5.022ValVal: 5.022 ± 0.36
0.942ValTrp: 0.942 ± 0.161
2.996ValTyr: 2.996 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
0.628TrpAla: 0.628 ± 0.114
0.314TrpCys: 0.314 ± 0.092
0.828TrpAsp: 0.828 ± 0.133
0.685TrpGlu: 0.685 ± 0.136
0.656TrpPhe: 0.656 ± 0.155
0.828TrpGly: 0.828 ± 0.167
0.171TrpHis: 0.171 ± 0.064
0.77TrpIle: 0.77 ± 0.173
0.828TrpLys: 0.828 ± 0.146
1.056TrpLeu: 1.056 ± 0.18
0.342TrpMet: 0.342 ± 0.11
0.856TrpAsn: 0.856 ± 0.167
0.171TrpPro: 0.171 ± 0.067
0.542TrpGln: 0.542 ± 0.131
0.457TrpArg: 0.457 ± 0.123
0.685TrpSer: 0.685 ± 0.134
0.799TrpThr: 0.799 ± 0.165
1.141TrpVal: 1.141 ± 0.182
0.114TrpTrp: 0.114 ± 0.055
0.428TrpTyr: 0.428 ± 0.109
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.939TyrAla: 2.939 ± 0.307
0.485TyrCys: 0.485 ± 0.101
2.625TyrAsp: 2.625 ± 0.241
2.283TyrGlu: 2.283 ± 0.241
1.569TyrPhe: 1.569 ± 0.244
2.682TyrGly: 2.682 ± 0.296
0.799TyrHis: 0.799 ± 0.161
2.568TyrIle: 2.568 ± 0.284
3.196TyrLys: 3.196 ± 0.278
3.453TyrLeu: 3.453 ± 0.335
1.17TyrMet: 1.17 ± 0.176
2.197TyrAsn: 2.197 ± 0.268
1.541TyrPro: 1.541 ± 0.207
1.712TyrGln: 1.712 ± 0.222
2.14TyrArg: 2.14 ± 0.241
2.939TyrSer: 2.939 ± 0.287
2.483TyrThr: 2.483 ± 0.244
2.254TyrVal: 2.254 ± 0.231
0.371TyrTrp: 0.371 ± 0.104
1.798TyrTyr: 1.798 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 213 proteins (35046 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
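One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The exact procedure used by Proteome-pI is not described on this page, so the use of the standard deviation, the fixed seed, and the function names are assumptions.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over a set of sequences."""
    total = sum(max(len(s) - 1, 0) for s in proteins)
    hits = sum(1 for s in proteins
               for i in range(len(s) - 1) if s[i:i + 2] == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAK", "AAGL", "MKLV", "AAAA"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins keeps the residues within each protein together, so the estimate reflects protein-to-protein variation.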

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski