Amino acid dipepetide frequency for Flavobacterium phage vB_FspP_elemoA_2-5C

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.643AlaAla: 2.643 ± 0.723
0.518AlaCys: 0.518 ± 0.257
2.591AlaAsp: 2.591 ± 0.333
4.094AlaGlu: 4.094 ± 0.766
3.006AlaPhe: 3.006 ± 0.516
3.058AlaGly: 3.058 ± 0.524
0.726AlaHis: 0.726 ± 0.161
5.027AlaIle: 5.027 ± 0.612
5.442AlaLys: 5.442 ± 0.964
4.146AlaLeu: 4.146 ± 0.475
0.985AlaMet: 0.985 ± 0.224
3.628AlaAsn: 3.628 ± 0.658
1.296AlaPro: 1.296 ± 0.279
1.607AlaGln: 1.607 ± 0.319
2.177AlaArg: 2.177 ± 0.324
4.042AlaSer: 4.042 ± 0.678
3.006AlaThr: 3.006 ± 0.58
3.265AlaVal: 3.265 ± 0.534
0.104AlaTrp: 0.104 ± 0.073
2.177AlaTyr: 2.177 ± 0.302
0.0AlaXaa: 0.0 ± 0.0
Cys
0.311CysAla: 0.311 ± 0.177
0.052CysCys: 0.052 ± 0.051
0.674CysAsp: 0.674 ± 0.219
0.622CysGlu: 0.622 ± 0.205
0.363CysPhe: 0.363 ± 0.15
0.415CysGly: 0.415 ± 0.173
0.259CysHis: 0.259 ± 0.138
0.57CysIle: 0.57 ± 0.206
0.674CysLys: 0.674 ± 0.234
0.777CysLeu: 0.777 ± 0.25
0.155CysMet: 0.155 ± 0.111
0.363CysAsn: 0.363 ± 0.157
0.207CysPro: 0.207 ± 0.126
0.104CysGln: 0.104 ± 0.089
0.415CysArg: 0.415 ± 0.155
0.363CysSer: 0.363 ± 0.17
0.466CysThr: 0.466 ± 0.185
0.363CysVal: 0.363 ± 0.162
0.104CysTrp: 0.104 ± 0.084
0.363CysTyr: 0.363 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
4.198AspAla: 4.198 ± 0.594
0.415AspCys: 0.415 ± 0.182
4.457AspAsp: 4.457 ± 0.474
4.198AspGlu: 4.198 ± 0.667
4.561AspPhe: 4.561 ± 0.401
4.405AspGly: 4.405 ± 0.522
0.518AspHis: 0.518 ± 0.202
5.131AspIle: 5.131 ± 0.555
7.152AspLys: 7.152 ± 0.831
5.96AspLeu: 5.96 ± 0.527
1.866AspMet: 1.866 ± 0.312
5.131AspAsn: 5.131 ± 0.601
1.244AspPro: 1.244 ± 0.241
1.451AspGln: 1.451 ± 0.289
2.125AspArg: 2.125 ± 0.321
4.768AspSer: 4.768 ± 0.62
4.042AspThr: 4.042 ± 0.458
4.042AspVal: 4.042 ± 0.377
0.829AspTrp: 0.829 ± 0.288
2.902AspTyr: 2.902 ± 0.652
0.0AspXaa: 0.0 ± 0.0
Glu
3.939GluAla: 3.939 ± 0.553
0.415GluCys: 0.415 ± 0.212
5.649GluAsp: 5.649 ± 0.541
6.167GluGlu: 6.167 ± 0.879
4.405GluPhe: 4.405 ± 0.614
4.405GluGly: 4.405 ± 0.447
0.674GluHis: 0.674 ± 0.25
6.064GluIle: 6.064 ± 0.521
5.286GluLys: 5.286 ± 0.796
6.893GluLeu: 6.893 ± 0.765
1.555GluMet: 1.555 ± 0.288
4.198GluAsn: 4.198 ± 0.685
1.244GluPro: 1.244 ± 0.265
2.799GluGln: 2.799 ± 0.495
2.28GluArg: 2.28 ± 0.352
5.131GluSer: 5.131 ± 0.558
3.11GluThr: 3.11 ± 0.497
5.235GluVal: 5.235 ± 0.462
0.881GluTrp: 0.881 ± 0.196
3.68GluTyr: 3.68 ± 0.478
0.0GluXaa: 0.0 ± 0.0
Phe
1.607PheAla: 1.607 ± 0.357
0.518PheCys: 0.518 ± 0.22
3.68PheAsp: 3.68 ± 0.357
3.783PheGlu: 3.783 ± 0.439
1.918PhePhe: 1.918 ± 0.339
2.384PheGly: 2.384 ± 0.442
0.881PheHis: 0.881 ± 0.239
3.524PheIle: 3.524 ± 0.354
4.457PheLys: 4.457 ± 0.585
3.317PheLeu: 3.317 ± 0.635
1.037PheMet: 1.037 ± 0.243
4.457PheAsn: 4.457 ± 0.502
0.829PhePro: 0.829 ± 0.187
1.192PheGln: 1.192 ± 0.245
1.347PheArg: 1.347 ± 0.316
3.524PheSer: 3.524 ± 0.345
2.695PheThr: 2.695 ± 0.546
1.918PheVal: 1.918 ± 0.292
0.415PheTrp: 0.415 ± 0.147
1.918PheTyr: 1.918 ± 0.371
0.0PheXaa: 0.0 ± 0.0
Gly
2.591GlyAla: 2.591 ± 0.519
0.363GlyCys: 0.363 ± 0.167
3.576GlyAsp: 3.576 ± 0.374
3.213GlyGlu: 3.213 ± 0.422
2.695GlyPhe: 2.695 ± 0.342
4.25GlyGly: 4.25 ± 0.468
0.622GlyHis: 0.622 ± 0.256
4.872GlyIle: 4.872 ± 0.58
4.768GlyLys: 4.768 ± 0.765
5.079GlyLeu: 5.079 ± 0.601
1.555GlyMet: 1.555 ± 0.234
3.472GlyAsn: 3.472 ± 0.421
1.037GlyPro: 1.037 ± 0.422
1.607GlyGln: 1.607 ± 0.312
2.488GlyArg: 2.488 ± 0.256
3.991GlySer: 3.991 ± 0.501
4.25GlyThr: 4.25 ± 0.704
4.25GlyVal: 4.25 ± 0.803
0.466GlyTrp: 0.466 ± 0.166
3.213GlyTyr: 3.213 ± 0.395
0.0GlyXaa: 0.0 ± 0.0
His
0.518HisAla: 0.518 ± 0.179
0.311HisCys: 0.311 ± 0.131
0.777HisAsp: 0.777 ± 0.266
0.415HisGlu: 0.415 ± 0.149
0.57HisPhe: 0.57 ± 0.205
0.466HisGly: 0.466 ± 0.181
0.466HisHis: 0.466 ± 0.222
1.088HisIle: 1.088 ± 0.285
1.503HisLys: 1.503 ± 0.437
0.674HisLeu: 0.674 ± 0.212
0.052HisMet: 0.052 ± 0.059
0.674HisAsn: 0.674 ± 0.209
0.622HisPro: 0.622 ± 0.193
0.674HisGln: 0.674 ± 0.172
0.415HisArg: 0.415 ± 0.184
0.881HisSer: 0.881 ± 0.233
0.726HisThr: 0.726 ± 0.243
0.207HisVal: 0.207 ± 0.126
0.155HisTrp: 0.155 ± 0.101
0.622HisTyr: 0.622 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
4.198IleAla: 4.198 ± 0.494
0.363IleCys: 0.363 ± 0.17
6.945IleAsp: 6.945 ± 0.491
6.167IleGlu: 6.167 ± 0.511
2.54IlePhe: 2.54 ± 0.483
3.783IleGly: 3.783 ± 0.468
1.14IleHis: 1.14 ± 0.242
5.545IleIle: 5.545 ± 0.605
7.256IleLys: 7.256 ± 0.827
4.872IleLeu: 4.872 ± 0.594
1.14IleMet: 1.14 ± 0.288
6.323IleAsn: 6.323 ± 0.588
1.762IlePro: 1.762 ± 0.297
2.54IleGln: 2.54 ± 0.384
2.384IleArg: 2.384 ± 0.388
6.323IleSer: 6.323 ± 0.686
4.716IleThr: 4.716 ± 0.469
5.183IleVal: 5.183 ± 0.747
0.57IleTrp: 0.57 ± 0.159
2.591IleTyr: 2.591 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
5.701LysAla: 5.701 ± 0.974
0.777LysCys: 0.777 ± 0.265
6.997LysAsp: 6.997 ± 0.634
8.24LysGlu: 8.24 ± 0.803
3.161LysPhe: 3.161 ± 0.406
4.768LysGly: 4.768 ± 0.512
1.037LysHis: 1.037 ± 0.238
6.893LysIle: 6.893 ± 0.87
9.173LysLys: 9.173 ± 1.146
7.308LysLeu: 7.308 ± 1.074
2.902LysMet: 2.902 ± 0.521
6.012LysAsn: 6.012 ± 0.568
2.85LysPro: 2.85 ± 0.442
2.643LysGln: 2.643 ± 0.619
3.213LysArg: 3.213 ± 0.619
5.805LysSer: 5.805 ± 0.741
5.027LysThr: 5.027 ± 0.559
6.375LysVal: 6.375 ± 0.677
0.622LysTrp: 0.622 ± 0.163
4.82LysTyr: 4.82 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
3.939LeuAla: 3.939 ± 0.444
0.674LeuCys: 0.674 ± 0.268
5.701LeuAsp: 5.701 ± 0.549
6.064LeuGlu: 6.064 ± 0.724
3.628LeuPhe: 3.628 ± 0.422
3.732LeuGly: 3.732 ± 0.365
0.881LeuHis: 0.881 ± 0.183
5.235LeuIle: 5.235 ± 0.701
7.1LeuLys: 7.1 ± 0.831
6.375LeuLeu: 6.375 ± 0.558
1.918LeuMet: 1.918 ± 0.439
6.478LeuAsn: 6.478 ± 0.527
3.11LeuPro: 3.11 ± 0.272
2.747LeuGln: 2.747 ± 0.472
1.918LeuArg: 1.918 ± 0.366
6.271LeuSer: 6.271 ± 0.576
4.405LeuThr: 4.405 ± 0.45
3.68LeuVal: 3.68 ± 0.423
0.933LeuTrp: 0.933 ± 0.252
3.317LeuTyr: 3.317 ± 0.537
0.0LeuXaa: 0.0 ± 0.0
Met
2.073MetAla: 2.073 ± 0.258
0.207MetCys: 0.207 ± 0.12
0.777MetAsp: 0.777 ± 0.158
1.658MetGlu: 1.658 ± 0.264
0.829MetPhe: 0.829 ± 0.169
0.881MetGly: 0.881 ± 0.279
0.311MetHis: 0.311 ± 0.128
1.658MetIle: 1.658 ± 0.355
2.695MetLys: 2.695 ± 0.553
1.14MetLeu: 1.14 ± 0.279
0.518MetMet: 0.518 ± 0.18
2.021MetAsn: 2.021 ± 0.36
0.415MetPro: 0.415 ± 0.128
0.985MetGln: 0.985 ± 0.221
0.933MetArg: 0.933 ± 0.308
1.555MetSer: 1.555 ± 0.315
1.192MetThr: 1.192 ± 0.289
1.192MetVal: 1.192 ± 0.215
0.207MetTrp: 0.207 ± 0.118
1.296MetTyr: 1.296 ± 0.321
0.0MetXaa: 0.0 ± 0.0
Asn
4.457AsnAla: 4.457 ± 0.787
0.57AsnCys: 0.57 ± 0.189
4.405AsnAsp: 4.405 ± 0.525
4.716AsnGlu: 4.716 ± 0.447
3.006AsnPhe: 3.006 ± 0.445
4.872AsnGly: 4.872 ± 0.465
0.777AsnHis: 0.777 ± 0.209
4.872AsnIle: 4.872 ± 0.567
7.308AsnLys: 7.308 ± 0.713
4.768AsnLeu: 4.768 ± 0.684
1.866AsnMet: 1.866 ± 0.337
6.116AsnAsn: 6.116 ± 0.526
2.28AsnPro: 2.28 ± 0.437
3.213AsnGln: 3.213 ± 0.501
3.887AsnArg: 3.887 ± 0.482
4.509AsnSer: 4.509 ± 0.487
3.369AsnThr: 3.369 ± 0.375
4.302AsnVal: 4.302 ± 0.686
0.466AsnTrp: 0.466 ± 0.145
2.643AsnTyr: 2.643 ± 0.369
0.0AsnXaa: 0.0 ± 0.0
Pro
0.726ProAla: 0.726 ± 0.241
0.104ProCys: 0.104 ± 0.071
1.814ProAsp: 1.814 ± 0.384
2.125ProGlu: 2.125 ± 0.354
1.296ProPhe: 1.296 ± 0.28
1.399ProGly: 1.399 ± 0.247
0.207ProHis: 0.207 ± 0.108
2.229ProIle: 2.229 ± 0.311
1.658ProLys: 1.658 ± 0.47
2.073ProLeu: 2.073 ± 0.368
0.622ProMet: 0.622 ± 0.18
1.918ProAsn: 1.918 ± 0.343
0.674ProPro: 0.674 ± 0.197
0.777ProGln: 0.777 ± 0.237
1.296ProArg: 1.296 ± 0.275
2.85ProSer: 2.85 ± 0.455
2.54ProThr: 2.54 ± 0.379
1.503ProVal: 1.503 ± 0.278
0.052ProTrp: 0.052 ± 0.046
0.985ProTyr: 0.985 ± 0.187
0.0ProXaa: 0.0 ± 0.0
Gln
2.384GlnAla: 2.384 ± 0.398
0.0GlnCys: 0.0 ± 0.0
1.918GlnAsp: 1.918 ± 0.404
2.799GlnGlu: 2.799 ± 0.614
0.985GlnPhe: 0.985 ± 0.22
1.814GlnGly: 1.814 ± 0.461
0.259GlnHis: 0.259 ± 0.104
2.332GlnIle: 2.332 ± 0.368
2.643GlnLys: 2.643 ± 0.558
2.021GlnLeu: 2.021 ± 0.398
1.14GlnMet: 1.14 ± 0.296
1.555GlnAsn: 1.555 ± 0.23
0.829GlnPro: 0.829 ± 0.201
0.985GlnGln: 0.985 ± 0.31
1.14GlnArg: 1.14 ± 0.352
2.695GlnSer: 2.695 ± 0.399
1.399GlnThr: 1.399 ± 0.266
2.021GlnVal: 2.021 ± 0.255
0.363GlnTrp: 0.363 ± 0.154
1.347GlnTyr: 1.347 ± 0.186
0.0GlnXaa: 0.0 ± 0.0
Arg
1.969ArgAla: 1.969 ± 0.314
0.466ArgCys: 0.466 ± 0.205
2.591ArgAsp: 2.591 ± 0.424
2.488ArgGlu: 2.488 ± 0.321
1.399ArgPhe: 1.399 ± 0.253
2.747ArgGly: 2.747 ± 0.358
0.777ArgHis: 0.777 ± 0.285
2.643ArgIle: 2.643 ± 0.392
3.369ArgLys: 3.369 ± 0.542
3.421ArgLeu: 3.421 ± 0.501
0.829ArgMet: 0.829 ± 0.29
1.762ArgAsn: 1.762 ± 0.268
0.933ArgPro: 0.933 ± 0.238
0.777ArgGln: 0.777 ± 0.187
1.503ArgArg: 1.503 ± 0.338
1.866ArgSer: 1.866 ± 0.284
1.399ArgThr: 1.399 ± 0.316
2.384ArgVal: 2.384 ± 0.276
0.415ArgTrp: 0.415 ± 0.14
1.814ArgTyr: 1.814 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
2.954SerAla: 2.954 ± 0.479
0.57SerCys: 0.57 ± 0.229
5.338SerAsp: 5.338 ± 0.514
5.494SerGlu: 5.494 ± 0.58
3.265SerPhe: 3.265 ± 0.581
5.027SerGly: 5.027 ± 0.533
0.415SerHis: 0.415 ± 0.16
5.753SerIle: 5.753 ± 0.556
7.411SerLys: 7.411 ± 0.829
4.975SerLeu: 4.975 ± 0.54
1.658SerMet: 1.658 ± 0.305
5.494SerAsn: 5.494 ± 0.431
1.866SerPro: 1.866 ± 0.379
1.607SerGln: 1.607 ± 0.187
2.384SerArg: 2.384 ± 0.387
4.975SerSer: 4.975 ± 0.594
3.421SerThr: 3.421 ± 0.381
4.82SerVal: 4.82 ± 0.588
0.622SerTrp: 0.622 ± 0.19
3.991SerTyr: 3.991 ± 0.569
0.0SerXaa: 0.0 ± 0.0
Thr
2.902ThrAla: 2.902 ± 0.478
0.259ThrCys: 0.259 ± 0.12
3.991ThrAsp: 3.991 ± 0.531
3.369ThrGlu: 3.369 ± 0.468
2.021ThrPhe: 2.021 ± 0.467
3.369ThrGly: 3.369 ± 0.428
0.674ThrHis: 0.674 ± 0.235
4.405ThrIle: 4.405 ± 0.722
5.805ThrLys: 5.805 ± 0.497
4.924ThrLeu: 4.924 ± 0.691
0.518ThrMet: 0.518 ± 0.227
3.939ThrAsn: 3.939 ± 0.618
2.85ThrPro: 2.85 ± 0.373
1.658ThrGln: 1.658 ± 0.28
1.503ThrArg: 1.503 ± 0.237
4.042ThrSer: 4.042 ± 0.446
4.872ThrThr: 4.872 ± 0.657
2.332ThrVal: 2.332 ± 0.433
0.518ThrTrp: 0.518 ± 0.141
2.799ThrTyr: 2.799 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
3.317ValAla: 3.317 ± 0.589
0.466ValCys: 0.466 ± 0.194
3.887ValAsp: 3.887 ± 0.406
4.975ValGlu: 4.975 ± 0.449
2.747ValPhe: 2.747 ± 0.347
3.524ValGly: 3.524 ± 0.383
0.518ValHis: 0.518 ± 0.208
4.664ValIle: 4.664 ± 0.622
5.856ValLys: 5.856 ± 0.544
5.131ValLeu: 5.131 ± 0.512
0.933ValMet: 0.933 ± 0.239
3.991ValAsn: 3.991 ± 0.601
1.607ValPro: 1.607 ± 0.325
1.503ValGln: 1.503 ± 0.196
2.125ValArg: 2.125 ± 0.387
4.509ValSer: 4.509 ± 0.548
2.799ValThr: 2.799 ± 0.417
3.421ValVal: 3.421 ± 0.497
0.57ValTrp: 0.57 ± 0.2
2.954ValTyr: 2.954 ± 0.529
0.0ValXaa: 0.0 ± 0.0
Trp
0.57TrpAla: 0.57 ± 0.154
0.052TrpCys: 0.052 ± 0.06
0.363TrpAsp: 0.363 ± 0.136
0.518TrpGlu: 0.518 ± 0.141
0.726TrpPhe: 0.726 ± 0.264
0.518TrpGly: 0.518 ± 0.198
0.104TrpHis: 0.104 ± 0.09
0.363TrpIle: 0.363 ± 0.155
0.674TrpLys: 0.674 ± 0.183
0.674TrpLeu: 0.674 ± 0.193
0.311TrpMet: 0.311 ± 0.138
0.777TrpAsn: 0.777 ± 0.178
0.0TrpPro: 0.0 ± 0.0
0.363TrpGln: 0.363 ± 0.147
0.622TrpArg: 0.622 ± 0.169
0.933TrpSer: 0.933 ± 0.27
0.363TrpThr: 0.363 ± 0.185
0.415TrpVal: 0.415 ± 0.173
0.052TrpTrp: 0.052 ± 0.053
0.363TrpTyr: 0.363 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.54TyrAla: 2.54 ± 0.405
0.518TyrCys: 0.518 ± 0.229
3.265TyrAsp: 3.265 ± 0.418
2.902TyrGlu: 2.902 ± 0.504
2.28TyrPhe: 2.28 ± 0.599
2.54TyrGly: 2.54 ± 0.414
0.57TyrHis: 0.57 ± 0.159
3.472TyrIle: 3.472 ± 0.486
4.094TyrLys: 4.094 ± 0.409
3.628TyrLeu: 3.628 ± 0.566
0.933TyrMet: 0.933 ± 0.214
4.094TyrAsn: 4.094 ± 0.524
1.192TyrPro: 1.192 ± 0.22
1.399TyrGln: 1.399 ± 0.261
1.399TyrArg: 1.399 ± 0.255
3.006TyrSer: 3.006 ± 0.483
2.954TyrThr: 2.954 ± 0.493
2.591TyrVal: 2.591 ± 0.492
0.415TyrTrp: 0.415 ± 0.171
2.28TyrTyr: 2.28 ± 0.441
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (19296 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski