Amino acid dipepetide frequency for Bacteriophage Phobos

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.06AlaAla: 15.06 ± 1.883
0.977AlaCys: 0.977 ± 0.294
6.38AlaAsp: 6.38 ± 0.625
7.3AlaGlu: 7.3 ± 0.677
3.449AlaPhe: 3.449 ± 0.515
8.22AlaGly: 8.22 ± 1.165
2.012AlaHis: 2.012 ± 0.354
5.518AlaIle: 5.518 ± 0.596
5.461AlaLys: 5.461 ± 0.794
9.427AlaLeu: 9.427 ± 0.993
2.759AlaMet: 2.759 ± 0.447
4.426AlaAsn: 4.426 ± 0.546
4.311AlaPro: 4.311 ± 0.573
5.863AlaGln: 5.863 ± 0.8
6.898AlaArg: 6.898 ± 0.867
4.713AlaSer: 4.713 ± 0.627
5.461AlaThr: 5.461 ± 0.739
5.921AlaVal: 5.921 ± 0.496
1.265AlaTrp: 1.265 ± 0.336
2.759AlaTyr: 2.759 ± 0.333
0.0AlaXaa: 0.0 ± 0.0
Cys
0.517CysAla: 0.517 ± 0.179
0.0CysCys: 0.0 ± 0.0
0.632CysAsp: 0.632 ± 0.193
0.46CysGlu: 0.46 ± 0.198
0.172CysPhe: 0.172 ± 0.093
0.46CysGly: 0.46 ± 0.208
0.115CysHis: 0.115 ± 0.077
0.46CysIle: 0.46 ± 0.183
0.402CysLys: 0.402 ± 0.192
1.035CysLeu: 1.035 ± 0.291
0.46CysMet: 0.46 ± 0.133
0.172CysAsn: 0.172 ± 0.107
1.207CysPro: 1.207 ± 0.328
0.172CysGln: 0.172 ± 0.138
0.747CysArg: 0.747 ± 0.171
0.23CysSer: 0.23 ± 0.112
0.69CysThr: 0.69 ± 0.198
0.46CysVal: 0.46 ± 0.186
0.287CysTrp: 0.287 ± 0.145
0.345CysTyr: 0.345 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
7.243AspAla: 7.243 ± 0.657
0.287AspCys: 0.287 ± 0.12
5.346AspAsp: 5.346 ± 0.938
5.633AspGlu: 5.633 ± 0.68
2.702AspPhe: 2.702 ± 0.414
4.828AspGly: 4.828 ± 0.624
1.092AspHis: 1.092 ± 0.305
3.276AspIle: 3.276 ± 0.514
2.587AspLys: 2.587 ± 0.532
5.921AspLeu: 5.921 ± 0.702
2.069AspMet: 2.069 ± 0.364
1.724AspAsn: 1.724 ± 0.391
3.276AspPro: 3.276 ± 0.486
2.069AspGln: 2.069 ± 0.354
4.426AspArg: 4.426 ± 0.487
2.702AspSer: 2.702 ± 0.419
3.334AspThr: 3.334 ± 0.609
4.254AspVal: 4.254 ± 0.538
1.38AspTrp: 1.38 ± 0.325
2.587AspTyr: 2.587 ± 0.499
0.0AspXaa: 0.0 ± 0.0
Glu
7.932GluAla: 7.932 ± 0.898
0.575GluCys: 0.575 ± 0.205
4.828GluAsp: 4.828 ± 0.642
4.598GluGlu: 4.598 ± 0.569
2.587GluPhe: 2.587 ± 0.341
4.426GluGly: 4.426 ± 0.465
1.437GluHis: 1.437 ± 0.33
1.609GluIle: 1.609 ± 0.347
2.932GluLys: 2.932 ± 0.461
7.875GluLeu: 7.875 ± 0.829
0.977GluMet: 0.977 ± 0.26
1.724GluAsn: 1.724 ± 0.283
2.644GluPro: 2.644 ± 0.513
3.679GluGln: 3.679 ± 0.767
5.058GluArg: 5.058 ± 0.584
2.932GluSer: 2.932 ± 0.4
2.989GluThr: 2.989 ± 0.357
4.771GluVal: 4.771 ± 0.514
1.322GluTrp: 1.322 ± 0.34
1.265GluTyr: 1.265 ± 0.353
0.0GluXaa: 0.0 ± 0.0
Phe
2.702PheAla: 2.702 ± 0.412
0.172PheCys: 0.172 ± 0.116
2.472PheAsp: 2.472 ± 0.457
2.069PheGlu: 2.069 ± 0.331
1.15PhePhe: 1.15 ± 0.374
3.794PheGly: 3.794 ± 0.511
0.747PheHis: 0.747 ± 0.208
1.609PheIle: 1.609 ± 0.305
1.552PheLys: 1.552 ± 0.319
2.874PheLeu: 2.874 ± 0.381
0.69PheMet: 0.69 ± 0.22
1.609PheAsn: 1.609 ± 0.301
1.609PhePro: 1.609 ± 0.33
1.839PheGln: 1.839 ± 0.328
2.184PheArg: 2.184 ± 0.423
1.724PheSer: 1.724 ± 0.324
2.357PheThr: 2.357 ± 0.329
2.012PheVal: 2.012 ± 0.476
0.977PheTrp: 0.977 ± 0.298
1.035PheTyr: 1.035 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
6.898GlyAla: 6.898 ± 0.778
0.747GlyCys: 0.747 ± 0.268
4.828GlyAsp: 4.828 ± 0.641
4.886GlyGlu: 4.886 ± 0.495
3.047GlyPhe: 3.047 ± 0.513
6.036GlyGly: 6.036 ± 0.703
1.437GlyHis: 1.437 ± 0.375
2.817GlyIle: 2.817 ± 0.383
5.116GlyLys: 5.116 ± 0.707
6.208GlyLeu: 6.208 ± 0.689
2.472GlyMet: 2.472 ± 0.391
2.587GlyAsn: 2.587 ± 0.424
2.242GlyPro: 2.242 ± 0.403
2.759GlyGln: 2.759 ± 0.467
4.081GlyArg: 4.081 ± 0.433
3.736GlySer: 3.736 ± 0.423
5.403GlyThr: 5.403 ± 0.682
5.748GlyVal: 5.748 ± 0.759
1.322GlyTrp: 1.322 ± 0.307
2.069GlyTyr: 2.069 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
1.839HisAla: 1.839 ± 0.388
0.287HisCys: 0.287 ± 0.135
1.724HisAsp: 1.724 ± 0.316
1.035HisGlu: 1.035 ± 0.236
0.575HisPhe: 0.575 ± 0.181
1.38HisGly: 1.38 ± 0.334
0.632HisHis: 0.632 ± 0.216
0.977HisIle: 0.977 ± 0.252
0.805HisLys: 0.805 ± 0.196
1.207HisLeu: 1.207 ± 0.271
0.46HisMet: 0.46 ± 0.169
0.69HisAsn: 0.69 ± 0.187
0.92HisPro: 0.92 ± 0.291
0.402HisGln: 0.402 ± 0.185
1.667HisArg: 1.667 ± 0.386
0.977HisSer: 0.977 ± 0.234
1.207HisThr: 1.207 ± 0.2
0.92HisVal: 0.92 ± 0.23
0.345HisTrp: 0.345 ± 0.164
0.977HisTyr: 0.977 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
4.426IleAla: 4.426 ± 0.545
0.46IleCys: 0.46 ± 0.176
3.161IleAsp: 3.161 ± 0.381
2.932IleGlu: 2.932 ± 0.385
1.322IlePhe: 1.322 ± 0.291
2.472IleGly: 2.472 ± 0.378
1.092IleHis: 1.092 ± 0.268
2.012IleIle: 2.012 ± 0.478
2.184IleLys: 2.184 ± 0.401
3.794IleLeu: 3.794 ± 0.447
1.092IleMet: 1.092 ± 0.282
1.552IleAsn: 1.552 ± 0.266
2.357IlePro: 2.357 ± 0.396
1.609IleGln: 1.609 ± 0.27
2.529IleArg: 2.529 ± 0.336
2.127IleSer: 2.127 ± 0.324
2.989IleThr: 2.989 ± 0.437
2.587IleVal: 2.587 ± 0.399
0.805IleTrp: 0.805 ± 0.246
0.977IleTyr: 0.977 ± 0.393
0.0IleXaa: 0.0 ± 0.0
Lys
5.058LysAla: 5.058 ± 1.088
0.172LysCys: 0.172 ± 0.116
4.369LysAsp: 4.369 ± 0.538
2.414LysGlu: 2.414 ± 0.395
1.495LysPhe: 1.495 ± 0.257
3.334LysGly: 3.334 ± 0.519
0.747LysHis: 0.747 ± 0.253
2.242LysIle: 2.242 ± 0.333
2.472LysLys: 2.472 ± 0.433
4.886LysLeu: 4.886 ± 0.601
1.38LysMet: 1.38 ± 0.303
1.839LysAsn: 1.839 ± 0.328
3.104LysPro: 3.104 ± 0.533
1.839LysGln: 1.839 ± 0.488
3.161LysArg: 3.161 ± 0.608
2.012LysSer: 2.012 ± 0.408
2.759LysThr: 2.759 ± 0.442
3.334LysVal: 3.334 ± 0.622
0.805LysTrp: 0.805 ± 0.279
1.322LysTyr: 1.322 ± 0.466
0.0LysXaa: 0.0 ± 0.0
Leu
9.657LeuAla: 9.657 ± 0.672
1.092LeuCys: 1.092 ± 0.304
7.588LeuAsp: 7.588 ± 0.868
6.783LeuGlu: 6.783 ± 0.753
2.242LeuPhe: 2.242 ± 0.338
7.415LeuGly: 7.415 ± 0.739
1.495LeuHis: 1.495 ± 0.327
3.851LeuIle: 3.851 ± 0.453
4.369LeuLys: 4.369 ± 0.624
7.07LeuLeu: 7.07 ± 0.634
2.357LeuMet: 2.357 ± 0.424
3.047LeuAsn: 3.047 ± 0.407
5.116LeuPro: 5.116 ± 0.633
3.564LeuGln: 3.564 ± 0.547
5.921LeuArg: 5.921 ± 0.569
4.541LeuSer: 4.541 ± 0.543
5.461LeuThr: 5.461 ± 0.706
4.771LeuVal: 4.771 ± 0.447
0.977LeuTrp: 0.977 ± 0.27
1.897LeuTyr: 1.897 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
2.989MetAla: 2.989 ± 0.451
0.287MetCys: 0.287 ± 0.122
1.552MetAsp: 1.552 ± 0.235
1.782MetGlu: 1.782 ± 0.301
0.46MetPhe: 0.46 ± 0.157
1.265MetGly: 1.265 ± 0.307
0.46MetHis: 0.46 ± 0.138
1.207MetIle: 1.207 ± 0.253
1.15MetLys: 1.15 ± 0.301
1.897MetLeu: 1.897 ± 0.457
0.517MetMet: 0.517 ± 0.132
1.035MetAsn: 1.035 ± 0.227
1.782MetPro: 1.782 ± 0.383
1.667MetGln: 1.667 ± 0.313
1.839MetArg: 1.839 ± 0.319
2.069MetSer: 2.069 ± 0.319
2.932MetThr: 2.932 ± 0.448
1.552MetVal: 1.552 ± 0.341
0.46MetTrp: 0.46 ± 0.144
0.46MetTyr: 0.46 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
4.943AsnAla: 4.943 ± 0.735
0.345AsnCys: 0.345 ± 0.144
1.495AsnAsp: 1.495 ± 0.302
1.839AsnGlu: 1.839 ± 0.321
1.552AsnPhe: 1.552 ± 0.231
3.219AsnGly: 3.219 ± 0.519
0.345AsnHis: 0.345 ± 0.143
1.207AsnIle: 1.207 ± 0.325
1.437AsnLys: 1.437 ± 0.343
3.391AsnLeu: 3.391 ± 0.497
0.862AsnMet: 0.862 ± 0.2
1.437AsnAsn: 1.437 ± 0.343
2.357AsnPro: 2.357 ± 0.35
1.207AsnGln: 1.207 ± 0.308
2.184AsnArg: 2.184 ± 0.349
1.667AsnSer: 1.667 ± 0.283
2.587AsnThr: 2.587 ± 0.439
2.472AsnVal: 2.472 ± 0.346
0.805AsnTrp: 0.805 ± 0.256
1.437AsnTyr: 1.437 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
7.013ProAla: 7.013 ± 1.209
0.115ProCys: 0.115 ± 0.077
2.759ProAsp: 2.759 ± 0.522
4.426ProGlu: 4.426 ± 0.564
1.322ProPhe: 1.322 ± 0.453
3.966ProGly: 3.966 ± 0.572
0.862ProHis: 0.862 ± 0.272
1.724ProIle: 1.724 ± 0.332
2.644ProLys: 2.644 ± 0.489
3.219ProLeu: 3.219 ± 0.419
0.92ProMet: 0.92 ± 0.196
1.954ProAsn: 1.954 ± 0.462
2.299ProPro: 2.299 ± 0.612
1.552ProGln: 1.552 ± 0.274
2.242ProArg: 2.242 ± 0.373
2.644ProSer: 2.644 ± 0.414
2.529ProThr: 2.529 ± 0.342
5.346ProVal: 5.346 ± 0.749
0.92ProTrp: 0.92 ± 0.246
1.322ProTyr: 1.322 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
5.576GlnAla: 5.576 ± 1.135
0.287GlnCys: 0.287 ± 0.133
1.897GlnAsp: 1.897 ± 0.323
2.932GlnGlu: 2.932 ± 0.508
1.437GlnPhe: 1.437 ± 0.315
3.104GlnGly: 3.104 ± 0.498
0.69GlnHis: 0.69 ± 0.267
2.012GlnIle: 2.012 ± 0.507
1.782GlnLys: 1.782 ± 0.327
3.564GlnLeu: 3.564 ± 0.578
1.897GlnMet: 1.897 ± 0.397
1.609GlnAsn: 1.609 ± 0.375
2.069GlnPro: 2.069 ± 0.441
2.184GlnGln: 2.184 ± 0.815
2.874GlnArg: 2.874 ± 0.351
2.299GlnSer: 2.299 ± 0.622
1.782GlnThr: 1.782 ± 0.351
2.587GlnVal: 2.587 ± 0.425
0.747GlnTrp: 0.747 ± 0.216
0.862GlnTyr: 0.862 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
6.15ArgAla: 6.15 ± 0.545
0.805ArgCys: 0.805 ± 0.218
4.369ArgAsp: 4.369 ± 0.554
4.656ArgGlu: 4.656 ± 0.702
2.702ArgPhe: 2.702 ± 0.413
4.139ArgGly: 4.139 ± 0.458
1.437ArgHis: 1.437 ± 0.317
2.932ArgIle: 2.932 ± 0.454
2.587ArgLys: 2.587 ± 0.394
7.588ArgLeu: 7.588 ± 0.747
2.012ArgMet: 2.012 ± 0.294
2.414ArgAsn: 2.414 ± 0.316
2.644ArgPro: 2.644 ± 0.452
2.702ArgGln: 2.702 ± 0.586
4.541ArgArg: 4.541 ± 0.75
2.242ArgSer: 2.242 ± 0.356
3.391ArgThr: 3.391 ± 0.429
4.254ArgVal: 4.254 ± 0.539
1.15ArgTrp: 1.15 ± 0.24
1.495ArgTyr: 1.495 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
4.196SerAla: 4.196 ± 0.74
0.46SerCys: 0.46 ± 0.141
2.587SerAsp: 2.587 ± 0.355
2.644SerGlu: 2.644 ± 0.416
1.782SerPhe: 1.782 ± 0.309
4.426SerGly: 4.426 ± 0.713
0.92SerHis: 0.92 ± 0.267
2.127SerIle: 2.127 ± 0.35
2.702SerLys: 2.702 ± 0.518
4.081SerLeu: 4.081 ± 0.644
1.609SerMet: 1.609 ± 0.462
1.609SerAsn: 1.609 ± 0.326
2.529SerPro: 2.529 ± 0.357
2.184SerGln: 2.184 ± 0.43
2.989SerArg: 2.989 ± 0.442
2.472SerSer: 2.472 ± 0.431
3.276SerThr: 3.276 ± 0.404
3.679SerVal: 3.679 ± 0.472
0.805SerTrp: 0.805 ± 0.194
1.38SerTyr: 1.38 ± 0.231
0.0SerXaa: 0.0 ± 0.0
Thr
6.783ThrAla: 6.783 ± 0.967
0.632ThrCys: 0.632 ± 0.214
3.909ThrAsp: 3.909 ± 0.522
3.391ThrGlu: 3.391 ± 0.467
2.587ThrPhe: 2.587 ± 0.426
4.311ThrGly: 4.311 ± 0.432
1.265ThrHis: 1.265 ± 0.214
2.184ThrIle: 2.184 ± 0.454
2.989ThrLys: 2.989 ± 0.369
5.633ThrLeu: 5.633 ± 0.522
1.552ThrMet: 1.552 ± 0.34
2.242ThrAsn: 2.242 ± 0.403
3.449ThrPro: 3.449 ± 0.698
1.954ThrGln: 1.954 ± 0.603
2.874ThrArg: 2.874 ± 0.502
3.047ThrSer: 3.047 ± 0.502
3.736ThrThr: 3.736 ± 0.505
4.943ThrVal: 4.943 ± 0.828
1.207ThrTrp: 1.207 ± 0.281
1.724ThrTyr: 1.724 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
5.978ValAla: 5.978 ± 0.76
0.575ValCys: 0.575 ± 0.205
3.679ValAsp: 3.679 ± 0.566
4.196ValGlu: 4.196 ± 0.587
2.702ValPhe: 2.702 ± 0.426
4.311ValGly: 4.311 ± 0.504
1.322ValHis: 1.322 ± 0.231
2.529ValIle: 2.529 ± 0.438
4.024ValLys: 4.024 ± 0.524
5.518ValLeu: 5.518 ± 0.735
1.954ValMet: 1.954 ± 0.303
3.276ValAsn: 3.276 ± 0.517
3.334ValPro: 3.334 ± 0.632
2.989ValGln: 2.989 ± 0.419
4.771ValArg: 4.771 ± 0.58
3.219ValSer: 3.219 ± 0.519
4.484ValThr: 4.484 ± 0.703
4.369ValVal: 4.369 ± 0.495
1.322ValTrp: 1.322 ± 0.311
2.184ValTyr: 2.184 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
1.15TrpAla: 1.15 ± 0.259
0.402TrpCys: 0.402 ± 0.187
1.437TrpAsp: 1.437 ± 0.331
0.747TrpGlu: 0.747 ± 0.197
0.632TrpPhe: 0.632 ± 0.147
0.805TrpGly: 0.805 ± 0.243
0.345TrpHis: 0.345 ± 0.178
0.92TrpIle: 0.92 ± 0.279
0.345TrpLys: 0.345 ± 0.128
1.667TrpLeu: 1.667 ± 0.331
0.517TrpMet: 0.517 ± 0.165
0.69TrpAsn: 0.69 ± 0.243
1.092TrpPro: 1.092 ± 0.308
1.092TrpGln: 1.092 ± 0.248
0.92TrpArg: 0.92 ± 0.253
1.667TrpSer: 1.667 ± 0.352
1.495TrpThr: 1.495 ± 0.287
1.207TrpVal: 1.207 ± 0.283
0.287TrpTrp: 0.287 ± 0.149
0.402TrpTyr: 0.402 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.069TyrAla: 2.069 ± 0.314
0.345TyrCys: 0.345 ± 0.136
1.724TyrAsp: 1.724 ± 0.434
1.437TyrGlu: 1.437 ± 0.379
1.265TyrPhe: 1.265 ± 0.439
2.529TyrGly: 2.529 ± 0.587
0.575TyrHis: 0.575 ± 0.17
1.265TyrIle: 1.265 ± 0.265
1.265TyrLys: 1.265 ± 0.361
2.357TyrLeu: 2.357 ± 0.457
0.747TyrMet: 0.747 ± 0.182
1.15TyrAsn: 1.15 ± 0.198
1.437TyrPro: 1.437 ± 0.345
0.862TyrGln: 0.862 ± 0.234
2.299TyrArg: 2.299 ± 0.409
1.437TyrSer: 1.437 ± 0.264
1.552TyrThr: 1.552 ± 0.353
1.552TyrVal: 1.552 ± 0.323
0.632TyrTrp: 0.632 ± 0.192
0.575TyrTyr: 0.575 ± 0.176
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (17398 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski