Amino acid dipepetide frequency for Synechococcus T7-like phage S-TIP37

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.666AlaAla: 9.666 ± 1.171
0.727AlaCys: 0.727 ± 0.264
6.905AlaAsp: 6.905 ± 1.164
5.451AlaGlu: 5.451 ± 0.845
4.143AlaPhe: 4.143 ± 0.583
6.541AlaGly: 6.541 ± 1.092
1.163AlaHis: 1.163 ± 0.298
4.433AlaIle: 4.433 ± 0.446
6.468AlaLys: 6.468 ± 1.052
6.178AlaLeu: 6.178 ± 0.695
2.18AlaMet: 2.18 ± 0.445
4.433AlaAsn: 4.433 ± 1.809
3.198AlaPro: 3.198 ± 0.553
3.634AlaGln: 3.634 ± 0.691
4.433AlaArg: 4.433 ± 0.578
6.832AlaSer: 6.832 ± 0.77
5.596AlaThr: 5.596 ± 0.739
5.524AlaVal: 5.524 ± 1.154
0.945AlaTrp: 0.945 ± 0.251
2.907AlaTyr: 2.907 ± 0.623
0.0AlaXaa: 0.0 ± 0.0
Cys
0.872CysAla: 0.872 ± 0.252
0.291CysCys: 0.291 ± 0.117
0.581CysAsp: 0.581 ± 0.205
0.581CysGlu: 0.581 ± 0.234
0.291CysPhe: 0.291 ± 0.143
0.291CysGly: 0.291 ± 0.152
0.291CysHis: 0.291 ± 0.158
0.945CysIle: 0.945 ± 0.309
0.727CysLys: 0.727 ± 0.235
0.727CysLeu: 0.727 ± 0.232
0.145CysMet: 0.145 ± 0.146
0.218CysAsn: 0.218 ± 0.116
0.218CysPro: 0.218 ± 0.118
0.436CysGln: 0.436 ± 0.178
0.581CysArg: 0.581 ± 0.217
0.727CysSer: 0.727 ± 0.27
1.018CysThr: 1.018 ± 0.277
0.581CysVal: 0.581 ± 0.24
0.0CysTrp: 0.0 ± 0.0
0.218CysTyr: 0.218 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
5.524AspAla: 5.524 ± 0.659
0.436AspCys: 0.436 ± 0.192
3.852AspAsp: 3.852 ± 0.596
4.433AspGlu: 4.433 ± 0.65
2.398AspPhe: 2.398 ± 0.441
3.779AspGly: 3.779 ± 0.5
1.381AspHis: 1.381 ± 0.381
3.779AspIle: 3.779 ± 0.486
3.053AspLys: 3.053 ± 0.448
7.486AspLeu: 7.486 ± 0.583
1.381AspMet: 1.381 ± 0.342
2.471AspAsn: 2.471 ± 0.41
2.689AspPro: 2.689 ± 0.523
3.125AspGln: 3.125 ± 0.447
3.053AspArg: 3.053 ± 0.604
4.361AspSer: 4.361 ± 0.768
3.416AspThr: 3.416 ± 0.489
5.015AspVal: 5.015 ± 0.616
1.09AspTrp: 1.09 ± 0.277
2.544AspTyr: 2.544 ± 0.414
0.0AspXaa: 0.0 ± 0.0
Glu
5.233GluAla: 5.233 ± 0.665
0.581GluCys: 0.581 ± 0.219
3.343GluAsp: 3.343 ± 0.457
5.088GluGlu: 5.088 ± 0.93
2.398GluPhe: 2.398 ± 0.427
3.997GluGly: 3.997 ± 0.532
1.236GluHis: 1.236 ± 0.275
3.053GluIle: 3.053 ± 0.6
2.907GluLys: 2.907 ± 0.591
6.032GluLeu: 6.032 ± 0.754
2.035GluMet: 2.035 ± 0.364
2.98GluAsn: 2.98 ± 0.457
1.744GluPro: 1.744 ± 0.384
2.689GluGln: 2.689 ± 0.671
3.198GluArg: 3.198 ± 0.498
2.762GluSer: 2.762 ± 0.442
3.707GluThr: 3.707 ± 0.455
4.797GluVal: 4.797 ± 0.641
1.454GluTrp: 1.454 ± 0.37
1.962GluTyr: 1.962 ± 0.389
0.0GluXaa: 0.0 ± 0.0
Phe
2.471PheAla: 2.471 ± 0.428
0.581PheCys: 0.581 ± 0.2
2.689PheAsp: 2.689 ± 0.603
2.108PheGlu: 2.108 ± 0.438
1.817PhePhe: 1.817 ± 0.457
2.326PheGly: 2.326 ± 0.486
1.163PheHis: 1.163 ± 0.273
2.253PheIle: 2.253 ± 0.421
1.962PheLys: 1.962 ± 0.418
2.907PheLeu: 2.907 ± 0.536
1.018PheMet: 1.018 ± 0.239
2.689PheAsn: 2.689 ± 0.546
1.308PhePro: 1.308 ± 0.271
1.599PheGln: 1.599 ± 0.362
2.035PheArg: 2.035 ± 0.333
2.253PheSer: 2.253 ± 0.418
2.616PheThr: 2.616 ± 0.516
2.108PheVal: 2.108 ± 0.343
0.291PheTrp: 0.291 ± 0.143
1.163PheTyr: 1.163 ± 0.3
0.0PheXaa: 0.0 ± 0.0
Gly
5.233GlyAla: 5.233 ± 0.721
0.727GlyCys: 0.727 ± 0.241
5.088GlyAsp: 5.088 ± 0.637
4.07GlyGlu: 4.07 ± 0.572
1.962GlyPhe: 1.962 ± 0.345
5.596GlyGly: 5.596 ± 1.296
1.526GlyHis: 1.526 ± 0.373
4.652GlyIle: 4.652 ± 0.637
3.053GlyLys: 3.053 ± 0.421
4.652GlyLeu: 4.652 ± 0.5
1.526GlyMet: 1.526 ± 0.268
4.215GlyAsn: 4.215 ± 0.907
1.817GlyPro: 1.817 ± 0.374
3.198GlyGln: 3.198 ± 0.477
3.779GlyArg: 3.779 ± 0.477
5.814GlySer: 5.814 ± 1.251
5.451GlyThr: 5.451 ± 0.947
4.433GlyVal: 4.433 ± 0.536
0.727GlyTrp: 0.727 ± 0.24
2.98GlyTyr: 2.98 ± 0.444
0.0GlyXaa: 0.0 ± 0.0
His
0.799HisAla: 0.799 ± 0.258
0.436HisCys: 0.436 ± 0.187
1.308HisAsp: 1.308 ± 0.336
1.526HisGlu: 1.526 ± 0.423
0.727HisPhe: 0.727 ± 0.249
1.236HisGly: 1.236 ± 0.323
0.291HisHis: 0.291 ± 0.172
0.363HisIle: 0.363 ± 0.161
0.872HisLys: 0.872 ± 0.25
1.599HisLeu: 1.599 ± 0.506
0.727HisMet: 0.727 ± 0.247
1.526HisAsn: 1.526 ± 0.436
0.945HisPro: 0.945 ± 0.249
0.799HisGln: 0.799 ± 0.251
1.163HisArg: 1.163 ± 0.332
1.744HisSer: 1.744 ± 1.241
1.744HisThr: 1.744 ± 0.409
1.018HisVal: 1.018 ± 0.231
0.291HisTrp: 0.291 ± 0.149
0.799HisTyr: 0.799 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
6.396IleAla: 6.396 ± 1.419
0.436IleCys: 0.436 ± 0.204
4.215IleAsp: 4.215 ± 0.517
3.997IleGlu: 3.997 ± 0.586
1.236IlePhe: 1.236 ± 0.359
2.98IleGly: 2.98 ± 0.487
1.308IleHis: 1.308 ± 0.357
1.744IleIle: 1.744 ± 0.426
2.98IleLys: 2.98 ± 0.521
3.707IleLeu: 3.707 ± 0.576
1.526IleMet: 1.526 ± 0.405
2.18IleAsn: 2.18 ± 0.547
2.471IlePro: 2.471 ± 0.415
1.817IleGln: 1.817 ± 0.332
2.18IleArg: 2.18 ± 0.374
3.561IleSer: 3.561 ± 0.487
4.579IleThr: 4.579 ± 0.642
2.762IleVal: 2.762 ± 0.495
0.436IleTrp: 0.436 ± 0.175
1.599IleTyr: 1.599 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
5.814LysAla: 5.814 ± 0.808
0.436LysCys: 0.436 ± 0.188
3.053LysAsp: 3.053 ± 0.461
3.125LysGlu: 3.125 ± 0.526
2.035LysPhe: 2.035 ± 0.353
3.416LysGly: 3.416 ± 0.635
0.799LysHis: 0.799 ± 0.263
3.707LysIle: 3.707 ± 1.135
3.343LysLys: 3.343 ± 0.575
5.378LysLeu: 5.378 ± 0.594
1.599LysMet: 1.599 ± 0.403
1.962LysAsn: 1.962 ± 0.359
2.835LysPro: 2.835 ± 0.645
1.817LysGln: 1.817 ± 0.461
3.852LysArg: 3.852 ± 0.611
2.762LysSer: 2.762 ± 0.502
3.271LysThr: 3.271 ± 0.528
3.271LysVal: 3.271 ± 0.52
0.581LysTrp: 0.581 ± 0.173
2.18LysTyr: 2.18 ± 0.418
0.0LysXaa: 0.0 ± 0.0
Leu
7.413LeuAla: 7.413 ± 0.777
1.018LeuCys: 1.018 ± 0.304
5.233LeuAsp: 5.233 ± 0.561
4.724LeuGlu: 4.724 ± 0.559
3.416LeuPhe: 3.416 ± 0.606
5.233LeuGly: 5.233 ± 0.533
2.035LeuHis: 2.035 ± 0.386
3.925LeuIle: 3.925 ± 0.742
5.088LeuLys: 5.088 ± 0.808
6.25LeuLeu: 6.25 ± 0.833
1.89LeuMet: 1.89 ± 0.364
4.652LeuAsn: 4.652 ± 0.483
3.561LeuPro: 3.561 ± 0.549
3.561LeuGln: 3.561 ± 0.751
4.797LeuArg: 4.797 ± 0.749
5.088LeuSer: 5.088 ± 0.768
4.652LeuThr: 4.652 ± 0.619
4.215LeuVal: 4.215 ± 0.493
0.363LeuTrp: 0.363 ± 0.14
1.89LeuTyr: 1.89 ± 0.333
0.0LeuXaa: 0.0 ± 0.0
Met
3.125MetAla: 3.125 ± 0.522
0.218MetCys: 0.218 ± 0.132
1.744MetAsp: 1.744 ± 0.336
1.454MetGlu: 1.454 ± 0.327
0.654MetPhe: 0.654 ± 0.198
1.454MetGly: 1.454 ± 0.315
0.509MetHis: 0.509 ± 0.264
1.454MetIle: 1.454 ± 0.342
1.454MetLys: 1.454 ± 0.273
2.035MetLeu: 2.035 ± 0.402
0.436MetMet: 0.436 ± 0.141
1.236MetAsn: 1.236 ± 0.242
0.945MetPro: 0.945 ± 0.279
1.09MetGln: 1.09 ± 0.265
1.817MetArg: 1.817 ± 0.415
2.616MetSer: 2.616 ± 0.383
1.817MetThr: 1.817 ± 0.314
1.308MetVal: 1.308 ± 0.279
0.436MetTrp: 0.436 ± 0.154
0.581MetTyr: 0.581 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
4.652AsnAla: 4.652 ± 0.863
0.363AsnCys: 0.363 ± 0.159
3.997AsnAsp: 3.997 ± 0.535
2.035AsnGlu: 2.035 ± 0.405
1.817AsnPhe: 1.817 ± 0.303
4.724AsnGly: 4.724 ± 0.552
0.945AsnHis: 0.945 ± 0.649
2.398AsnIle: 2.398 ± 0.71
2.253AsnLys: 2.253 ± 0.386
3.779AsnLeu: 3.779 ± 0.487
1.672AsnMet: 1.672 ± 0.366
2.471AsnAsn: 2.471 ± 0.443
2.689AsnPro: 2.689 ± 0.385
1.454AsnGln: 1.454 ± 0.263
2.762AsnArg: 2.762 ± 0.314
2.98AsnSer: 2.98 ± 0.766
4.288AsnThr: 4.288 ± 1.216
4.07AsnVal: 4.07 ± 1.188
0.509AsnTrp: 0.509 ± 0.216
1.817AsnTyr: 1.817 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
3.343ProAla: 3.343 ± 0.535
0.145ProCys: 0.145 ± 0.097
2.98ProAsp: 2.98 ± 0.381
3.125ProGlu: 3.125 ± 0.586
1.672ProPhe: 1.672 ± 0.429
2.98ProGly: 2.98 ± 0.507
0.509ProHis: 0.509 ± 0.213
2.762ProIle: 2.762 ± 0.515
2.108ProLys: 2.108 ± 0.508
2.471ProLeu: 2.471 ± 0.459
0.945ProMet: 0.945 ± 0.279
2.398ProAsn: 2.398 ± 0.427
2.689ProPro: 2.689 ± 0.826
1.744ProGln: 1.744 ± 0.38
1.308ProArg: 1.308 ± 0.288
2.762ProSer: 2.762 ± 0.521
2.398ProThr: 2.398 ± 0.34
2.108ProVal: 2.108 ± 0.444
1.09ProTrp: 1.09 ± 0.319
1.163ProTyr: 1.163 ± 0.247
0.0ProXaa: 0.0 ± 0.0
Gln
4.361GlnAla: 4.361 ± 0.733
0.654GlnCys: 0.654 ± 0.202
2.398GlnAsp: 2.398 ± 0.376
2.689GlnGlu: 2.689 ± 0.403
1.599GlnPhe: 1.599 ± 0.259
2.616GlnGly: 2.616 ± 0.432
0.727GlnHis: 0.727 ± 0.243
2.18GlnIle: 2.18 ± 0.368
1.526GlnLys: 1.526 ± 0.347
4.07GlnLeu: 4.07 ± 0.547
1.236GlnMet: 1.236 ± 0.318
2.035GlnAsn: 2.035 ± 0.399
1.599GlnPro: 1.599 ± 0.322
2.835GlnGln: 2.835 ± 0.593
1.454GlnArg: 1.454 ± 0.338
2.326GlnSer: 2.326 ± 0.374
2.689GlnThr: 2.689 ± 0.422
2.253GlnVal: 2.253 ± 0.415
0.291GlnTrp: 0.291 ± 0.13
1.381GlnTyr: 1.381 ± 0.423
0.0GlnXaa: 0.0 ± 0.0
Arg
4.215ArgAla: 4.215 ± 0.737
0.436ArgCys: 0.436 ± 0.191
2.835ArgAsp: 2.835 ± 0.407
3.634ArgGlu: 3.634 ± 0.646
1.817ArgPhe: 1.817 ± 0.334
3.416ArgGly: 3.416 ± 0.522
0.945ArgHis: 0.945 ± 0.363
2.398ArgIle: 2.398 ± 0.28
3.271ArgLys: 3.271 ± 0.605
4.07ArgLeu: 4.07 ± 0.572
1.744ArgMet: 1.744 ± 0.341
2.108ArgAsn: 2.108 ± 0.369
2.326ArgPro: 2.326 ± 0.443
2.689ArgGln: 2.689 ± 0.526
2.689ArgArg: 2.689 ± 0.486
3.271ArgSer: 3.271 ± 0.51
2.98ArgThr: 2.98 ± 0.455
2.544ArgVal: 2.544 ± 0.579
0.509ArgTrp: 0.509 ± 0.169
1.89ArgTyr: 1.89 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
5.524SerAla: 5.524 ± 0.742
0.654SerCys: 0.654 ± 0.265
3.997SerAsp: 3.997 ± 0.539
2.544SerGlu: 2.544 ± 0.469
2.689SerPhe: 2.689 ± 0.388
6.614SerGly: 6.614 ± 0.946
1.744SerHis: 1.744 ± 0.817
3.198SerIle: 3.198 ± 0.527
3.634SerLys: 3.634 ± 0.474
4.87SerLeu: 4.87 ± 0.592
1.236SerMet: 1.236 ± 0.343
4.288SerAsn: 4.288 ± 0.841
1.817SerPro: 1.817 ± 0.342
1.817SerGln: 1.817 ± 0.265
3.053SerArg: 3.053 ± 0.578
5.742SerSer: 5.742 ± 1.149
6.032SerThr: 6.032 ± 1.219
4.797SerVal: 4.797 ± 0.612
1.308SerTrp: 1.308 ± 0.319
2.035SerTyr: 2.035 ± 0.413
0.0SerXaa: 0.0 ± 0.0
Thr
6.687ThrAla: 6.687 ± 1.251
0.654ThrCys: 0.654 ± 0.18
3.707ThrAsp: 3.707 ± 0.592
3.707ThrGlu: 3.707 ± 0.622
2.689ThrPhe: 2.689 ± 0.376
5.524ThrGly: 5.524 ± 1.057
0.799ThrHis: 0.799 ± 0.308
3.416ThrIle: 3.416 ± 0.595
3.707ThrLys: 3.707 ± 0.517
5.233ThrLeu: 5.233 ± 0.404
1.163ThrMet: 1.163 ± 0.296
3.779ThrAsn: 3.779 ± 0.875
4.433ThrPro: 4.433 ± 0.547
2.835ThrGln: 2.835 ± 0.536
3.198ThrArg: 3.198 ± 0.504
5.451ThrSer: 5.451 ± 1.394
6.977ThrThr: 6.977 ± 1.978
5.814ThrVal: 5.814 ± 1.721
0.727ThrTrp: 0.727 ± 0.211
2.326ThrTyr: 2.326 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
6.759ValAla: 6.759 ± 1.713
0.654ValCys: 0.654 ± 0.257
4.215ValAsp: 4.215 ± 0.648
3.561ValGlu: 3.561 ± 0.561
2.253ValPhe: 2.253 ± 0.42
3.925ValGly: 3.925 ± 0.447
1.454ValHis: 1.454 ± 0.472
2.835ValIle: 2.835 ± 0.623
3.561ValLys: 3.561 ± 0.508
3.925ValLeu: 3.925 ± 0.618
1.89ValMet: 1.89 ± 0.314
3.779ValAsn: 3.779 ± 0.566
2.398ValPro: 2.398 ± 0.474
2.471ValGln: 2.471 ± 0.427
2.326ValArg: 2.326 ± 0.4
3.634ValSer: 3.634 ± 0.653
7.486ValThr: 7.486 ± 2.414
4.361ValVal: 4.361 ± 0.605
0.436ValTrp: 0.436 ± 0.21
1.89ValTyr: 1.89 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
0.799TrpAla: 0.799 ± 0.293
0.145TrpCys: 0.145 ± 0.101
0.799TrpAsp: 0.799 ± 0.26
1.018TrpGlu: 1.018 ± 0.276
0.872TrpPhe: 0.872 ± 0.254
0.581TrpGly: 0.581 ± 0.2
0.291TrpHis: 0.291 ± 0.141
0.654TrpIle: 0.654 ± 0.203
0.799TrpLys: 0.799 ± 0.242
1.381TrpLeu: 1.381 ± 0.364
0.581TrpMet: 0.581 ± 0.194
0.291TrpAsn: 0.291 ± 0.152
0.291TrpPro: 0.291 ± 0.14
0.218TrpGln: 0.218 ± 0.118
0.218TrpArg: 0.218 ± 0.139
0.872TrpSer: 0.872 ± 0.226
0.727TrpThr: 0.727 ± 0.223
1.163TrpVal: 1.163 ± 0.292
0.145TrpTrp: 0.145 ± 0.092
0.073TrpTyr: 0.073 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.471TyrAla: 2.471 ± 0.425
0.218TyrCys: 0.218 ± 0.129
2.326TyrAsp: 2.326 ± 0.423
2.326TyrGlu: 2.326 ± 0.387
0.872TyrPhe: 0.872 ± 0.263
3.271TyrGly: 3.271 ± 0.495
0.799TyrHis: 0.799 ± 0.257
1.962TyrIle: 1.962 ± 0.421
2.471TyrLys: 2.471 ± 0.495
2.253TyrLeu: 2.253 ± 0.47
1.381TyrMet: 1.381 ± 0.308
1.962TyrAsn: 1.962 ± 0.387
0.727TyrPro: 0.727 ± 0.231
1.09TyrGln: 1.09 ± 0.276
1.89TyrArg: 1.89 ± 0.337
2.035TyrSer: 2.035 ± 0.379
1.381TyrThr: 1.381 ± 0.343
1.672TyrVal: 1.672 ± 0.347
0.291TyrTrp: 0.291 ± 0.13
1.308TyrTyr: 1.308 ± 0.27
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (13760 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski