Amino acid dipeptide frequency for Salmonella phage Astrid

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are overrepresented relative to this expectation are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
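As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice of normalizing by the total number of dipeptides are all assumptions made for this example; the database may count or normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein (never across
    protein boundaries) and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Expected value if all 400 standard dipeptides were equally likely: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKTAYIAKQR", "MAAAK"]
freqs = dipeptide_per_mille(toy)
overrepresented = sorted(d for d, f in freqs.items() if f > EXPECTED_PER_MILLE)
underrepresented = sorted(d for d, f in freqs.items() if f < EXPECTED_PER_MILLE)
```

With only a handful of residues, most of the 441 entries are zero and the few observed pairs look strongly overrepresented, which is why the comparison against 2.5‰ is only meaningful for a full proteome.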

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.784 ± 2.038
AlaCys: 0.0 ± 0.0
AlaAsp: 6.059 ± 1.237
AlaGlu: 2.754 ± 0.966
AlaPhe: 3.305 ± 1.21
AlaGly: 4.406 ± 0.999
AlaHis: 0.275 ± 0.289
AlaIle: 4.131 ± 0.911
AlaLys: 6.059 ± 1.389
AlaLeu: 6.885 ± 1.162
AlaMet: 1.652 ± 0.803
AlaAsn: 3.856 ± 1.087
AlaPro: 1.928 ± 0.93
AlaGln: 2.479 ± 1.051
AlaArg: 2.203 ± 1.009
AlaSer: 3.029 ± 0.694
AlaThr: 3.856 ± 1.339
AlaVal: 6.059 ± 1.652
AlaTrp: 0.826 ± 0.508
AlaTyr: 2.754 ± 0.952
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.551 ± 0.647
CysCys: 0.0 ± 0.0
CysAsp: 0.826 ± 0.542
CysGlu: 1.102 ± 0.649
CysPhe: 0.0 ± 0.0
CysGly: 0.275 ± 0.218
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.275 ± 0.238
CysLeu: 0.551 ± 0.391
CysMet: 0.275 ± 0.27
CysAsn: 0.551 ± 0.352
CysPro: 0.275 ± 0.238
CysGln: 0.0 ± 0.0
CysArg: 0.826 ± 0.552
CysSer: 0.826 ± 0.478
CysThr: 0.551 ± 0.795
CysVal: 0.551 ± 0.302
CysTrp: 0.0 ± 0.0
CysTyr: 0.551 ± 0.28
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.406 ± 0.987
AspCys: 0.275 ± 0.252
AspAsp: 1.928 ± 0.58
AspGlu: 3.58 ± 1.007
AspPhe: 1.377 ± 0.477
AspGly: 5.784 ± 1.112
AspHis: 0.275 ± 0.323
AspIle: 6.059 ± 1.337
AspLys: 4.682 ± 0.821
AspLeu: 7.161 ± 1.241
AspMet: 1.102 ± 0.439
AspAsn: 5.508 ± 2.497
AspPro: 1.377 ± 0.804
AspGln: 1.652 ± 0.533
AspArg: 1.377 ± 1.013
AspSer: 5.508 ± 0.919
AspThr: 4.957 ± 1.799
AspVal: 4.682 ± 0.898
AspTrp: 1.102 ± 0.651
AspTyr: 3.305 ± 0.841
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.928 ± 0.767
GluCys: 0.0 ± 0.0
GluAsp: 2.754 ± 0.888
GluGlu: 1.102 ± 0.509
GluPhe: 3.856 ± 1.208
GluGly: 3.856 ± 0.941
GluHis: 1.652 ± 0.968
GluIle: 4.957 ± 1.163
GluLys: 3.58 ± 0.996
GluLeu: 5.784 ± 1.24
GluMet: 2.203 ± 0.931
GluAsn: 7.161 ± 0.591
GluPro: 0.551 ± 0.434
GluGln: 3.029 ± 1.338
GluArg: 3.856 ± 0.889
GluSer: 4.682 ± 1.324
GluThr: 5.233 ± 0.977
GluVal: 4.406 ± 1.306
GluTrp: 0.826 ± 0.653
GluTyr: 3.305 ± 0.863
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.029 ± 0.72
PheCys: 0.0 ± 0.0
PheAsp: 4.131 ± 0.953
PheGlu: 1.928 ± 0.656
PhePhe: 2.479 ± 0.875
PheGly: 2.203 ± 0.517
PheHis: 0.275 ± 0.332
PheIle: 3.029 ± 1.14
PheLys: 2.203 ± 1.022
PheLeu: 1.377 ± 0.727
PheMet: 1.928 ± 0.756
PheAsn: 4.406 ± 1.464
PhePro: 1.652 ± 0.596
PheGln: 0.551 ± 0.302
PheArg: 0.826 ± 0.424
PheSer: 2.754 ± 0.419
PheThr: 4.957 ± 1.292
PheVal: 2.203 ± 0.828
PheTrp: 0.0 ± 0.0
PheTyr: 2.754 ± 0.912
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.305 ± 0.882
GlyCys: 0.0 ± 0.0
GlyAsp: 4.131 ± 0.835
GlyGlu: 6.885 ± 0.878
GlyPhe: 3.029 ± 0.683
GlyGly: 4.131 ± 0.713
GlyHis: 0.551 ± 0.307
GlyIle: 4.131 ± 0.928
GlyLys: 3.856 ± 0.8
GlyLeu: 4.131 ± 1.186
GlyMet: 1.928 ± 0.797
GlyAsn: 4.131 ± 1.062
GlyPro: 0.0 ± 0.0
GlyGln: 1.652 ± 0.722
GlyArg: 2.754 ± 0.946
GlySer: 3.856 ± 1.093
GlyThr: 3.856 ± 1.127
GlyVal: 7.711 ± 1.334
GlyTrp: 0.551 ± 0.386
GlyTyr: 2.754 ± 1.121
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.551 ± 0.419
HisCys: 0.0 ± 0.0
HisAsp: 1.102 ± 0.447
HisGlu: 1.377 ± 0.814
HisPhe: 0.826 ± 0.573
HisGly: 0.551 ± 0.504
HisHis: 0.275 ± 0.218
HisIle: 1.652 ± 0.886
HisLys: 0.551 ± 0.435
HisLeu: 1.377 ± 0.492
HisMet: 0.0 ± 0.0
HisAsn: 2.203 ± 0.753
HisPro: 0.275 ± 0.252
HisGln: 0.551 ± 0.435
HisArg: 0.0 ± 0.0
HisSer: 0.826 ± 0.519
HisThr: 0.551 ± 0.435
HisVal: 0.275 ± 0.323
HisTrp: 0.0 ± 0.0
HisTyr: 0.826 ± 0.474
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.58 ± 1.441
IleCys: 0.826 ± 0.521
IleAsp: 4.682 ± 0.955
IleGlu: 4.682 ± 1.234
IlePhe: 2.479 ± 1.062
IleGly: 3.305 ± 0.982
IleHis: 1.928 ± 0.568
IleIle: 3.305 ± 0.655
IleLys: 7.711 ± 1.45
IleLeu: 3.305 ± 1.009
IleMet: 1.102 ± 0.494
IleAsn: 5.784 ± 1.531
IlePro: 2.479 ± 0.789
IleGln: 1.652 ± 0.576
IleArg: 1.377 ± 0.542
IleSer: 5.508 ± 1.617
IleThr: 7.161 ± 1.271
IleVal: 3.305 ± 0.937
IleTrp: 0.275 ± 0.218
IleTyr: 2.754 ± 1.089
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.059 ± 0.955
LysCys: 1.377 ± 0.674
LysAsp: 3.029 ± 0.783
LysGlu: 5.784 ± 1.206
LysPhe: 3.58 ± 0.978
LysGly: 3.856 ± 1.213
LysHis: 1.102 ± 0.624
LysIle: 4.131 ± 1.087
LysLys: 2.754 ± 0.723
LysLeu: 7.161 ± 1.313
LysMet: 2.754 ± 0.721
LysAsn: 4.131 ± 0.846
LysPro: 2.754 ± 0.786
LysGln: 1.928 ± 0.539
LysArg: 4.406 ± 1.374
LysSer: 4.131 ± 0.925
LysThr: 6.61 ± 1.309
LysVal: 4.957 ± 1.567
LysTrp: 0.275 ± 0.289
LysTyr: 3.856 ± 1.391
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.784 ± 1.502
LeuCys: 0.826 ± 0.452
LeuAsp: 6.059 ± 0.898
LeuGlu: 3.029 ± 0.834
LeuPhe: 1.928 ± 0.572
LeuGly: 4.957 ± 0.859
LeuHis: 0.551 ± 0.28
LeuIle: 5.508 ± 1.447
LeuLys: 5.233 ± 1.498
LeuLeu: 5.233 ± 1.063
LeuMet: 1.928 ± 0.815
LeuAsn: 5.784 ± 1.381
LeuPro: 2.479 ± 0.641
LeuGln: 3.029 ± 0.843
LeuArg: 1.377 ± 0.584
LeuSer: 8.538 ± 1.118
LeuThr: 6.059 ± 1.556
LeuVal: 4.957 ± 0.892
LeuTrp: 0.826 ± 0.564
LeuTyr: 3.58 ± 0.873
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.102 ± 0.529
MetCys: 0.275 ± 0.218
MetAsp: 1.652 ± 0.815
MetGlu: 3.029 ± 1.157
MetPhe: 2.203 ± 0.888
MetGly: 1.928 ± 0.743
MetHis: 0.275 ± 0.323
MetIle: 1.102 ± 0.402
MetLys: 3.305 ± 0.685
MetLeu: 2.479 ± 0.578
MetMet: 1.102 ± 0.522
MetAsn: 2.203 ± 0.832
MetPro: 0.551 ± 0.567
MetGln: 1.102 ± 0.372
MetArg: 1.377 ± 0.719
MetSer: 1.928 ± 0.957
MetThr: 1.928 ± 0.947
MetVal: 0.826 ± 0.365
MetTrp: 0.551 ± 0.352
MetTyr: 1.377 ± 0.756
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.029 ± 0.785
AsnCys: 0.275 ± 0.323
AsnAsp: 5.233 ± 1.384
AsnGlu: 4.957 ± 0.915
AsnPhe: 2.754 ± 0.655
AsnGly: 5.784 ± 1.511
AsnHis: 1.377 ± 0.493
AsnIle: 5.233 ± 1.572
AsnLys: 7.987 ± 1.714
AsnLeu: 3.58 ± 1.467
AsnMet: 2.754 ± 0.839
AsnAsn: 4.406 ± 0.959
AsnPro: 1.928 ± 0.464
AsnGln: 2.203 ± 1.208
AsnArg: 1.652 ± 0.47
AsnSer: 3.305 ± 1.078
AsnThr: 3.856 ± 0.826
AsnVal: 4.682 ± 1.274
AsnTrp: 0.826 ± 0.403
AsnTyr: 2.754 ± 0.809
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.131 ± 0.989
ProCys: 0.275 ± 0.397
ProAsp: 2.754 ± 1.331
ProGlu: 2.754 ± 0.603
ProPhe: 1.652 ± 0.659
ProGly: 0.0 ± 0.0
ProHis: 0.551 ± 0.307
ProIle: 1.652 ± 0.753
ProLys: 1.652 ± 0.512
ProLeu: 1.102 ± 0.438
ProMet: 0.826 ± 0.433
ProAsn: 1.377 ± 0.646
ProPro: 0.551 ± 0.577
ProGln: 0.551 ± 0.328
ProArg: 0.275 ± 0.341
ProSer: 1.652 ± 0.74
ProThr: 0.826 ± 0.426
ProVal: 1.102 ± 0.544
ProTrp: 0.275 ± 0.218
ProTyr: 2.754 ± 0.552
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.856 ± 1.698
GlnCys: 0.826 ± 0.452
GlnAsp: 2.479 ± 0.902
GlnGlu: 2.754 ± 1.311
GlnPhe: 1.377 ± 0.589
GlnGly: 1.377 ± 0.43
GlnHis: 0.275 ± 0.251
GlnIle: 1.377 ± 0.538
GlnLys: 1.928 ± 0.755
GlnLeu: 3.029 ± 0.801
GlnMet: 1.652 ± 0.735
GlnAsn: 0.275 ± 0.289
GlnPro: 0.826 ± 0.427
GlnGln: 1.377 ± 1.146
GlnArg: 1.102 ± 0.69
GlnSer: 1.377 ± 0.75
GlnThr: 1.928 ± 0.802
GlnVal: 1.652 ± 0.692
GlnTrp: 0.275 ± 0.252
GlnTyr: 1.652 ± 0.496
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.102 ± 0.516
ArgCys: 0.551 ± 0.302
ArgAsp: 1.652 ± 0.604
ArgGlu: 2.479 ± 0.731
ArgPhe: 1.928 ± 0.591
ArgGly: 1.652 ± 0.698
ArgHis: 0.275 ± 0.218
ArgIle: 2.479 ± 1.045
ArgLys: 3.029 ± 0.759
ArgLeu: 4.131 ± 1.225
ArgMet: 0.275 ± 0.251
ArgAsn: 1.652 ± 0.72
ArgPro: 0.275 ± 0.218
ArgGln: 1.652 ± 0.72
ArgArg: 1.928 ± 0.839
ArgSer: 1.102 ± 0.602
ArgThr: 1.377 ± 0.418
ArgVal: 2.479 ± 0.74
ArgTrp: 0.275 ± 0.238
ArgTyr: 0.826 ± 0.497
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.856 ± 1.007
SerCys: 0.275 ± 0.341
SerAsp: 5.784 ± 1.623
SerGlu: 4.682 ± 1.363
SerPhe: 1.377 ± 0.661
SerGly: 5.508 ± 2.09
SerHis: 1.377 ± 0.548
SerIle: 4.957 ± 0.948
SerLys: 5.784 ± 1.468
SerLeu: 4.406 ± 1.079
SerMet: 2.754 ± 0.909
SerAsn: 2.754 ± 0.502
SerPro: 2.203 ± 0.498
SerGln: 1.652 ± 0.483
SerArg: 1.102 ± 0.431
SerSer: 4.682 ± 0.96
SerThr: 3.58 ± 0.941
SerVal: 6.059 ± 1.39
SerTrp: 0.826 ± 0.543
SerTyr: 1.652 ± 0.867
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.711 ± 2.026
ThrCys: 0.0 ± 0.0
ThrAsp: 4.957 ± 0.904
ThrGlu: 3.029 ± 0.929
ThrPhe: 1.377 ± 0.293
ThrGly: 5.784 ± 1.128
ThrHis: 0.826 ± 0.653
ThrIle: 3.856 ± 1.074
ThrLys: 4.406 ± 1.108
ThrLeu: 7.711 ± 1.681
ThrMet: 1.928 ± 1.004
ThrAsn: 2.479 ± 0.363
ThrPro: 2.754 ± 0.832
ThrGln: 1.928 ± 0.788
ThrArg: 1.102 ± 0.572
ThrSer: 4.682 ± 0.974
ThrThr: 4.957 ± 1.17
ThrVal: 6.885 ± 1.119
ThrTrp: 0.551 ± 0.32
ThrTyr: 3.58 ± 0.972
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.957 ± 1.613
ValCys: 0.551 ± 0.428
ValAsp: 4.957 ± 1.08
ValGlu: 5.233 ± 1.026
ValPhe: 3.305 ± 0.724
ValGly: 4.131 ± 0.831
ValHis: 1.377 ± 0.683
ValIle: 5.508 ± 1.926
ValLys: 6.059 ± 1.367
ValLeu: 4.406 ± 1.156
ValMet: 2.203 ± 0.664
ValAsn: 4.406 ± 0.99
ValPro: 2.203 ± 0.982
ValGln: 1.102 ± 0.667
ValArg: 2.203 ± 0.669
ValSer: 4.131 ± 1.377
ValThr: 5.233 ± 1.579
ValVal: 5.508 ± 1.877
ValTrp: 0.826 ± 0.47
ValTyr: 4.131 ± 0.841
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.275 ± 0.284
TrpCys: 0.275 ± 0.323
TrpAsp: 0.826 ± 0.433
TrpGlu: 1.377 ± 0.641
TrpPhe: 0.826 ± 0.597
TrpGly: 0.826 ± 0.47
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.551 ± 0.323
TrpLeu: 0.551 ± 0.337
TrpMet: 0.275 ± 0.218
TrpAsn: 0.826 ± 0.52
TrpPro: 0.551 ± 0.555
TrpGln: 1.102 ± 0.569
TrpArg: 0.0 ± 0.0
TrpSer: 0.275 ± 0.218
TrpThr: 0.0 ± 0.0
TrpVal: 0.551 ± 0.28
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.551 ± 0.337
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.305 ± 1.007
TyrCys: 1.377 ± 0.624
TyrAsp: 1.377 ± 0.484
TyrGlu: 2.203 ± 1.529
TyrPhe: 3.029 ± 1.134
TyrGly: 3.029 ± 0.701
TyrHis: 0.551 ± 0.435
TyrIle: 4.131 ± 1.189
TyrLys: 3.029 ± 0.649
TyrLeu: 3.305 ± 0.891
TyrMet: 1.377 ± 0.529
TyrAsn: 4.682 ± 1.221
TyrPro: 1.102 ± 0.649
TyrGln: 2.203 ± 0.933
TyrArg: 1.377 ± 0.564
TyrSer: 2.479 ± 1.014
TyrThr: 3.029 ± 1.451
TyrVal: 3.58 ± 1.11
TyrTrp: 0.551 ± 0.435
TyrTyr: 1.928 ± 0.861
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (3632 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
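The note above does not spell out the procedure, so the following Python sketch shows one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates is reported. The function names, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are assumptions for illustration, not the database's documented method.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "AK") in per mille."""
    count = total = 0
    for seq in proteins:
        n = len(seq) - 1  # number of overlapping pairs in this protein
        if n <= 0:
            continue
        total += n
        count += sum(1 for i in range(n) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the per mille estimate under protein-level resampling.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKTAYIAKQR", "MAAAK", "GAKLLV"]
ak_value = pair_per_mille(toy, "AK")
ak_error = bootstrap_error(toy, "AK")
ww_error = bootstrap_error(toy, "WW")  # pair never observed in the toy set
```

Under this reading, a dipeptide that never occurs in any protein gets an error of exactly zero, since every replicate yields the same estimate. That matches the 0.0 ± 0.0 entries in the table, although it does not confirm the procedure.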

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski