Amino acid dipeptide frequency for Streptococcus phage SFi18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
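The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names and the toy sequences are hypothetical, and it assumes that pairs are counted as overlapping adjacent residues within each protein and that any residue outside the 20 standard one-letter codes is mapped to Xaa.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return observed dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the phage proteome.
proteins = ["MKKLAE", "AEKK"]
freqs = dipeptide_per_mille(proteins)

# Uniform expectation if all 400 standard dipeptides were equally likely.
uniform_per_mille = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

print(freqs["LysLys"])  # 2 of the 8 pairs in the toy data -> 250.0
```

Pairs that never occur are simply absent from the returned dictionary; a full 21 x 21 table would fill them in as 0.0.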

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
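As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and percentages and compares them with the uniform expectation of 1/400. The helper names are hypothetical; the two sample values (LysGlu and AlaCys) are taken from the table, and the comparison assumes that "expected" means the uniform 0.25% mentioned above.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def ratio_to_uniform(value):
    """Above 1: more frequent than the uniform expectation; below 1: less."""
    return value / UNIFORM_PER_MILLE


lys_glu = 9.764  # LysGlu, from the table
ala_cys = 0.238  # AlaCys, from the table

for name, value in (("LysGlu", lys_glu), ("AlaCys", ala_cys)):
    print(name,
          round(per_mille_to_fraction(value), 6),
          round(per_mille_to_percent(value), 4),
          round(ratio_to_uniform(value), 2))
```

Under this assumption LysGlu is roughly four times more frequent than a uniform distribution would predict, while AlaCys is roughly ten times less frequent.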

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.143 ± 0.673
AlaCys: 0.238 ± 0.23
AlaAsp: 3.572 ± 0.814
AlaGlu: 2.382 ± 0.924
AlaPhe: 1.191 ± 0.519
AlaGly: 2.858 ± 1.098
AlaHis: 1.191 ± 0.507
AlaIle: 5.239 ± 0.878
AlaLys: 6.668 ± 1.344
AlaLeu: 4.287 ± 0.874
AlaMet: 2.62 ± 0.943
AlaAsn: 1.667 ± 0.857
AlaPro: 2.382 ± 0.618
AlaGln: 0.714 ± 0.415
AlaArg: 4.287 ± 2.141
AlaSer: 2.382 ± 0.549
AlaThr: 2.858 ± 0.872
AlaVal: 2.143 ± 0.726
AlaTrp: 2.382 ± 1.493
AlaTyr: 3.096 ± 0.852
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.238 ± 0.222
CysAsp: 1.667 ± 0.881
CysGlu: 0.476 ± 0.291
CysPhe: 0.238 ± 0.241
CysGly: 1.191 ± 0.782
CysHis: 0.714 ± 0.396
CysIle: 0.476 ± 0.291
CysLys: 0.953 ± 0.405
CysLeu: 0.953 ± 0.551
CysMet: 0.0 ± 0.0
CysAsn: 0.714 ± 0.367
CysPro: 0.238 ± 0.2
CysGln: 0.238 ± 0.2
CysArg: 0.238 ± 0.209
CysSer: 0.476 ± 0.297
CysThr: 0.238 ± 0.199
CysVal: 0.476 ± 0.323
CysTrp: 0.238 ± 0.24
CysTyr: 0.476 ± 0.337
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.62 ± 1.218
AspCys: 1.191 ± 0.489
AspAsp: 4.763 ± 0.825
AspGlu: 4.287 ± 0.872
AspPhe: 3.81 ± 1.112
AspGly: 5.001 ± 1.044
AspHis: 0.714 ± 0.367
AspIle: 5.001 ± 1.209
AspLys: 6.668 ± 1.673
AspLeu: 3.572 ± 0.816
AspMet: 1.905 ± 0.643
AspAsn: 5.477 ± 1.244
AspPro: 1.191 ± 0.54
AspGln: 1.191 ± 0.486
AspArg: 2.62 ± 0.776
AspSer: 2.382 ± 0.614
AspThr: 2.858 ± 0.802
AspVal: 4.049 ± 1.07
AspTrp: 1.429 ± 0.545
AspTyr: 4.525 ± 1.303
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.239 ± 1.028
GluCys: 0.476 ± 0.303
GluAsp: 4.049 ± 1.042
GluGlu: 5.239 ± 1.225
GluPhe: 4.049 ± 0.783
GluGly: 4.049 ± 0.759
GluHis: 1.667 ± 0.632
GluIle: 5.239 ± 1.31
GluLys: 6.192 ± 1.073
GluLeu: 7.145 ± 1.535
GluMet: 2.143 ± 0.721
GluAsn: 5.239 ± 1.262
GluPro: 2.858 ± 1.022
GluGln: 2.382 ± 0.584
GluArg: 2.382 ± 0.656
GluSer: 2.62 ± 0.538
GluThr: 2.858 ± 0.699
GluVal: 7.621 ± 1.569
GluTrp: 2.858 ± 0.736
GluTyr: 5.239 ± 1.228
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.905 ± 0.601
PheCys: 0.238 ± 0.258
PheAsp: 3.334 ± 0.731
PheGlu: 3.572 ± 0.845
PhePhe: 1.667 ± 0.529
PheGly: 3.334 ± 0.761
PheHis: 0.238 ± 0.227
PheIle: 2.62 ± 0.931
PheLys: 5.954 ± 1.064
PheLeu: 3.096 ± 0.949
PheMet: 1.191 ± 0.543
PheAsn: 2.858 ± 0.999
PhePro: 0.0 ± 0.0
PheGln: 2.143 ± 0.652
PheArg: 1.429 ± 0.601
PheSer: 2.382 ± 0.811
PheThr: 1.667 ± 0.704
PheVal: 1.191 ± 0.532
PheTrp: 0.714 ± 0.342
PheTyr: 1.429 ± 0.615
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.81 ± 1.265
GlyCys: 0.476 ± 0.327
GlyAsp: 4.525 ± 0.981
GlyGlu: 3.096 ± 0.644
GlyPhe: 2.143 ± 0.852
GlyGly: 3.81 ± 1.341
GlyHis: 0.714 ± 0.367
GlyIle: 4.287 ± 1.103
GlyLys: 6.906 ± 1.587
GlyLeu: 5.239 ± 1.204
GlyMet: 0.953 ± 0.412
GlyAsn: 5.477 ± 2.641
GlyPro: 0.238 ± 0.24
GlyGln: 1.667 ± 0.879
GlyArg: 3.096 ± 1.107
GlySer: 3.096 ± 0.829
GlyThr: 3.81 ± 0.895
GlyVal: 2.382 ± 0.77
GlyTrp: 0.953 ± 0.411
GlyTyr: 3.334 ± 0.815
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.476 ± 0.332
HisCys: 0.0 ± 0.0
HisAsp: 0.953 ± 0.382
HisGlu: 1.191 ± 0.424
HisPhe: 0.238 ± 0.246
HisGly: 1.429 ± 0.606
HisHis: 0.714 ± 0.413
HisIle: 0.714 ± 0.382
HisLys: 1.429 ± 0.554
HisLeu: 0.953 ± 0.483
HisMet: 0.238 ± 0.239
HisAsn: 0.0 ± 0.0
HisPro: 0.714 ± 0.36
HisGln: 0.476 ± 0.305
HisArg: 1.191 ± 0.409
HisSer: 1.429 ± 0.39
HisThr: 0.953 ± 0.5
HisVal: 1.191 ± 0.459
HisTrp: 0.238 ± 0.233
HisTyr: 1.191 ± 0.543
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.525 ± 1.344
IleCys: 0.238 ± 0.2
IleAsp: 5.477 ± 1.24
IleGlu: 6.668 ± 1.21
IlePhe: 1.191 ± 0.605
IleGly: 3.096 ± 0.812
IleHis: 0.476 ± 0.324
IleIle: 4.049 ± 1.043
IleLys: 5.954 ± 0.944
IleLeu: 4.763 ± 1.037
IleMet: 1.667 ± 0.739
IleAsn: 6.906 ± 1.187
IlePro: 3.096 ± 0.828
IleGln: 2.382 ± 0.573
IleArg: 3.096 ± 0.788
IleSer: 4.763 ± 0.73
IleThr: 1.905 ± 0.621
IleVal: 4.049 ± 0.995
IleTrp: 0.714 ± 0.327
IleTyr: 3.572 ± 1.015
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.192 ± 1.205
LysCys: 0.714 ± 0.295
LysAsp: 7.145 ± 1.353
LysGlu: 9.764 ± 1.421
LysPhe: 3.572 ± 1.193
LysGly: 4.763 ± 0.944
LysHis: 0.953 ± 0.405
LysIle: 5.477 ± 1.644
LysLys: 9.05 ± 1.531
LysLeu: 5.954 ± 1.112
LysMet: 2.62 ± 0.854
LysAsn: 6.192 ± 1.321
LysPro: 4.287 ± 0.851
LysGln: 3.334 ± 0.927
LysArg: 2.858 ± 0.804
LysSer: 3.334 ± 1.062
LysThr: 5.239 ± 0.891
LysVal: 6.192 ± 1.239
LysTrp: 1.191 ± 0.462
LysTyr: 4.525 ± 1.118
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.763 ± 0.941
LeuCys: 0.953 ± 0.434
LeuAsp: 5.477 ± 0.798
LeuGlu: 6.906 ± 2.026
LeuPhe: 4.525 ± 0.875
LeuGly: 4.287 ± 0.854
LeuHis: 1.191 ± 0.607
LeuIle: 4.525 ± 1.056
LeuLys: 5.716 ± 1.166
LeuLeu: 4.763 ± 0.878
LeuMet: 1.429 ± 0.375
LeuAsn: 4.525 ± 1.065
LeuPro: 2.858 ± 0.699
LeuGln: 2.62 ± 0.747
LeuArg: 4.049 ± 1.061
LeuSer: 5.239 ± 0.921
LeuThr: 4.525 ± 1.454
LeuVal: 5.239 ± 0.976
LeuTrp: 0.953 ± 0.61
LeuTyr: 1.905 ± 0.67
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.143 ± 0.706
MetCys: 0.476 ± 0.291
MetAsp: 0.953 ± 0.511
MetGlu: 2.858 ± 1.008
MetPhe: 0.714 ± 0.395
MetGly: 0.953 ± 0.522
MetHis: 0.714 ± 0.384
MetIle: 1.667 ± 0.573
MetLys: 3.572 ± 0.945
MetLeu: 0.953 ± 0.548
MetMet: 0.476 ± 0.36
MetAsn: 1.905 ± 0.568
MetPro: 0.714 ± 0.327
MetGln: 0.714 ± 0.353
MetArg: 1.667 ± 0.528
MetSer: 1.429 ± 0.663
MetThr: 1.905 ± 0.556
MetVal: 2.143 ± 0.583
MetTrp: 0.0 ± 0.0
MetTyr: 1.191 ± 0.518
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 6.43 ± 2.816
AsnCys: 0.714 ± 0.357
AsnAsp: 4.049 ± 0.818
AsnGlu: 4.525 ± 0.938
AsnPhe: 1.429 ± 0.885
AsnGly: 4.049 ± 1.517
AsnHis: 1.667 ± 0.6
AsnIle: 3.81 ± 0.656
AsnLys: 5.239 ± 1.03
AsnLeu: 5.001 ± 0.916
AsnMet: 1.667 ± 0.55
AsnAsn: 2.858 ± 1.198
AsnPro: 1.429 ± 0.609
AsnGln: 2.62 ± 0.657
AsnArg: 2.62 ± 0.716
AsnSer: 2.143 ± 0.541
AsnThr: 3.334 ± 0.702
AsnVal: 3.572 ± 0.723
AsnTrp: 2.382 ± 0.588
AsnTyr: 3.81 ± 0.984
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.714 ± 0.359
ProCys: 0.238 ± 0.252
ProAsp: 1.905 ± 0.517
ProGlu: 2.858 ± 0.738
ProPhe: 1.191 ± 0.423
ProGly: 1.191 ± 0.379
ProHis: 0.0 ± 0.0
ProIle: 1.429 ± 0.619
ProLys: 2.382 ± 0.822
ProLeu: 1.905 ± 0.942
ProMet: 0.476 ± 0.301
ProAsn: 1.667 ± 0.576
ProPro: 1.429 ± 0.805
ProGln: 1.191 ± 0.754
ProArg: 0.953 ± 0.475
ProSer: 2.62 ± 0.651
ProThr: 1.905 ± 0.739
ProVal: 3.096 ± 1.068
ProTrp: 0.238 ± 0.24
ProTyr: 0.953 ± 0.464
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.334 ± 0.655
GlnCys: 0.714 ± 0.354
GlnAsp: 1.429 ± 0.438
GlnGlu: 3.334 ± 0.823
GlnPhe: 2.143 ± 0.662
GlnGly: 1.905 ± 0.661
GlnHis: 0.476 ± 0.296
GlnIle: 2.143 ± 0.818
GlnLys: 2.382 ± 0.535
GlnLeu: 3.334 ± 0.784
GlnMet: 0.714 ± 0.392
GlnAsn: 1.191 ± 0.472
GlnPro: 0.0 ± 0.0
GlnGln: 1.667 ± 0.484
GlnArg: 1.191 ± 0.552
GlnSer: 2.143 ± 0.809
GlnThr: 1.429 ± 0.491
GlnVal: 0.953 ± 0.409
GlnTrp: 0.476 ± 0.318
GlnTyr: 1.191 ± 0.479
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.191 ± 0.597
ArgCys: 0.0 ± 0.0
ArgAsp: 2.143 ± 0.607
ArgGlu: 4.049 ± 0.938
ArgPhe: 1.905 ± 0.687
ArgGly: 1.905 ± 0.525
ArgHis: 0.714 ± 0.455
ArgIle: 4.525 ± 0.787
ArgLys: 3.572 ± 0.831
ArgLeu: 3.81 ± 0.919
ArgMet: 2.143 ± 0.58
ArgAsn: 3.334 ± 0.92
ArgPro: 0.714 ± 0.411
ArgGln: 0.953 ± 0.382
ArgArg: 2.382 ± 0.675
ArgSer: 2.382 ± 0.622
ArgThr: 2.382 ± 0.93
ArgVal: 4.763 ± 2.443
ArgTrp: 0.476 ± 0.336
ArgTyr: 1.667 ± 0.533
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 0.953 ± 0.456
SerCys: 0.953 ± 0.448
SerAsp: 2.62 ± 0.733
SerGlu: 3.334 ± 0.844
SerPhe: 2.143 ± 0.682
SerGly: 3.572 ± 0.77
SerHis: 1.191 ± 0.495
SerIle: 4.525 ± 0.883
SerLys: 4.763 ± 1.162
SerLeu: 3.81 ± 0.826
SerMet: 1.905 ± 0.696
SerAsn: 3.096 ± 0.995
SerPro: 1.191 ± 0.341
SerGln: 1.667 ± 0.536
SerArg: 1.905 ± 0.591
SerSer: 3.096 ± 0.883
SerThr: 2.858 ± 0.838
SerVal: 4.049 ± 0.965
SerTrp: 1.191 ± 0.53
SerTyr: 1.667 ± 0.672
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.905 ± 0.716
ThrCys: 0.476 ± 0.337
ThrAsp: 2.143 ± 0.528
ThrGlu: 2.62 ± 0.729
ThrPhe: 3.096 ± 0.862
ThrGly: 2.62 ± 0.798
ThrHis: 1.429 ± 0.421
ThrIle: 3.81 ± 0.952
ThrLys: 5.954 ± 0.924
ThrLeu: 5.001 ± 1.49
ThrMet: 0.953 ± 0.432
ThrAsn: 2.858 ± 0.491
ThrPro: 2.143 ± 0.731
ThrGln: 2.62 ± 0.615
ThrArg: 2.382 ± 0.78
ThrSer: 3.096 ± 0.86
ThrThr: 2.382 ± 0.731
ThrVal: 2.858 ± 0.593
ThrTrp: 0.238 ± 0.252
ThrTyr: 2.858 ± 0.621
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.382 ± 0.65
ValCys: 1.667 ± 1.439
ValAsp: 5.477 ± 1.294
ValGlu: 5.239 ± 1.348
ValPhe: 1.667 ± 0.538
ValGly: 4.049 ± 0.924
ValHis: 0.476 ± 0.352
ValIle: 4.525 ± 1.028
ValLys: 6.43 ± 1.216
ValLeu: 5.001 ± 1.28
ValMet: 1.667 ± 0.715
ValAsn: 5.001 ± 1.509
ValPro: 1.667 ± 0.592
ValGln: 0.953 ± 0.398
ValArg: 1.905 ± 0.531
ValSer: 1.905 ± 0.551
ValThr: 5.239 ± 1.192
ValVal: 3.572 ± 0.885
ValTrp: 1.191 ± 0.596
ValTyr: 5.477 ± 2.58
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.953 ± 0.46
TrpCys: 0.238 ± 0.209
TrpAsp: 1.429 ± 0.712
TrpGlu: 1.191 ± 0.432
TrpPhe: 1.667 ± 0.641
TrpGly: 0.238 ± 0.24
TrpHis: 0.238 ± 0.209
TrpIle: 0.953 ± 0.414
TrpLys: 0.476 ± 0.342
TrpLeu: 1.905 ± 0.573
TrpMet: 0.476 ± 0.326
TrpAsn: 0.953 ± 0.774
TrpPro: 0.238 ± 0.199
TrpGln: 0.953 ± 0.406
TrpArg: 1.667 ± 0.538
TrpSer: 0.714 ± 0.442
TrpThr: 0.714 ± 0.408
TrpVal: 2.858 ± 1.45
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.953 ± 0.682
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.62 ± 0.754
TyrCys: 0.0 ± 0.0
TyrAsp: 2.143 ± 0.663
TyrGlu: 5.954 ± 1.22
TyrPhe: 2.382 ± 0.641
TyrGly: 5.716 ± 3.059
TyrHis: 0.238 ± 0.231
TyrIle: 3.572 ± 0.821
TyrLys: 3.096 ± 0.828
TyrLeu: 5.001 ± 0.837
TyrMet: 1.667 ± 0.56
TyrAsn: 1.429 ± 0.571
TyrPro: 0.953 ± 0.435
TyrGln: 1.905 ± 0.489
TyrArg: 3.096 ± 1.435
TyrSer: 2.858 ± 0.814
TyrThr: 2.382 ± 0.559
TyrVal: 3.096 ± 0.703
TyrTrp: 0.953 ± 0.436
TyrTyr: 4.287 ± 1.251
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 26 proteins (4200 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
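One way such an error estimate could be obtained is sketched below. The page states only that bootstrapping was done 100 times at the protein level; the rest is assumed for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation of those estimates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one one-letter-code pair, e.g. "KE"."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is a standard deviation; the source
    does not say which spread measure it uses.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
proteins = ["MKKLAE", "AEKK", "MKEKEL", "GGSKKA"]
print(pair_per_mille(proteins, "KK"), bootstrap_error(proteins, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in one or two proteins gets a larger error. A dipeptide that never occurs yields 0.0 in every resample, which matches the 0.0 ± 0.0 entries in the table.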

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski