Amino acid dipepetide frequency for Streptococcus satellite phage Javan365

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.326AlaAla: 0.326 ± 0.352
0.652AlaCys: 0.652 ± 0.396
5.219AlaAsp: 5.219 ± 1.743
5.545AlaGlu: 5.545 ± 1.743
4.566AlaPhe: 4.566 ± 1.361
3.588AlaGly: 3.588 ± 1.024
1.305AlaHis: 1.305 ± 0.647
2.935AlaIle: 2.935 ± 0.851
3.262AlaLys: 3.262 ± 0.652
5.871AlaLeu: 5.871 ± 0.95
2.935AlaMet: 2.935 ± 1.213
3.262AlaAsn: 3.262 ± 1.007
1.957AlaPro: 1.957 ± 0.623
0.652AlaGln: 0.652 ± 0.35
1.957AlaArg: 1.957 ± 0.875
4.892AlaSer: 4.892 ± 1.141
3.262AlaThr: 3.262 ± 1.032
2.935AlaVal: 2.935 ± 0.85
0.326AlaTrp: 0.326 ± 0.33
1.305AlaTyr: 1.305 ± 0.515
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.602
0.0CysCys: 0.0 ± 0.0
0.652CysAsp: 0.652 ± 0.367
0.326CysGlu: 0.326 ± 0.364
0.978CysPhe: 0.978 ± 0.989
0.652CysGly: 0.652 ± 0.654
0.0CysHis: 0.0 ± 0.0
0.652CysIle: 0.652 ± 0.411
0.326CysLys: 0.326 ± 0.272
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.978CysAsn: 0.978 ± 0.555
0.326CysPro: 0.326 ± 0.327
1.305CysGln: 1.305 ± 0.756
0.978CysArg: 0.978 ± 0.47
0.326CysSer: 0.326 ± 0.379
0.652CysThr: 0.652 ± 0.373
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.978AspAla: 0.978 ± 0.846
0.978AspCys: 0.978 ± 0.525
2.609AspAsp: 2.609 ± 0.581
2.935AspGlu: 2.935 ± 0.843
3.262AspPhe: 3.262 ± 0.839
3.588AspGly: 3.588 ± 1.2
0.326AspHis: 0.326 ± 0.291
5.219AspIle: 5.219 ± 1.169
5.871AspLys: 5.871 ± 1.456
7.502AspLeu: 7.502 ± 1.609
3.588AspMet: 3.588 ± 0.915
1.305AspAsn: 1.305 ± 0.5
0.652AspPro: 0.652 ± 0.381
1.305AspGln: 1.305 ± 0.52
2.283AspArg: 2.283 ± 0.689
4.24AspSer: 4.24 ± 1.249
3.914AspThr: 3.914 ± 1.042
2.609AspVal: 2.609 ± 0.929
1.305AspTrp: 1.305 ± 0.599
4.566AspTyr: 4.566 ± 1.132
0.0AspXaa: 0.0 ± 0.0
Glu
5.545GluAla: 5.545 ± 1.116
0.652GluCys: 0.652 ± 0.397
6.849GluAsp: 6.849 ± 1.682
4.892GluGlu: 4.892 ± 1.245
2.609GluPhe: 2.609 ± 0.837
3.588GluGly: 3.588 ± 0.968
1.305GluHis: 1.305 ± 0.474
4.892GluIle: 4.892 ± 1.478
9.132GluLys: 9.132 ± 1.586
8.806GluLeu: 8.806 ± 1.051
1.957GluMet: 1.957 ± 0.936
4.892GluAsn: 4.892 ± 0.967
2.609GluPro: 2.609 ± 0.843
2.935GluGln: 2.935 ± 1.092
5.545GluArg: 5.545 ± 1.598
4.566GluSer: 4.566 ± 1.074
2.935GluThr: 2.935 ± 1.026
2.935GluVal: 2.935 ± 0.845
1.631GluTrp: 1.631 ± 0.598
3.262GluTyr: 3.262 ± 1.121
0.0GluXaa: 0.0 ± 0.0
Phe
0.978PheAla: 0.978 ± 0.503
0.326PheCys: 0.326 ± 0.272
2.935PheAsp: 2.935 ± 0.83
4.892PheGlu: 4.892 ± 1.147
1.631PhePhe: 1.631 ± 0.7
3.914PheGly: 3.914 ± 1.146
0.326PheHis: 0.326 ± 0.292
3.262PheIle: 3.262 ± 0.949
4.892PheLys: 4.892 ± 1.614
2.935PheLeu: 2.935 ± 0.909
0.326PheMet: 0.326 ± 0.348
2.935PheAsn: 2.935 ± 1.146
0.978PhePro: 0.978 ± 0.635
2.283PheGln: 2.283 ± 0.741
1.631PheArg: 1.631 ± 0.692
2.283PheSer: 2.283 ± 0.851
2.609PheThr: 2.609 ± 0.912
1.631PheVal: 1.631 ± 0.913
0.652PheTrp: 0.652 ± 0.37
2.283PheTyr: 2.283 ± 0.862
0.0PheXaa: 0.0 ± 0.0
Gly
3.914GlyAla: 3.914 ± 1.89
0.978GlyCys: 0.978 ± 0.632
4.566GlyAsp: 4.566 ± 1.099
5.545GlyGlu: 5.545 ± 1.41
0.978GlyPhe: 0.978 ± 0.645
2.283GlyGly: 2.283 ± 0.87
0.978GlyHis: 0.978 ± 0.5
4.892GlyIle: 4.892 ± 1.17
4.24GlyLys: 4.24 ± 1.125
5.871GlyLeu: 5.871 ± 1.319
1.631GlyMet: 1.631 ± 0.698
2.283GlyAsn: 2.283 ± 0.951
0.0GlyPro: 0.0 ± 0.0
0.326GlyGln: 0.326 ± 0.294
2.609GlyArg: 2.609 ± 0.897
2.935GlySer: 2.935 ± 1.259
4.892GlyThr: 4.892 ± 1.393
2.935GlyVal: 2.935 ± 1.341
1.305GlyTrp: 1.305 ± 0.728
2.283GlyTyr: 2.283 ± 0.893
0.0GlyXaa: 0.0 ± 0.0
His
3.262HisAla: 3.262 ± 1.036
0.0HisCys: 0.0 ± 0.0
0.978HisAsp: 0.978 ± 0.714
0.978HisGlu: 0.978 ± 0.6
0.652HisPhe: 0.652 ± 0.37
2.609HisGly: 2.609 ± 1.159
0.652HisHis: 0.652 ± 0.397
0.326HisIle: 0.326 ± 0.354
1.305HisLys: 1.305 ± 0.589
1.957HisLeu: 1.957 ± 0.573
0.326HisMet: 0.326 ± 0.292
0.326HisAsn: 0.326 ± 0.272
0.652HisPro: 0.652 ± 0.432
0.326HisGln: 0.326 ± 0.304
0.0HisArg: 0.0 ± 0.0
1.305HisSer: 1.305 ± 0.566
1.631HisThr: 1.631 ± 0.55
0.326HisVal: 0.326 ± 0.43
0.0HisTrp: 0.0 ± 0.0
1.305HisTyr: 1.305 ± 0.469
0.0HisXaa: 0.0 ± 0.0
Ile
4.24IleAla: 4.24 ± 1.028
0.652IleCys: 0.652 ± 0.416
5.545IleAsp: 5.545 ± 1.56
5.219IleGlu: 5.219 ± 1.6
2.609IlePhe: 2.609 ± 0.838
3.262IleGly: 3.262 ± 0.741
0.978IleHis: 0.978 ± 0.591
4.892IleIle: 4.892 ± 1.085
5.871IleLys: 5.871 ± 1.127
2.609IleLeu: 2.609 ± 0.856
0.652IleMet: 0.652 ± 0.396
4.24IleAsn: 4.24 ± 0.986
3.588IlePro: 3.588 ± 0.967
2.609IleGln: 2.609 ± 0.881
2.609IleArg: 2.609 ± 1.044
7.175IleSer: 7.175 ± 1.197
2.935IleThr: 2.935 ± 1.149
2.935IleVal: 2.935 ± 1.049
0.326IleTrp: 0.326 ± 0.354
2.609IleTyr: 2.609 ± 0.723
0.0IleXaa: 0.0 ± 0.0
Lys
5.219LysAla: 5.219 ± 1.667
0.652LysCys: 0.652 ± 0.437
2.935LysAsp: 2.935 ± 0.862
8.806LysGlu: 8.806 ± 1.245
1.305LysPhe: 1.305 ± 0.433
5.871LysGly: 5.871 ± 1.936
3.588LysHis: 3.588 ± 0.729
5.219LysIle: 5.219 ± 1.455
7.502LysLys: 7.502 ± 1.686
8.806LysLeu: 8.806 ± 1.27
1.305LysMet: 1.305 ± 0.658
5.545LysAsn: 5.545 ± 0.994
6.197LysPro: 6.197 ± 1.66
4.566LysGln: 4.566 ± 1.141
6.197LysArg: 6.197 ± 1.596
3.262LysSer: 3.262 ± 1.078
4.892LysThr: 4.892 ± 0.789
4.892LysVal: 4.892 ± 0.856
1.305LysTrp: 1.305 ± 0.605
2.935LysTyr: 2.935 ± 0.929
0.0LysXaa: 0.0 ± 0.0
Leu
5.545LeuAla: 5.545 ± 1.571
0.0LeuCys: 0.0 ± 0.0
7.502LeuAsp: 7.502 ± 1.253
11.089LeuGlu: 11.089 ± 2.984
4.566LeuPhe: 4.566 ± 1.319
4.24LeuGly: 4.24 ± 1.994
0.978LeuHis: 0.978 ± 0.463
4.566LeuIle: 4.566 ± 1.232
6.849LeuLys: 6.849 ± 1.145
11.742LeuLeu: 11.742 ± 1.989
2.609LeuMet: 2.609 ± 0.809
5.219LeuAsn: 5.219 ± 1.113
2.935LeuPro: 2.935 ± 1.129
4.24LeuGln: 4.24 ± 1.12
2.609LeuArg: 2.609 ± 0.697
6.197LeuSer: 6.197 ± 1.382
5.871LeuThr: 5.871 ± 1.236
6.523LeuVal: 6.523 ± 1.105
0.652LeuTrp: 0.652 ± 0.373
3.914LeuTyr: 3.914 ± 1.401
0.0LeuXaa: 0.0 ± 0.0
Met
1.957MetAla: 1.957 ± 1.249
0.0MetCys: 0.0 ± 0.0
1.957MetAsp: 1.957 ± 0.596
2.283MetGlu: 2.283 ± 0.764
0.652MetPhe: 0.652 ± 0.511
0.652MetGly: 0.652 ± 0.411
0.0MetHis: 0.0 ± 0.0
1.305MetIle: 1.305 ± 0.547
1.957MetLys: 1.957 ± 0.823
1.631MetLeu: 1.631 ± 0.827
0.0MetMet: 0.0 ± 0.0
2.283MetAsn: 2.283 ± 0.791
0.652MetPro: 0.652 ± 0.505
0.978MetGln: 0.978 ± 0.463
1.305MetArg: 1.305 ± 0.69
1.631MetSer: 1.631 ± 0.556
3.914MetThr: 3.914 ± 1.118
2.935MetVal: 2.935 ± 0.998
0.0MetTrp: 0.0 ± 0.0
0.978MetTyr: 0.978 ± 0.656
0.0MetXaa: 0.0 ± 0.0
Asn
1.957AsnAla: 1.957 ± 0.861
0.0AsnCys: 0.0 ± 0.0
2.609AsnAsp: 2.609 ± 0.755
2.935AsnGlu: 2.935 ± 0.916
1.305AsnPhe: 1.305 ± 0.644
5.545AsnGly: 5.545 ± 0.799
1.957AsnHis: 1.957 ± 0.576
3.262AsnIle: 3.262 ± 1.035
4.892AsnLys: 4.892 ± 1.103
6.523AsnLeu: 6.523 ± 1.025
1.631AsnMet: 1.631 ± 0.685
2.935AsnAsn: 2.935 ± 0.884
3.914AsnPro: 3.914 ± 1.071
0.978AsnGln: 0.978 ± 0.499
2.609AsnArg: 2.609 ± 0.997
1.957AsnSer: 1.957 ± 0.692
1.631AsnThr: 1.631 ± 0.759
1.631AsnVal: 1.631 ± 0.72
1.305AsnTrp: 1.305 ± 0.554
2.609AsnTyr: 2.609 ± 0.858
0.0AsnXaa: 0.0 ± 0.0
Pro
2.609ProAla: 2.609 ± 0.694
0.0ProCys: 0.0 ± 0.0
2.283ProAsp: 2.283 ± 0.701
1.631ProGlu: 1.631 ± 0.874
3.262ProPhe: 3.262 ± 1.299
0.978ProGly: 0.978 ± 0.748
0.326ProHis: 0.326 ± 0.43
2.283ProIle: 2.283 ± 0.896
4.566ProLys: 4.566 ± 1.265
1.957ProLeu: 1.957 ± 0.659
1.305ProMet: 1.305 ± 0.611
2.609ProAsn: 2.609 ± 1.357
0.652ProPro: 0.652 ± 0.44
0.978ProGln: 0.978 ± 0.555
2.609ProArg: 2.609 ± 0.945
0.978ProSer: 0.978 ± 0.483
1.957ProThr: 1.957 ± 0.815
3.588ProVal: 3.588 ± 0.876
0.326ProTrp: 0.326 ± 0.272
0.978ProTyr: 0.978 ± 0.529
0.0ProXaa: 0.0 ± 0.0
Gln
3.914GlnAla: 3.914 ± 1.33
0.326GlnCys: 0.326 ± 0.379
1.305GlnAsp: 1.305 ± 0.531
5.871GlnGlu: 5.871 ± 0.855
0.978GlnPhe: 0.978 ± 0.645
1.631GlnGly: 1.631 ± 0.95
0.0GlnHis: 0.0 ± 0.0
1.631GlnIle: 1.631 ± 0.619
3.262GlnLys: 3.262 ± 0.825
4.24GlnLeu: 4.24 ± 1.043
0.652GlnMet: 0.652 ± 0.488
1.631GlnAsn: 1.631 ± 0.837
0.652GlnPro: 0.652 ± 0.349
0.652GlnGln: 0.652 ± 0.37
2.283GlnArg: 2.283 ± 0.588
0.652GlnSer: 0.652 ± 0.445
1.631GlnThr: 1.631 ± 0.572
1.305GlnVal: 1.305 ± 0.547
0.0GlnTrp: 0.0 ± 0.0
1.631GlnTyr: 1.631 ± 0.728
0.0GlnXaa: 0.0 ± 0.0
Arg
3.262ArgAla: 3.262 ± 0.9
0.978ArgCys: 0.978 ± 0.566
0.978ArgAsp: 0.978 ± 0.528
3.914ArgGlu: 3.914 ± 0.801
2.609ArgPhe: 2.609 ± 1.056
3.914ArgGly: 3.914 ± 1.143
1.305ArgHis: 1.305 ± 0.478
2.935ArgIle: 2.935 ± 0.941
4.892ArgLys: 4.892 ± 1.632
4.566ArgLeu: 4.566 ± 1.645
0.978ArgMet: 0.978 ± 0.507
2.935ArgAsn: 2.935 ± 0.896
1.631ArgPro: 1.631 ± 0.682
1.305ArgGln: 1.305 ± 0.874
1.957ArgArg: 1.957 ± 0.543
1.957ArgSer: 1.957 ± 0.758
3.588ArgThr: 3.588 ± 1.091
2.935ArgVal: 2.935 ± 1.135
0.652ArgTrp: 0.652 ± 0.477
1.957ArgTyr: 1.957 ± 0.851
0.0ArgXaa: 0.0 ± 0.0
Ser
2.935SerAla: 2.935 ± 0.698
0.326SerCys: 0.326 ± 0.327
2.935SerAsp: 2.935 ± 0.786
4.566SerGlu: 4.566 ± 1.4
2.283SerPhe: 2.283 ± 0.875
1.957SerGly: 1.957 ± 0.726
1.957SerHis: 1.957 ± 0.749
5.545SerIle: 5.545 ± 1.473
3.262SerLys: 3.262 ± 0.733
6.197SerLeu: 6.197 ± 0.978
1.631SerMet: 1.631 ± 0.762
3.262SerAsn: 3.262 ± 0.82
0.978SerPro: 0.978 ± 0.48
1.957SerGln: 1.957 ± 0.868
2.609SerArg: 2.609 ± 1.255
2.283SerSer: 2.283 ± 0.853
3.262SerThr: 3.262 ± 0.79
2.609SerVal: 2.609 ± 1.264
0.326SerTrp: 0.326 ± 0.272
4.24SerTyr: 4.24 ± 1.404
0.0SerXaa: 0.0 ± 0.0
Thr
2.935ThrAla: 2.935 ± 0.835
0.326ThrCys: 0.326 ± 0.33
2.283ThrAsp: 2.283 ± 0.914
3.914ThrGlu: 3.914 ± 1.045
5.219ThrPhe: 5.219 ± 1.68
4.24ThrGly: 4.24 ± 1.172
1.305ThrHis: 1.305 ± 0.426
4.24ThrIle: 4.24 ± 1.137
5.219ThrLys: 5.219 ± 1.446
6.849ThrLeu: 6.849 ± 1.147
1.957ThrMet: 1.957 ± 0.658
0.652ThrAsn: 0.652 ± 0.39
2.283ThrPro: 2.283 ± 0.824
2.283ThrGln: 2.283 ± 0.798
2.609ThrArg: 2.609 ± 0.898
3.262ThrSer: 3.262 ± 0.929
2.283ThrThr: 2.283 ± 0.762
3.914ThrVal: 3.914 ± 1.09
0.326ThrTrp: 0.326 ± 0.304
3.262ThrTyr: 3.262 ± 1.205
0.0ThrXaa: 0.0 ± 0.0
Val
3.588ValAla: 3.588 ± 1.081
0.652ValCys: 0.652 ± 0.37
2.935ValAsp: 2.935 ± 1.074
3.262ValGlu: 3.262 ± 0.952
1.957ValPhe: 1.957 ± 0.637
0.978ValGly: 0.978 ± 0.471
0.652ValHis: 0.652 ± 0.584
1.631ValIle: 1.631 ± 0.712
8.154ValLys: 8.154 ± 1.654
3.588ValLeu: 3.588 ± 1.129
1.305ValMet: 1.305 ± 0.715
2.283ValAsn: 2.283 ± 0.809
3.262ValPro: 3.262 ± 1.218
0.978ValGln: 0.978 ± 0.833
2.935ValArg: 2.935 ± 0.84
2.283ValSer: 2.283 ± 1.276
5.545ValThr: 5.545 ± 1.103
4.24ValVal: 4.24 ± 1.372
0.0ValTrp: 0.0 ± 0.0
2.609ValTyr: 2.609 ± 0.68
0.0ValXaa: 0.0 ± 0.0
Trp
1.305TrpAla: 1.305 ± 0.724
0.0TrpCys: 0.0 ± 0.0
0.326TrpAsp: 0.326 ± 0.33
1.305TrpGlu: 1.305 ± 0.811
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.652TrpHis: 0.652 ± 0.39
1.631TrpIle: 1.631 ± 0.771
0.0TrpLys: 0.0 ± 0.0
1.631TrpLeu: 1.631 ± 0.497
0.0TrpMet: 0.0 ± 0.0
0.326TrpAsn: 0.326 ± 0.354
0.326TrpPro: 0.326 ± 0.272
0.326TrpGln: 0.326 ± 0.272
0.326TrpArg: 0.326 ± 0.272
1.305TrpSer: 1.305 ± 0.556
0.326TrpThr: 0.326 ± 0.304
0.652TrpVal: 0.652 ± 0.381
0.0TrpTrp: 0.0 ± 0.0
0.326TrpTyr: 0.326 ± 0.272
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.631TyrAla: 1.631 ± 0.599
1.305TyrCys: 1.305 ± 0.731
1.957TyrAsp: 1.957 ± 0.644
1.631TyrGlu: 1.631 ± 0.781
2.283TyrPhe: 2.283 ± 0.794
1.957TyrGly: 1.957 ± 0.606
0.326TyrHis: 0.326 ± 0.272
4.24TyrIle: 4.24 ± 1.286
5.545TyrLys: 5.545 ± 1.394
4.566TyrLeu: 4.566 ± 1.18
1.957TyrMet: 1.957 ± 0.765
2.283TyrAsn: 2.283 ± 0.772
1.631TyrPro: 1.631 ± 0.951
3.262TyrGln: 3.262 ± 1.159
3.588TyrArg: 3.588 ± 0.922
1.631TyrSer: 1.631 ± 0.938
1.631TyrThr: 1.631 ± 0.663
1.305TyrVal: 1.305 ± 0.738
0.326TyrTrp: 0.326 ± 0.327
1.631TyrTyr: 1.631 ± 0.806
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3067 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski