Amino acid dipepetide frequency for Streptococcus satellite phage Javan732

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.078AlaAla: 1.078 ± 0.5
0.809AlaCys: 0.809 ± 0.614
4.044AlaAsp: 4.044 ± 1.178
5.392AlaGlu: 5.392 ± 1.336
2.157AlaPhe: 2.157 ± 0.824
2.157AlaGly: 2.157 ± 0.693
0.539AlaHis: 0.539 ± 0.337
4.583AlaIle: 4.583 ± 0.937
5.662AlaLys: 5.662 ± 1.007
5.123AlaLeu: 5.123 ± 0.94
2.696AlaMet: 2.696 ± 1.097
3.505AlaAsn: 3.505 ± 0.836
1.078AlaPro: 1.078 ± 0.461
1.618AlaGln: 1.618 ± 0.608
4.583AlaArg: 4.583 ± 0.838
2.157AlaSer: 2.157 ± 0.702
3.505AlaThr: 3.505 ± 0.985
4.044AlaVal: 4.044 ± 0.798
0.0AlaTrp: 0.0 ± 0.0
1.887AlaTyr: 1.887 ± 0.783
0.0AlaXaa: 0.0 ± 0.0
Cys
0.539CysAla: 0.539 ± 0.339
0.0CysCys: 0.0 ± 0.0
0.539CysAsp: 0.539 ± 0.349
0.27CysGlu: 0.27 ± 0.318
0.539CysPhe: 0.539 ± 0.406
0.27CysGly: 0.27 ± 0.243
0.539CysHis: 0.539 ± 0.301
0.809CysIle: 0.809 ± 0.533
0.27CysLys: 0.27 ± 0.28
1.348CysLeu: 1.348 ± 0.509
0.539CysMet: 0.539 ± 0.431
0.539CysAsn: 0.539 ± 0.32
0.27CysPro: 0.27 ± 0.252
0.27CysGln: 0.27 ± 0.297
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.27CysVal: 0.27 ± 0.28
0.0CysTrp: 0.0 ± 0.0
0.539CysTyr: 0.539 ± 0.337
0.0CysXaa: 0.0 ± 0.0
Asp
1.618AspAla: 1.618 ± 0.61
0.809AspCys: 0.809 ± 0.413
4.583AspAsp: 4.583 ± 0.887
7.01AspGlu: 7.01 ± 1.466
3.775AspPhe: 3.775 ± 0.639
2.427AspGly: 2.427 ± 0.689
0.0AspHis: 0.0 ± 0.0
5.932AspIle: 5.932 ± 1.005
5.123AspLys: 5.123 ± 0.944
6.471AspLeu: 6.471 ± 0.834
2.966AspMet: 2.966 ± 0.818
2.966AspAsn: 2.966 ± 0.764
1.618AspPro: 1.618 ± 0.663
0.809AspGln: 0.809 ± 0.5
2.696AspArg: 2.696 ± 0.833
1.618AspSer: 1.618 ± 0.602
2.427AspThr: 2.427 ± 0.898
1.348AspVal: 1.348 ± 0.507
0.0AspTrp: 0.0 ± 0.0
3.505AspTyr: 3.505 ± 1.131
0.0AspXaa: 0.0 ± 0.0
Glu
5.932GluAla: 5.932 ± 1.404
0.539GluCys: 0.539 ± 0.37
3.775GluAsp: 3.775 ± 0.937
6.201GluGlu: 6.201 ± 1.8
2.966GluPhe: 2.966 ± 0.867
2.696GluGly: 2.696 ± 0.699
1.618GluHis: 1.618 ± 0.662
9.167GluIle: 9.167 ± 1.464
8.358GluLys: 8.358 ± 1.605
13.481GluLeu: 13.481 ± 2.105
2.696GluMet: 2.696 ± 0.798
7.549GluAsn: 7.549 ± 1.5
1.887GluPro: 1.887 ± 0.837
4.314GluGln: 4.314 ± 0.954
5.662GluArg: 5.662 ± 1.18
5.392GluSer: 5.392 ± 1.594
4.314GluThr: 4.314 ± 1.394
5.662GluVal: 5.662 ± 1.366
0.809GluTrp: 0.809 ± 0.338
3.235GluTyr: 3.235 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
1.078PheAla: 1.078 ± 0.415
0.539PheCys: 0.539 ± 0.559
3.775PheAsp: 3.775 ± 1.03
3.235PheGlu: 3.235 ± 1.172
2.157PhePhe: 2.157 ± 0.895
2.427PheGly: 2.427 ± 0.757
0.809PheHis: 0.809 ± 0.41
3.235PheIle: 3.235 ± 0.968
5.123PheLys: 5.123 ± 1.401
2.966PheLeu: 2.966 ± 0.697
0.539PheMet: 0.539 ± 0.373
1.887PheAsn: 1.887 ± 0.571
0.27PhePro: 0.27 ± 0.287
2.427PheGln: 2.427 ± 0.608
2.427PheArg: 2.427 ± 0.698
2.966PheSer: 2.966 ± 0.777
1.078PheThr: 1.078 ± 0.488
1.887PheVal: 1.887 ± 0.65
0.539PheTrp: 0.539 ± 0.381
1.887PheTyr: 1.887 ± 0.682
0.0PheXaa: 0.0 ± 0.0
Gly
2.157GlyAla: 2.157 ± 0.672
0.27GlyCys: 0.27 ± 0.252
2.696GlyAsp: 2.696 ± 0.965
2.427GlyGlu: 2.427 ± 0.732
2.157GlyPhe: 2.157 ± 0.9
1.887GlyGly: 1.887 ± 0.651
0.809GlyHis: 0.809 ± 0.404
4.853GlyIle: 4.853 ± 1.316
4.583GlyLys: 4.583 ± 1.144
5.123GlyLeu: 5.123 ± 1.302
1.348GlyMet: 1.348 ± 0.517
2.157GlyAsn: 2.157 ± 0.664
0.27GlyPro: 0.27 ± 0.252
2.427GlyGln: 2.427 ± 0.501
1.887GlyArg: 1.887 ± 0.626
1.887GlySer: 1.887 ± 0.658
2.427GlyThr: 2.427 ± 0.66
4.583GlyVal: 4.583 ± 0.933
1.078GlyTrp: 1.078 ± 0.704
2.966GlyTyr: 2.966 ± 1.047
0.0GlyXaa: 0.0 ± 0.0
His
1.348HisAla: 1.348 ± 0.828
0.27HisCys: 0.27 ± 0.23
0.0HisAsp: 0.0 ± 0.0
1.618HisGlu: 1.618 ± 0.654
1.348HisPhe: 1.348 ± 0.607
0.809HisGly: 0.809 ± 0.453
0.27HisHis: 0.27 ± 0.252
0.809HisIle: 0.809 ± 0.531
1.078HisLys: 1.078 ± 0.485
1.348HisLeu: 1.348 ± 0.64
0.27HisMet: 0.27 ± 0.295
1.618HisAsn: 1.618 ± 0.962
0.27HisPro: 0.27 ± 0.28
0.27HisGln: 0.27 ± 0.28
1.078HisArg: 1.078 ± 0.574
0.809HisSer: 0.809 ± 0.392
0.539HisThr: 0.539 ± 0.503
0.27HisVal: 0.27 ± 0.227
0.0HisTrp: 0.0 ± 0.0
0.27HisTyr: 0.27 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.044IleAla: 4.044 ± 1.016
0.539IleCys: 0.539 ± 0.355
4.583IleAsp: 4.583 ± 1.414
8.088IleGlu: 8.088 ± 2.015
3.775IlePhe: 3.775 ± 0.883
3.505IleGly: 3.505 ± 1.066
0.539IleHis: 0.539 ± 0.305
4.044IleIle: 4.044 ± 1.04
8.628IleLys: 8.628 ± 1.173
4.583IleLeu: 4.583 ± 1.194
0.27IleMet: 0.27 ± 0.269
2.696IleAsn: 2.696 ± 1.015
2.966IlePro: 2.966 ± 0.886
2.966IleGln: 2.966 ± 0.897
2.696IleArg: 2.696 ± 0.839
5.662IleSer: 5.662 ± 1.342
3.235IleThr: 3.235 ± 0.888
4.044IleVal: 4.044 ± 0.9
0.809IleTrp: 0.809 ± 0.487
2.696IleTyr: 2.696 ± 0.783
0.0IleXaa: 0.0 ± 0.0
Lys
8.088LysAla: 8.088 ± 1.948
0.539LysCys: 0.539 ± 0.357
4.044LysAsp: 4.044 ± 1.256
10.245LysGlu: 10.245 ± 1.382
1.887LysPhe: 1.887 ± 0.631
3.775LysGly: 3.775 ± 0.797
2.427LysHis: 2.427 ± 0.809
5.932LysIle: 5.932 ± 1.115
6.74LysLys: 6.74 ± 1.559
7.28LysLeu: 7.28 ± 1.503
1.618LysMet: 1.618 ± 0.758
6.471LysAsn: 6.471 ± 0.893
3.235LysPro: 3.235 ± 1.076
2.696LysGln: 2.696 ± 0.841
6.201LysArg: 6.201 ± 1.422
5.932LysSer: 5.932 ± 1.038
7.28LysThr: 7.28 ± 1.438
4.044LysVal: 4.044 ± 0.973
1.078LysTrp: 1.078 ± 0.498
2.427LysTyr: 2.427 ± 1.059
0.0LysXaa: 0.0 ± 0.0
Leu
6.201LeuAla: 6.201 ± 1.196
0.809LeuCys: 0.809 ± 0.506
8.628LeuAsp: 8.628 ± 1.436
13.481LeuGlu: 13.481 ± 2.011
4.314LeuPhe: 4.314 ± 1.206
4.853LeuGly: 4.853 ± 1.139
0.809LeuHis: 0.809 ± 0.585
5.392LeuIle: 5.392 ± 1.137
7.549LeuLys: 7.549 ± 1.432
9.167LeuLeu: 9.167 ± 1.285
2.427LeuMet: 2.427 ± 0.757
6.471LeuAsn: 6.471 ± 1.606
2.966LeuPro: 2.966 ± 0.874
4.583LeuGln: 4.583 ± 0.877
4.044LeuArg: 4.044 ± 1.038
4.853LeuSer: 4.853 ± 0.981
6.471LeuThr: 6.471 ± 1.461
3.505LeuVal: 3.505 ± 1.061
0.539LeuTrp: 0.539 ± 0.33
3.775LeuTyr: 3.775 ± 0.92
0.0LeuXaa: 0.0 ± 0.0
Met
2.966MetAla: 2.966 ± 0.787
0.0MetCys: 0.0 ± 0.0
1.887MetAsp: 1.887 ± 0.841
2.427MetGlu: 2.427 ± 0.792
0.539MetPhe: 0.539 ± 0.316
2.427MetGly: 2.427 ± 0.782
0.27MetHis: 0.27 ± 0.28
1.078MetIle: 1.078 ± 0.494
1.618MetLys: 1.618 ± 0.529
1.887MetLeu: 1.887 ± 0.654
0.27MetMet: 0.27 ± 0.254
1.887MetAsn: 1.887 ± 0.649
1.078MetPro: 1.078 ± 0.421
1.348MetGln: 1.348 ± 0.551
1.348MetArg: 1.348 ± 0.692
1.618MetSer: 1.618 ± 0.552
2.427MetThr: 2.427 ± 0.91
1.078MetVal: 1.078 ± 0.574
0.27MetTrp: 0.27 ± 0.227
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.505AsnAla: 3.505 ± 0.903
0.27AsnCys: 0.27 ± 0.286
3.505AsnAsp: 3.505 ± 1.1
3.235AsnGlu: 3.235 ± 0.712
1.887AsnPhe: 1.887 ± 0.759
4.853AsnGly: 4.853 ± 1.05
1.078AsnHis: 1.078 ± 0.535
3.775AsnIle: 3.775 ± 1.082
7.01AsnLys: 7.01 ± 1.126
5.123AsnLeu: 5.123 ± 0.918
1.078AsnMet: 1.078 ± 0.463
1.887AsnAsn: 1.887 ± 0.609
2.427AsnPro: 2.427 ± 0.634
2.157AsnGln: 2.157 ± 0.925
1.887AsnArg: 1.887 ± 0.594
2.966AsnSer: 2.966 ± 0.937
4.044AsnThr: 4.044 ± 0.971
2.696AsnVal: 2.696 ± 0.902
0.27AsnTrp: 0.27 ± 0.252
2.157AsnTyr: 2.157 ± 0.847
0.0AsnXaa: 0.0 ± 0.0
Pro
1.348ProAla: 1.348 ± 0.57
0.27ProCys: 0.27 ± 0.227
2.157ProAsp: 2.157 ± 0.778
2.966ProGlu: 2.966 ± 0.838
1.887ProPhe: 1.887 ± 0.785
0.27ProGly: 0.27 ± 0.28
0.0ProHis: 0.0 ± 0.0
1.887ProIle: 1.887 ± 0.491
1.887ProLys: 1.887 ± 0.566
1.887ProLeu: 1.887 ± 0.55
0.27ProMet: 0.27 ± 0.227
1.618ProAsn: 1.618 ± 0.691
1.618ProPro: 1.618 ± 0.553
1.348ProGln: 1.348 ± 0.531
4.044ProArg: 4.044 ± 0.912
1.887ProSer: 1.887 ± 0.743
1.078ProThr: 1.078 ± 0.545
1.887ProVal: 1.887 ± 0.807
0.0ProTrp: 0.0 ± 0.0
1.348ProTyr: 1.348 ± 0.449
0.0ProXaa: 0.0 ± 0.0
Gln
3.775GlnAla: 3.775 ± 0.973
0.0GlnCys: 0.0 ± 0.0
2.696GlnAsp: 2.696 ± 0.984
3.505GlnGlu: 3.505 ± 1.206
1.078GlnPhe: 1.078 ± 0.529
2.696GlnGly: 2.696 ± 0.993
0.539GlnHis: 0.539 ± 0.395
1.348GlnIle: 1.348 ± 0.644
4.044GlnLys: 4.044 ± 0.879
4.853GlnLeu: 4.853 ± 1.211
0.809GlnMet: 0.809 ± 0.672
1.348GlnAsn: 1.348 ± 0.619
0.809GlnPro: 0.809 ± 0.487
2.157GlnGln: 2.157 ± 0.774
2.427GlnArg: 2.427 ± 0.839
1.348GlnSer: 1.348 ± 0.587
2.157GlnThr: 2.157 ± 0.672
2.427GlnVal: 2.427 ± 0.648
0.539GlnTrp: 0.539 ± 0.301
0.539GlnTyr: 0.539 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
2.966ArgAla: 2.966 ± 0.714
0.27ArgCys: 0.27 ± 0.243
2.696ArgAsp: 2.696 ± 0.968
5.932ArgGlu: 5.932 ± 1.338
2.966ArgPhe: 2.966 ± 0.954
1.348ArgGly: 1.348 ± 0.676
1.348ArgHis: 1.348 ± 0.471
4.314ArgIle: 4.314 ± 1.276
5.123ArgLys: 5.123 ± 1.175
6.74ArgLeu: 6.74 ± 1.3
1.618ArgMet: 1.618 ± 0.739
2.696ArgAsn: 2.696 ± 0.839
0.27ArgPro: 0.27 ± 0.227
3.505ArgGln: 3.505 ± 1.079
2.427ArgArg: 2.427 ± 0.694
2.157ArgSer: 2.157 ± 0.695
3.235ArgThr: 3.235 ± 0.762
2.157ArgVal: 2.157 ± 0.6
0.27ArgTrp: 0.27 ± 0.227
2.966ArgTyr: 2.966 ± 0.846
0.0ArgXaa: 0.0 ± 0.0
Ser
2.696SerAla: 2.696 ± 0.778
0.27SerCys: 0.27 ± 0.28
1.618SerAsp: 1.618 ± 0.558
6.201SerGlu: 6.201 ± 1.834
1.887SerPhe: 1.887 ± 0.852
3.775SerGly: 3.775 ± 0.88
1.348SerHis: 1.348 ± 0.608
3.235SerIle: 3.235 ± 0.974
4.853SerLys: 4.853 ± 0.976
4.853SerLeu: 4.853 ± 1.008
2.157SerMet: 2.157 ± 0.706
4.044SerAsn: 4.044 ± 1.343
2.427SerPro: 2.427 ± 0.761
1.078SerGln: 1.078 ± 0.486
2.427SerArg: 2.427 ± 0.876
2.966SerSer: 2.966 ± 0.995
2.157SerThr: 2.157 ± 0.84
2.696SerVal: 2.696 ± 0.783
0.539SerTrp: 0.539 ± 0.387
2.966SerTyr: 2.966 ± 1.099
0.0SerXaa: 0.0 ± 0.0
Thr
2.157ThrAla: 2.157 ± 0.783
0.539ThrCys: 0.539 ± 0.409
1.618ThrAsp: 1.618 ± 0.697
3.775ThrGlu: 3.775 ± 0.811
2.157ThrPhe: 2.157 ± 0.555
3.775ThrGly: 3.775 ± 0.995
0.809ThrHis: 0.809 ± 0.338
2.696ThrIle: 2.696 ± 0.824
4.583ThrLys: 4.583 ± 1.515
5.932ThrLeu: 5.932 ± 1.361
2.427ThrMet: 2.427 ± 0.952
0.809ThrAsn: 0.809 ± 0.609
2.427ThrPro: 2.427 ± 0.902
2.696ThrGln: 2.696 ± 0.766
2.696ThrArg: 2.696 ± 0.844
2.696ThrSer: 2.696 ± 0.79
4.044ThrThr: 4.044 ± 1.023
4.853ThrVal: 4.853 ± 1.494
0.539ThrTrp: 0.539 ± 0.325
3.505ThrTyr: 3.505 ± 1.146
0.0ThrXaa: 0.0 ± 0.0
Val
2.696ValAla: 2.696 ± 1.049
0.27ValCys: 0.27 ± 0.227
2.696ValAsp: 2.696 ± 0.965
7.28ValGlu: 7.28 ± 1.601
1.348ValPhe: 1.348 ± 0.618
2.427ValGly: 2.427 ± 1.08
0.0ValHis: 0.0 ± 0.0
2.696ValIle: 2.696 ± 0.885
4.583ValLys: 4.583 ± 0.994
4.853ValLeu: 4.853 ± 1.165
1.887ValMet: 1.887 ± 0.661
3.505ValAsn: 3.505 ± 0.716
2.157ValPro: 2.157 ± 0.807
1.618ValGln: 1.618 ± 0.554
2.427ValArg: 2.427 ± 0.855
4.044ValSer: 4.044 ± 0.785
2.696ValThr: 2.696 ± 0.879
3.235ValVal: 3.235 ± 1.381
0.539ValTrp: 0.539 ± 0.355
1.887ValTyr: 1.887 ± 0.79
0.0ValXaa: 0.0 ± 0.0
Trp
0.27TrpAla: 0.27 ± 0.252
0.0TrpCys: 0.0 ± 0.0
0.27TrpAsp: 0.27 ± 0.328
1.348TrpGlu: 1.348 ± 0.565
0.27TrpPhe: 0.27 ± 0.227
0.539TrpGly: 0.539 ± 0.355
0.27TrpHis: 0.27 ± 0.227
0.809TrpIle: 0.809 ± 0.506
0.539TrpLys: 0.539 ± 0.365
0.809TrpLeu: 0.809 ± 0.373
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.27TrpGln: 0.27 ± 0.227
0.539TrpArg: 0.539 ± 0.305
0.539TrpSer: 0.539 ± 0.325
0.0TrpThr: 0.0 ± 0.0
0.809TrpVal: 0.809 ± 0.503
0.0TrpTrp: 0.0 ± 0.0
0.539TrpTyr: 0.539 ± 0.305
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.157TyrAla: 2.157 ± 0.864
0.539TyrCys: 0.539 ± 0.325
1.887TyrAsp: 1.887 ± 0.549
1.618TyrGlu: 1.618 ± 0.8
2.157TyrPhe: 2.157 ± 0.794
1.078TyrGly: 1.078 ± 0.562
0.27TyrHis: 0.27 ± 0.227
3.775TyrIle: 3.775 ± 1.151
4.583TyrLys: 4.583 ± 1.289
7.01TyrLeu: 7.01 ± 1.509
0.27TyrMet: 0.27 ± 0.271
2.157TyrAsn: 2.157 ± 0.569
1.618TyrPro: 1.618 ± 0.835
0.809TyrGln: 0.809 ± 0.451
3.505TyrArg: 3.505 ± 0.847
2.696TyrSer: 2.696 ± 0.828
1.618TyrThr: 1.618 ± 0.569
1.348TyrVal: 1.348 ± 0.591
0.0TyrTrp: 0.0 ± 0.0
0.539TyrTyr: 0.539 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (3710 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski