Amino acid dipeptide frequency for Streptococcus satellite phage Javan352

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If all 400 dipeptides occurred at random with equal probability, each would therefore be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
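The counting behind such a table can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes sequences are given as one-letter-code strings, that dipeptides are counted as overlapping pairs within each protein (never across two proteins), and that "expected" means the uniform 1/400 value. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Classify a frequency relative to the (assumed) uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (not the Javan352 proteome): pairs are MK, KK, KE and KE, EL.
toy = ["MKKE", "KEL"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `representation(12.979)` (LysGlu) gives "over" and `representation(0.317)` (AlaAla) gives "under", which would correspond to the red and blue markings if the page uses the uniform expectation as its threshold.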

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
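As a quick worked conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform 0.25% expectation. The helper names are hypothetical; the LysGlu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10


lys_glu = 12.979                                # LysGlu frequency in per mille
lys_glu_fraction = per_mille_to_fraction(lys_glu)   # about 0.012979
lys_glu_percent = per_mille_to_percent(lys_glu)     # about 1.2979 %
fold_vs_uniform = lys_glu_percent / 0.25            # roughly 5.2 times the uniform value
```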

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue; second residues follow the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 0.317 ± 0.292
AlaCys: 0.0 ± 0.0
AlaAsp: 1.899 ± 0.6
AlaGlu: 3.166 ± 0.907
AlaPhe: 1.899 ± 0.687
AlaGly: 1.266 ± 0.632
AlaHis: 0.633 ± 0.617
AlaIle: 2.216 ± 0.906
AlaLys: 5.381 ± 1.271
AlaLeu: 3.799 ± 0.964
AlaMet: 0.95 ± 0.709
AlaAsn: 3.482 ± 0.775
AlaPro: 0.317 ± 0.331
AlaGln: 0.95 ± 0.481
AlaArg: 2.216 ± 0.853
AlaSer: 1.266 ± 0.704
AlaThr: 4.115 ± 0.981
AlaVal: 1.899 ± 0.777
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.849 ± 0.786
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.317 ± 0.331
CysCys: 0.0 ± 0.0
CysAsp: 0.633 ± 0.39
CysGlu: 0.317 ± 0.309
CysPhe: 0.0 ± 0.0
CysGly: 0.95 ± 0.509
CysHis: 0.0 ± 0.0
CysIle: 0.317 ± 0.292
CysLys: 0.633 ± 0.443
CysLeu: 0.633 ± 0.539
CysMet: 0.317 ± 0.397
CysAsn: 0.0 ± 0.0
CysPro: 0.317 ± 0.315
CysGln: 0.317 ± 0.282
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.317 ± 0.27
CysVal: 0.317 ± 0.315
CysTrp: 0.0 ± 0.0
CysTyr: 0.633 ± 0.378
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.633 ± 0.397
AspCys: 0.633 ± 0.395
AspAsp: 4.115 ± 1.167
AspGlu: 3.799 ± 0.898
AspPhe: 6.331 ± 1.317
AspGly: 3.166 ± 0.926
AspHis: 0.633 ± 0.363
AspIle: 6.331 ± 1.473
AspLys: 8.547 ± 1.433
AspLeu: 7.597 ± 1.427
AspMet: 0.95 ± 0.541
AspAsn: 3.799 ± 1.072
AspPro: 0.317 ± 0.382
AspGln: 0.633 ± 0.438
AspArg: 1.899 ± 0.661
AspSer: 2.849 ± 0.924
AspThr: 4.432 ± 1.262
AspVal: 4.115 ± 0.904
AspTrp: 0.317 ± 0.315
AspTyr: 1.583 ± 0.765
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.799 ± 1.127
GluCys: 0.95 ± 0.475
GluAsp: 6.331 ± 1.765
GluGlu: 4.115 ± 0.933
GluPhe: 3.482 ± 1.031
GluGly: 2.849 ± 1.121
GluHis: 0.633 ± 0.379
GluIle: 7.914 ± 1.337
GluLys: 10.446 ± 1.851
GluLeu: 10.446 ± 1.916
GluMet: 1.266 ± 0.605
GluAsn: 6.331 ± 1.108
GluPro: 1.266 ± 0.754
GluGln: 4.432 ± 1.005
GluArg: 3.799 ± 1.159
GluSer: 4.748 ± 0.982
GluThr: 4.748 ± 1.339
GluVal: 2.532 ± 0.969
GluTrp: 0.633 ± 0.371
GluTyr: 2.849 ± 0.839
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.95 ± 0.473
PheCys: 0.317 ± 0.315
PheAsp: 3.166 ± 0.767
PheGlu: 6.015 ± 0.934
PhePhe: 3.482 ± 1.141
PheGly: 2.532 ± 0.699
PheHis: 1.583 ± 0.638
PheIle: 1.899 ± 0.799
PheLys: 5.381 ± 1.665
PheLeu: 6.015 ± 1.317
PheMet: 0.633 ± 0.422
PheAsn: 2.216 ± 0.947
PhePro: 1.266 ± 0.737
PheGln: 1.583 ± 0.92
PheArg: 1.899 ± 0.739
PheSer: 4.748 ± 1.336
PheThr: 2.216 ± 0.948
PheVal: 3.799 ± 0.989
PheTrp: 0.633 ± 0.371
PheTyr: 1.583 ± 0.54
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.216 ± 0.912
GlyCys: 0.317 ± 0.27
GlyAsp: 1.899 ± 0.929
GlyGlu: 1.583 ± 0.71
GlyPhe: 2.532 ± 0.877
GlyGly: 2.216 ± 1.117
GlyHis: 0.633 ± 0.363
GlyIle: 3.799 ± 0.875
GlyLys: 4.115 ± 1.825
GlyLeu: 3.799 ± 0.955
GlyMet: 0.95 ± 0.462
GlyAsn: 2.216 ± 0.888
GlyPro: 0.317 ± 0.309
GlyGln: 1.583 ± 0.51
GlyArg: 1.266 ± 0.536
GlySer: 3.166 ± 1.192
GlyThr: 2.532 ± 0.688
GlyVal: 3.166 ± 0.871
GlyTrp: 0.633 ± 0.42
GlyTyr: 4.748 ± 1.305
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.216 ± 1.093
HisCys: 0.317 ± 0.309
HisAsp: 0.633 ± 0.478
HisGlu: 0.317 ± 0.315
HisPhe: 0.0 ± 0.0
HisGly: 0.317 ± 0.309
HisHis: 0.317 ± 0.331
HisIle: 1.899 ± 0.665
HisLys: 1.899 ± 0.664
HisLeu: 1.899 ± 0.877
HisMet: 0.0 ± 0.0
HisAsn: 0.95 ± 0.5
HisPro: 0.317 ± 0.309
HisGln: 0.633 ± 0.418
HisArg: 0.0 ± 0.0
HisSer: 1.583 ± 0.666
HisThr: 2.216 ± 0.877
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.266 ± 0.687
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.583 ± 0.772
IleCys: 0.317 ± 0.309
IleAsp: 5.381 ± 1.428
IleGlu: 6.015 ± 1.683
IlePhe: 4.115 ± 1.072
IleGly: 2.532 ± 0.716
IleHis: 1.583 ± 0.639
IleIle: 4.432 ± 0.932
IleLys: 7.597 ± 1.411
IleLeu: 9.497 ± 2.013
IleMet: 0.633 ± 0.402
IleAsn: 6.331 ± 1.122
IlePro: 2.849 ± 0.579
IleGln: 2.849 ± 0.743
IleArg: 1.266 ± 0.488
IleSer: 5.698 ± 1.253
IleThr: 3.799 ± 1.091
IleVal: 2.532 ± 0.745
IleTrp: 0.0 ± 0.0
IleTyr: 2.216 ± 0.669
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.065 ± 1.613
LysCys: 0.317 ± 0.326
LysAsp: 5.381 ± 1.184
LysGlu: 12.979 ± 1.997
LysPhe: 3.166 ± 1.203
LysGly: 5.065 ± 1.618
LysHis: 2.216 ± 0.607
LysIle: 9.813 ± 1.109
LysLys: 9.497 ± 1.712
LysLeu: 7.597 ± 0.948
LysMet: 4.432 ± 1.045
LysAsn: 4.748 ± 1.256
LysPro: 2.216 ± 0.625
LysGln: 5.381 ± 1.058
LysArg: 7.281 ± 1.238
LysSer: 7.281 ± 1.505
LysThr: 7.281 ± 1.592
LysVal: 5.698 ± 1.116
LysTrp: 0.633 ± 0.439
LysTyr: 2.532 ± 0.663
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.115 ± 1.092
LeuCys: 0.95 ± 0.68
LeuAsp: 10.13 ± 1.427
LeuGlu: 11.079 ± 1.519
LeuPhe: 5.381 ± 1.523
LeuGly: 6.964 ± 1.07
LeuHis: 1.583 ± 0.862
LeuIle: 6.015 ± 1.644
LeuLys: 11.396 ± 1.699
LeuLeu: 8.23 ± 1.613
LeuMet: 1.266 ± 0.558
LeuAsn: 5.065 ± 1.28
LeuPro: 2.532 ± 0.954
LeuGln: 3.799 ± 0.941
LeuArg: 3.799 ± 1.007
LeuSer: 6.964 ± 1.539
LeuThr: 5.065 ± 1.194
LeuVal: 5.065 ± 1.506
LeuTrp: 0.317 ± 0.292
LeuTyr: 3.482 ± 1.239
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.95 ± 0.465
MetCys: 0.0 ± 0.0
MetAsp: 1.899 ± 0.615
MetGlu: 2.532 ± 1.018
MetPhe: 0.633 ± 0.427
MetGly: 0.317 ± 0.318
MetHis: 0.0 ± 0.0
MetIle: 1.266 ± 0.601
MetLys: 1.583 ± 0.535
MetLeu: 1.266 ± 0.591
MetMet: 0.317 ± 0.292
MetAsn: 2.532 ± 0.852
MetPro: 0.95 ± 0.473
MetGln: 0.317 ± 0.331
MetArg: 2.216 ± 0.836
MetSer: 0.317 ± 0.382
MetThr: 1.899 ± 0.97
MetVal: 0.95 ± 0.486
MetTrp: 0.0 ± 0.0
MetTyr: 0.633 ± 0.434
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.899 ± 0.649
AsnCys: 0.317 ± 0.27
AsnAsp: 1.583 ± 0.634
AsnGlu: 2.532 ± 0.84
AsnPhe: 5.065 ± 1.13
AsnGly: 3.799 ± 1.212
AsnHis: 2.216 ± 0.913
AsnIle: 3.799 ± 1.491
AsnLys: 6.964 ± 1.309
AsnLeu: 6.015 ± 1.067
AsnMet: 1.583 ± 0.819
AsnAsn: 3.799 ± 0.759
AsnPro: 1.899 ± 0.566
AsnGln: 2.849 ± 1.044
AsnArg: 3.166 ± 0.893
AsnSer: 2.849 ± 0.52
AsnThr: 3.799 ± 1.4
AsnVal: 1.583 ± 0.985
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.115 ± 0.855
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.317 ± 0.27
ProCys: 0.0 ± 0.0
ProAsp: 1.266 ± 0.549
ProGlu: 2.216 ± 0.764
ProPhe: 1.266 ± 0.507
ProGly: 0.317 ± 0.292
ProHis: 0.0 ± 0.0
ProIle: 1.266 ± 0.489
ProLys: 3.166 ± 1.199
ProLeu: 1.583 ± 0.737
ProMet: 0.0 ± 0.0
ProAsn: 1.266 ± 0.566
ProPro: 0.95 ± 0.485
ProGln: 0.95 ± 0.748
ProArg: 0.633 ± 0.37
ProSer: 1.266 ± 0.537
ProThr: 1.899 ± 0.672
ProVal: 0.95 ± 0.475
ProTrp: 0.317 ± 0.309
ProTyr: 0.95 ± 0.467
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.799 ± 1.154
GlnCys: 0.95 ± 0.477
GlnAsp: 2.532 ± 0.739
GlnGlu: 4.432 ± 0.779
GlnPhe: 1.583 ± 0.739
GlnGly: 0.633 ± 0.504
GlnHis: 1.266 ± 0.635
GlnIle: 2.532 ± 0.679
GlnLys: 4.115 ± 1.039
GlnLeu: 2.532 ± 0.896
GlnMet: 0.95 ± 0.628
GlnAsn: 0.633 ± 0.37
GlnPro: 0.317 ± 0.331
GlnGln: 1.583 ± 0.679
GlnArg: 2.849 ± 0.886
GlnSer: 1.583 ± 0.639
GlnThr: 1.899 ± 1.002
GlnVal: 2.849 ± 0.827
GlnTrp: 0.317 ± 0.314
GlnTyr: 2.532 ± 0.641
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.216 ± 0.63
ArgCys: 0.0 ± 0.0
ArgAsp: 3.482 ± 0.879
ArgGlu: 2.532 ± 0.756
ArgPhe: 1.899 ± 0.649
ArgGly: 1.899 ± 0.905
ArgHis: 0.317 ± 0.27
ArgIle: 2.216 ± 0.706
ArgLys: 6.964 ± 1.267
ArgLeu: 4.115 ± 1.01
ArgMet: 0.633 ± 0.401
ArgAsn: 4.432 ± 1.394
ArgPro: 0.317 ± 0.292
ArgGln: 3.799 ± 0.931
ArgArg: 1.899 ± 0.766
ArgSer: 2.532 ± 0.664
ArgThr: 3.166 ± 0.96
ArgVal: 0.633 ± 0.37
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.482 ± 1.571
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.583 ± 0.817
SerCys: 0.0 ± 0.0
SerAsp: 4.748 ± 0.859
SerGlu: 4.748 ± 1.053
SerPhe: 3.799 ± 0.915
SerGly: 2.849 ± 1.066
SerHis: 1.583 ± 0.781
SerIle: 3.799 ± 1.273
SerLys: 7.597 ± 2.283
SerLeu: 6.331 ± 1.813
SerMet: 2.216 ± 0.935
SerAsn: 2.216 ± 0.758
SerPro: 1.266 ± 0.782
SerGln: 2.216 ± 0.869
SerArg: 1.583 ± 0.792
SerSer: 4.432 ± 1.168
SerThr: 3.482 ± 0.784
SerVal: 4.432 ± 1.13
SerTrp: 1.266 ± 0.646
SerTyr: 3.166 ± 0.738
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.482 ± 1.034
ThrCys: 0.317 ± 0.292
ThrAsp: 2.216 ± 0.737
ThrGlu: 5.381 ± 1.279
ThrPhe: 2.532 ± 0.961
ThrGly: 4.432 ± 0.937
ThrHis: 0.633 ± 0.438
ThrIle: 4.432 ± 1.242
ThrLys: 4.115 ± 1.21
ThrLeu: 9.497 ± 1.894
ThrMet: 0.95 ± 0.457
ThrAsn: 3.166 ± 0.695
ThrPro: 1.266 ± 0.619
ThrGln: 2.849 ± 1.208
ThrArg: 4.115 ± 1.791
ThrSer: 2.532 ± 1.139
ThrThr: 3.482 ± 0.858
ThrVal: 3.799 ± 1.145
ThrTrp: 0.95 ± 0.553
ThrTyr: 2.216 ± 1.34
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.899 ± 0.666
ValCys: 0.317 ± 0.315
ValAsp: 4.115 ± 1.531
ValGlu: 4.748 ± 1.244
ValPhe: 1.266 ± 0.476
ValGly: 0.317 ± 0.29
ValHis: 0.633 ± 0.563
ValIle: 3.799 ± 0.967
ValLys: 3.799 ± 1.535
ValLeu: 4.432 ± 1.007
ValMet: 0.317 ± 0.29
ValAsn: 3.799 ± 1.048
ValPro: 0.95 ± 0.547
ValGln: 0.95 ± 0.481
ValArg: 2.216 ± 0.846
ValSer: 5.381 ± 1.287
ValThr: 3.482 ± 0.797
ValVal: 2.849 ± 0.785
ValTrp: 0.0 ± 0.0
ValTyr: 3.166 ± 0.794
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.633 ± 0.381
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.899 ± 0.813
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.95 ± 0.56
TrpLys: 0.317 ± 0.292
TrpLeu: 0.633 ± 0.585
TrpMet: 0.317 ± 0.273
TrpAsn: 0.317 ± 0.314
TrpPro: 0.0 ± 0.0
TrpGln: 0.317 ± 0.27
TrpArg: 0.0 ± 0.0
TrpSer: 0.633 ± 0.363
TrpThr: 0.0 ± 0.0
TrpVal: 0.317 ± 0.314
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.633 ± 0.454
TyrCys: 0.0 ± 0.0
TyrAsp: 1.266 ± 0.524
TyrGlu: 3.482 ± 0.886
TyrPhe: 3.482 ± 0.881
TyrGly: 1.266 ± 0.664
TyrHis: 0.317 ± 0.282
TyrIle: 2.532 ± 0.857
TyrLys: 5.065 ± 1.226
TyrLeu: 7.281 ± 1.256
TyrMet: 1.583 ± 0.781
TyrAsn: 2.532 ± 0.657
TyrPro: 0.95 ± 0.63
TyrGln: 1.899 ± 0.583
TyrArg: 4.432 ± 1.436
TyrSer: 3.482 ± 1.17
TyrThr: 2.532 ± 0.878
TyrVal: 0.95 ± 0.534
TyrTrp: 0.317 ± 0.315
TyrTyr: 1.899 ± 0.75
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 17 proteins (3160 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
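A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the sample standard deviation of the resampled frequencies. The page does not say which spread estimator Proteome-pI uses, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not the Javan352 proteome.
toy = ["MKKEL", "KELLK", "MAAKE", "LKKEE"]
ke_error = bootstrap_error(toy, "KE")
ww_error = bootstrap_error(toy, "WW")  # pair never observed
```

A dipeptide that never occurs has frequency 0 in every resample, so its error is 0 as well, which matches the "0.0 ± 0.0" entries in the table.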

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski