Amino acid dipeptide frequency for Streptococcus satellite phage Javan225

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
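As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function names, the one-letter input format, the mapping of non-standard letters to X (Xaa), and the normalisation by the total number of counted dipeptides are all assumptions made for this example; the database's exact counting procedure is not described on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are not counted across protein boundaries.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MKEL", "KEKE"])
```

In the toy input, KE occurs three times among six dipeptides, so it accounts for half of the total.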

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
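The arithmetic behind the expected value can be sketched as follows. The threshold of exactly 1/400 (2.5‰) and the function names are assumptions made for illustration; the page does not state the precise rule used for the red and blue marking.

```python
# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table: LysGlu (14.318) and AlaAla (0.447).
lys_glu = classify(14.318)
ala_ala = classify(0.447)
```

Under this assumed rule, LysGlu would fall in the overrepresented group and AlaAla in the underrepresented group.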

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.447 ± 0.423
AlaCys: 0.447 ± 0.385
AlaAsp: 3.579 ± 1.244
AlaGlu: 4.922 ± 2.091
AlaPhe: 4.474 ± 1.553
AlaGly: 3.132 ± 1.755
AlaHis: 0.447 ± 0.423
AlaIle: 3.132 ± 1.33
AlaLys: 4.922 ± 1.563
AlaLeu: 5.369 ± 1.458
AlaMet: 2.685 ± 1.235
AlaAsn: 2.685 ± 1.192
AlaPro: 2.685 ± 1.215
AlaGln: 1.79 ± 0.942
AlaArg: 3.579 ± 1.15
AlaSer: 2.237 ± 1.161
AlaThr: 3.132 ± 1.238
AlaVal: 2.237 ± 1.197
AlaTrp: 1.342 ± 1.058
AlaTyr: 3.579 ± 0.849
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.447 ± 0.513
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.447 ± 0.484
CysPhe: 0.895 ± 0.603
CysGly: 0.447 ± 0.484
CysHis: 0.447 ± 0.385
CysIle: 0.895 ± 0.771
CysLys: 0.895 ± 0.64
CysLeu: 0.895 ± 0.907
CysMet: 0.0 ± 0.469
CysAsn: 0.895 ± 0.771
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.447 ± 0.423
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.342 ± 0.877
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.895 ± 0.711
AspCys: 1.79 ± 1.202
AspAsp: 3.579 ± 1.73
AspGlu: 5.817 ± 2.025
AspPhe: 4.027 ± 1.272
AspGly: 2.685 ± 1.11
AspHis: 0.447 ± 0.551
AspIle: 4.027 ± 1.133
AspLys: 7.606 ± 1.484
AspLeu: 5.817 ± 1.87
AspMet: 2.685 ± 1.132
AspAsn: 5.817 ± 1.601
AspPro: 1.79 ± 0.738
AspGln: 1.342 ± 0.701
AspArg: 0.895 ± 0.968
AspSer: 1.79 ± 1.115
AspThr: 2.685 ± 0.75
AspVal: 3.132 ± 1.001
AspTrp: 1.342 ± 0.769
AspTyr: 3.579 ± 1.149
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.922 ± 1.613
GluCys: 0.447 ± 0.513
GluAsp: 5.817 ± 1.577
GluGlu: 6.264 ± 1.788
GluPhe: 2.685 ± 1.406
GluGly: 3.132 ± 1.211
GluHis: 0.895 ± 0.603
GluIle: 6.264 ± 1.88
GluLys: 5.817 ± 1.527
GluLeu: 10.291 ± 2.806
GluMet: 2.237 ± 1.181
GluAsn: 5.817 ± 1.67
GluPro: 2.237 ± 0.888
GluGln: 5.369 ± 1.914
GluArg: 6.264 ± 1.44
GluSer: 2.237 ± 1.152
GluThr: 2.685 ± 1.215
GluVal: 4.922 ± 1.454
GluTrp: 0.895 ± 0.506
GluTyr: 3.579 ± 2.165
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.895 ± 0.771
PheCys: 0.447 ± 0.484
PheAsp: 4.474 ± 1.096
PheGlu: 2.237 ± 1.36
PhePhe: 2.237 ± 1.064
PheGly: 3.132 ± 0.653
PheHis: 0.895 ± 0.546
PheIle: 4.027 ± 2.149
PheLys: 4.922 ± 1.872
PheLeu: 3.132 ± 1.429
PheMet: 1.342 ± 0.65
PheAsn: 2.685 ± 1.057
PhePro: 1.342 ± 1.058
PheGln: 1.79 ± 0.777
PheArg: 1.342 ± 0.661
PheSer: 4.027 ± 1.234
PheThr: 1.342 ± 0.497
PheVal: 1.79 ± 1.278
PheTrp: 0.0 ± 0.0
PheTyr: 3.579 ± 1.339
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.817 ± 1.947
GlyCys: 0.895 ± 0.58
GlyAsp: 1.79 ± 1.141
GlyGlu: 2.237 ± 0.695
GlyPhe: 1.79 ± 0.745
GlyGly: 0.447 ± 0.385
GlyHis: 0.447 ± 0.423
GlyIle: 4.027 ± 1.24
GlyLys: 4.922 ± 1.382
GlyLeu: 3.579 ± 1.674
GlyMet: 0.0 ± 0.0
GlyAsn: 2.685 ± 1.508
GlyPro: 0.0 ± 0.0
GlyGln: 1.342 ± 0.769
GlyArg: 2.685 ± 0.93
GlySer: 1.79 ± 0.57
GlyThr: 3.132 ± 0.873
GlyVal: 3.579 ± 1.404
GlyTrp: 1.79 ± 1.167
GlyTyr: 2.237 ± 0.669
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.342 ± 1.268
HisCys: 0.895 ± 0.591
HisAsp: 0.447 ± 0.453
HisGlu: 1.79 ± 0.896
HisPhe: 0.895 ± 0.504
HisGly: 0.895 ± 0.695
HisHis: 0.0 ± 0.0
HisIle: 1.79 ± 1.224
HisLys: 0.895 ± 0.673
HisLeu: 1.79 ± 0.595
HisMet: 0.0 ± 0.0
HisAsn: 0.895 ± 0.914
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.342 ± 0.849
HisSer: 0.0 ± 0.0
HisThr: 0.895 ± 0.546
HisVal: 1.342 ± 0.74
HisTrp: 0.0 ± 0.0
HisTyr: 0.447 ± 0.551
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.264 ± 1.793
IleCys: 0.447 ± 0.524
IleAsp: 6.264 ± 2.629
IleGlu: 5.817 ± 2.052
IlePhe: 3.132 ± 1.431
IleGly: 5.369 ± 1.743
IleHis: 1.342 ± 0.759
IleIle: 4.027 ± 0.887
IleLys: 6.711 ± 1.66
IleLeu: 5.369 ± 1.773
IleMet: 0.447 ± 0.45
IleAsn: 5.369 ± 1.66
IlePro: 3.132 ± 1.535
IleGln: 2.237 ± 0.913
IleArg: 1.79 ± 0.787
IleSer: 3.132 ± 1.154
IleThr: 5.817 ± 1.745
IleVal: 4.474 ± 0.937
IleTrp: 0.0 ± 0.0
IleTyr: 1.342 ± 0.913
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.264 ± 2.574
LysCys: 0.0 ± 0.0
LysAsp: 2.237 ± 0.882
LysGlu: 14.318 ± 3.147
LysPhe: 4.027 ± 1.593
LysGly: 3.132 ± 1.067
LysHis: 1.342 ± 0.692
LysIle: 6.711 ± 2.248
LysLys: 5.369 ± 1.675
LysLeu: 8.501 ± 1.823
LysMet: 2.237 ± 1.003
LysAsn: 3.579 ± 1.251
LysPro: 2.685 ± 1.053
LysGln: 6.264 ± 1.409
LysArg: 4.922 ± 1.916
LysSer: 4.027 ± 1.727
LysThr: 5.817 ± 2.164
LysVal: 4.027 ± 1.183
LysTrp: 0.447 ± 0.385
LysTyr: 2.685 ± 1.512
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.922 ± 1.374
LeuCys: 0.895 ± 0.771
LeuAsp: 8.501 ± 1.44
LeuGlu: 11.186 ± 1.87
LeuPhe: 2.685 ± 1.092
LeuGly: 5.369 ± 1.838
LeuHis: 0.447 ± 0.484
LeuIle: 4.922 ± 1.493
LeuLys: 6.264 ± 0.909
LeuLeu: 11.633 ± 1.981
LeuMet: 3.579 ± 0.913
LeuAsn: 7.159 ± 2.284
LeuPro: 5.369 ± 0.975
LeuGln: 3.132 ± 1.22
LeuArg: 2.237 ± 1.05
LeuSer: 4.474 ± 1.127
LeuThr: 5.817 ± 1.17
LeuVal: 5.817 ± 1.051
LeuTrp: 0.895 ± 0.506
LeuTyr: 4.922 ± 1.128
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.132 ± 1.527
MetCys: 0.0 ± 0.0
MetAsp: 0.895 ± 0.64
MetGlu: 1.342 ± 0.848
MetPhe: 0.895 ± 0.493
MetGly: 0.895 ± 0.826
MetHis: 0.447 ± 0.484
MetIle: 0.447 ± 0.524
MetLys: 1.79 ± 1.194
MetLeu: 2.237 ± 0.825
MetMet: 0.447 ± 0.45
MetAsn: 3.579 ± 0.958
MetPro: 0.895 ± 0.881
MetGln: 0.895 ± 0.845
MetArg: 0.447 ± 0.513
MetSer: 1.342 ± 0.769
MetThr: 2.685 ± 1.208
MetVal: 1.342 ± 0.766
MetTrp: 0.447 ± 0.461
MetTyr: 1.342 ± 0.61
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.027 ± 0.7
AsnCys: 0.447 ± 0.453
AsnAsp: 3.579 ± 1.419
AsnGlu: 4.474 ± 1.856
AsnPhe: 1.79 ± 0.792
AsnGly: 4.027 ± 2.073
AsnHis: 2.237 ± 0.743
AsnIle: 4.922 ± 1.375
AsnLys: 7.606 ± 1.412
AsnLeu: 6.264 ± 1.713
AsnMet: 1.342 ± 0.802
AsnAsn: 3.132 ± 1.975
AsnPro: 2.685 ± 0.754
AsnGln: 2.685 ± 1.046
AsnArg: 4.027 ± 1.175
AsnSer: 3.579 ± 1.339
AsnThr: 3.132 ± 1.28
AsnVal: 2.685 ± 0.827
AsnTrp: 0.447 ± 0.484
AsnTyr: 2.685 ± 1.13
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.237 ± 1.237
ProCys: 0.0 ± 0.0
ProAsp: 2.237 ± 0.854
ProGlu: 1.79 ± 0.906
ProPhe: 2.237 ± 0.746
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 3.579 ± 1.556
ProLys: 3.579 ± 1.464
ProLeu: 3.132 ± 1.002
ProMet: 0.447 ± 0.385
ProAsn: 3.579 ± 1.653
ProPro: 1.79 ± 0.969
ProGln: 2.685 ± 1.082
ProArg: 0.895 ± 0.58
ProSer: 0.895 ± 0.504
ProThr: 1.342 ± 0.646
ProVal: 2.685 ± 0.883
ProTrp: 0.0 ± 0.0
ProTyr: 0.447 ± 0.513
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.579 ± 1.136
GlnCys: 0.0 ± 0.0
GlnAsp: 2.685 ± 1.215
GlnGlu: 2.237 ± 0.812
GlnPhe: 2.685 ± 0.864
GlnGly: 0.895 ± 0.673
GlnHis: 0.447 ± 0.423
GlnIle: 4.027 ± 1.878
GlnLys: 2.685 ± 1.096
GlnLeu: 7.159 ± 1.394
GlnMet: 0.895 ± 0.67
GlnAsn: 1.342 ± 0.716
GlnPro: 1.79 ± 1.167
GlnGln: 1.342 ± 0.641
GlnArg: 0.895 ± 0.574
GlnSer: 1.79 ± 0.942
GlnThr: 2.685 ± 0.831
GlnVal: 2.237 ± 1.05
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.685 ± 1.217
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.237 ± 1.363
ArgCys: 0.447 ± 0.423
ArgAsp: 1.342 ± 0.678
ArgGlu: 2.685 ± 0.763
ArgPhe: 1.342 ± 0.755
ArgGly: 1.342 ± 0.948
ArgHis: 1.342 ± 0.912
ArgIle: 4.027 ± 1.641
ArgLys: 3.579 ± 1.491
ArgLeu: 4.027 ± 0.597
ArgMet: 0.447 ± 0.447
ArgAsn: 2.685 ± 0.775
ArgPro: 1.79 ± 0.963
ArgGln: 2.685 ± 1.352
ArgArg: 0.447 ± 0.385
ArgSer: 0.447 ± 0.423
ArgThr: 4.027 ± 1.101
ArgVal: 1.79 ± 0.897
ArgTrp: 0.895 ± 0.812
ArgTyr: 3.132 ± 1.593
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.237 ± 0.884
SerCys: 0.447 ± 0.45
SerAsp: 4.922 ± 1.631
SerGlu: 2.685 ± 1.385
SerPhe: 1.79 ± 1.168
SerGly: 0.895 ± 0.771
SerHis: 0.895 ± 0.506
SerIle: 4.474 ± 1.134
SerLys: 4.027 ± 1.271
SerLeu: 3.132 ± 1.097
SerMet: 1.79 ± 1.187
SerAsn: 2.685 ± 1.244
SerPro: 1.342 ± 0.64
SerGln: 0.895 ± 0.609
SerArg: 0.447 ± 0.551
SerSer: 0.895 ± 0.541
SerThr: 3.132 ± 1.008
SerVal: 3.132 ± 1.208
SerTrp: 0.447 ± 0.423
SerTyr: 2.685 ± 1.451
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.237 ± 1.036
ThrCys: 0.0 ± 0.0
ThrAsp: 4.474 ± 2.637
ThrGlu: 3.579 ± 1.476
ThrPhe: 3.579 ± 0.802
ThrGly: 2.685 ± 1.188
ThrHis: 0.447 ± 0.423
ThrIle: 4.922 ± 2.077
ThrLys: 6.711 ± 1.663
ThrLeu: 6.264 ± 1.106
ThrMet: 1.79 ± 0.686
ThrAsn: 0.447 ± 0.453
ThrPro: 1.342 ± 0.794
ThrGln: 1.342 ± 1.177
ThrArg: 2.685 ± 0.831
ThrSer: 3.132 ± 1.316
ThrThr: 3.579 ± 1.115
ThrVal: 5.817 ± 1.47
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.132 ± 1.264
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.579 ± 1.054
ValCys: 0.895 ± 0.493
ValAsp: 1.79 ± 1.075
ValGlu: 4.922 ± 1.716
ValPhe: 3.579 ± 1.358
ValGly: 1.79 ± 0.904
ValHis: 0.0 ± 0.0
ValIle: 1.79 ± 0.974
ValLys: 5.369 ± 2.073
ValLeu: 5.817 ± 1.296
ValMet: 1.342 ± 0.61
ValAsn: 4.474 ± 1.197
ValPro: 2.237 ± 0.812
ValGln: 1.79 ± 0.851
ValArg: 2.237 ± 1.361
ValSer: 4.474 ± 1.913
ValThr: 3.132 ± 1.478
ValVal: 0.447 ± 0.385
ValTrp: 1.342 ± 0.769
ValTyr: 2.685 ± 1.139
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.447 ± 0.551
TrpCys: 0.0 ± 0.0
TrpAsp: 0.447 ± 0.461
TrpGlu: 1.342 ± 0.931
TrpPhe: 0.447 ± 0.385
TrpGly: 0.447 ± 0.385
TrpHis: 0.447 ± 0.385
TrpIle: 0.447 ± 0.45
TrpLys: 0.0 ± 0.0
TrpLeu: 1.342 ± 0.572
TrpMet: 0.0 ± 0.0
TrpAsn: 0.447 ± 0.551
TrpPro: 0.0 ± 0.0
TrpGln: 0.895 ± 0.541
TrpArg: 0.447 ± 0.385
TrpSer: 0.447 ± 0.423
TrpThr: 0.447 ± 0.453
TrpVal: 1.79 ± 0.79
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.447 ± 0.385
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.895 ± 0.603
TyrCys: 0.0 ± 0.0
TyrAsp: 2.685 ± 0.684
TyrGlu: 2.685 ± 1.364
TyrPhe: 0.447 ± 0.442
TyrGly: 4.474 ± 1.419
TyrHis: 2.685 ± 1.12
TyrIle: 4.027 ± 1.625
TyrLys: 4.922 ± 1.318
TyrLeu: 4.474 ± 1.382
TyrMet: 1.342 ± 0.732
TyrAsn: 5.817 ± 1.588
TyrPro: 0.447 ± 0.423
TyrGln: 3.579 ± 1.59
TyrArg: 2.685 ± 0.87
TyrSer: 2.237 ± 0.864
TyrThr: 2.685 ± 1.69
TyrVal: 0.447 ± 0.385
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.579 ± 1.252
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2236 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
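A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the fixed seed, the use of the sample standard deviation, and the normalisation by the number of counted dipeptides are assumptions for illustration; only the resampling unit (proteins) and the number of rounds (100) come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all counted dipeptides."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["MKELLK", "KEKEAA", "MLLKE"]
err = bootstrap_error(toy_proteins, "KE")
```

With a small proteome such as this one (14 proteins), each resample can differ noticeably from the original set, which is consistent with the relatively large errors in the table.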

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski