Amino acid dipeptide frequency for Streptococcus satellite phage Javan280

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
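The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of every non-standard letter to `X` (Xaa), and the choice to count overlapping pairs within each protein only are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 0.25% expressed in per mille (2.5)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping pairs are counted within each protein, and
    pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard residues is Xaa.
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Always report all 21 x 21 = 441 dipeptides, including absent ones.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the Javan280 proteome):
# "LL" occurs in 2 of the 7 overlapping pairs, i.e. about 285.7 per mille.
toy = dipeptide_per_mille(["MKLLE", "ALLK"])
overrepresented = {dp for dp, f in toy.items() if f > UNIFORM_PER_MILLE}
```

Comparing each value with `UNIFORM_PER_MILLE` mirrors the red/blue marking described above; a real analysis might instead compare against expectations derived from the single amino acid frequencies.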

For more information, see the sequence space article on Wikipedia.

First residue in rows, second residue in columns: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.663 ± 0.835
AlaCys: 0.554 ± 0.345
AlaAsp: 2.494 ± 0.932
AlaGlu: 4.156 ± 1.153
AlaPhe: 2.771 ± 0.806
AlaGly: 1.94 ± 0.74
AlaHis: 0.831 ± 0.403
AlaIle: 4.156 ± 0.858
AlaLys: 4.433 ± 0.991
AlaLeu: 5.265 ± 0.963
AlaMet: 0.831 ± 0.459
AlaAsn: 3.325 ± 0.772
AlaPro: 1.663 ± 0.715
AlaGln: 1.94 ± 0.829
AlaArg: 2.494 ± 0.723
AlaSer: 3.602 ± 0.876
AlaThr: 2.217 ± 0.708
AlaVal: 2.771 ± 0.783
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.048 ± 0.778
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.831 ± 0.381
CysCys: 0.0 ± 0.0
CysAsp: 0.554 ± 0.449
CysGlu: 0.554 ± 0.593
CysPhe: 0.554 ± 0.525
CysGly: 0.277 ± 0.272
CysHis: 0.554 ± 0.308
CysIle: 0.554 ± 0.378
CysLys: 0.831 ± 0.443
CysLeu: 0.277 ± 0.246
CysMet: 0.277 ± 0.246
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.554 ± 0.404
CysSer: 0.0 ± 0.0
CysThr: 0.277 ± 0.246
CysVal: 0.0 ± 0.0
CysTrp: 0.277 ± 0.277
CysTyr: 0.831 ± 0.496
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.663 ± 0.503
AspCys: 0.831 ± 0.466
AspAsp: 2.494 ± 0.857
AspGlu: 3.048 ± 0.885
AspPhe: 3.602 ± 1.015
AspGly: 2.771 ± 0.971
AspHis: 0.277 ± 0.267
AspIle: 5.542 ± 1.095
AspLys: 5.819 ± 1.353
AspLeu: 5.265 ± 1.445
AspMet: 2.494 ± 0.751
AspAsn: 3.325 ± 0.934
AspPro: 0.831 ± 0.4
AspGln: 1.94 ± 0.702
AspArg: 1.94 ± 0.82
AspSer: 3.602 ± 0.864
AspThr: 3.048 ± 0.997
AspVal: 1.94 ± 0.753
AspTrp: 0.831 ± 0.486
AspTyr: 3.602 ± 0.832
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.879 ± 0.888
GluCys: 0.554 ± 0.37
GluAsp: 4.433 ± 1.427
GluGlu: 6.373 ± 1.572
GluPhe: 3.048 ± 0.884
GluGly: 2.771 ± 0.905
GluHis: 0.831 ± 0.456
GluIle: 7.204 ± 1.774
GluLys: 8.313 ± 1.51
GluLeu: 9.421 ± 1.734
GluMet: 1.663 ± 0.638
GluAsn: 4.71 ± 1.044
GluPro: 1.108 ± 0.484
GluGln: 4.988 ± 1.296
GluArg: 3.879 ± 1.366
GluSer: 4.156 ± 1.144
GluThr: 4.156 ± 0.825
GluVal: 3.602 ± 0.854
GluTrp: 0.554 ± 0.32
GluTyr: 2.771 ± 0.799
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.217 ± 0.771
PheCys: 0.277 ± 0.324
PheAsp: 3.325 ± 1.077
PheGlu: 3.048 ± 0.891
PhePhe: 1.94 ± 0.793
PheGly: 1.108 ± 0.604
PheHis: 0.554 ± 0.339
PheIle: 3.325 ± 0.984
PheLys: 4.988 ± 1.041
PheLeu: 4.433 ± 0.904
PheMet: 1.385 ± 0.539
PheAsn: 2.771 ± 0.866
PhePro: 1.108 ± 0.521
PheGln: 1.663 ± 0.502
PheArg: 1.108 ± 0.537
PheSer: 2.494 ± 0.749
PheThr: 2.771 ± 0.637
PheVal: 0.831 ± 0.515
PheTrp: 0.277 ± 0.272
PheTyr: 1.94 ± 0.681
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.94 ± 0.768
GlyCys: 0.0 ± 0.0
GlyAsp: 1.385 ± 0.517
GlyGlu: 3.602 ± 1.073
GlyPhe: 1.108 ± 0.593
GlyGly: 1.663 ± 0.723
GlyHis: 0.554 ± 0.326
GlyIle: 3.879 ± 1.132
GlyLys: 4.156 ± 0.972
GlyLeu: 4.988 ± 1.524
GlyMet: 1.663 ± 0.745
GlyAsn: 2.217 ± 0.702
GlyPro: 0.277 ± 0.247
GlyGln: 1.663 ± 0.632
GlyArg: 1.94 ± 0.745
GlySer: 3.048 ± 0.679
GlyThr: 3.048 ± 0.737
GlyVal: 3.048 ± 1.03
GlyTrp: 0.554 ± 0.342
GlyTyr: 2.217 ± 0.75
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.108 ± 0.753
HisCys: 0.0 ± 0.0
HisAsp: 1.385 ± 0.593
HisGlu: 0.831 ± 0.445
HisPhe: 0.554 ± 0.361
HisGly: 0.831 ± 0.473
HisHis: 0.831 ± 0.63
HisIle: 3.325 ± 1.063
HisLys: 1.108 ± 0.694
HisLeu: 2.494 ± 0.658
HisMet: 0.0 ± 0.0
HisAsn: 1.108 ± 0.519
HisPro: 0.831 ± 0.452
HisGln: 0.831 ± 0.44
HisArg: 0.831 ± 0.476
HisSer: 1.663 ± 0.527
HisThr: 0.277 ± 0.243
HisVal: 0.831 ± 0.457
HisTrp: 0.0 ± 0.0
HisTyr: 1.385 ± 0.58
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.771 ± 0.931
IleCys: 0.554 ± 0.378
IleAsp: 6.927 ± 1.189
IleGlu: 6.65 ± 1.61
IlePhe: 3.048 ± 1.227
IleGly: 3.879 ± 0.954
IleHis: 1.385 ± 0.528
IleIle: 6.096 ± 1.447
IleLys: 8.035 ± 1.378
IleLeu: 7.481 ± 1.777
IleMet: 2.771 ± 0.998
IleAsn: 4.71 ± 1.019
IlePro: 2.217 ± 0.679
IleGln: 4.988 ± 1.05
IleArg: 4.433 ± 0.826
IleSer: 6.096 ± 1.154
IleThr: 4.988 ± 1.242
IleVal: 3.325 ± 0.991
IleTrp: 0.831 ± 0.45
IleTyr: 3.048 ± 0.941
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.096 ± 1.323
LysCys: 0.554 ± 0.544
LysAsp: 3.325 ± 0.637
LysGlu: 9.698 ± 1.253
LysPhe: 2.217 ± 0.639
LysGly: 3.048 ± 0.923
LysHis: 3.325 ± 0.867
LysIle: 6.096 ± 0.876
LysLys: 9.421 ± 1.37
LysLeu: 7.204 ± 1.461
LysMet: 1.94 ± 0.718
LysAsn: 5.819 ± 1.301
LysPro: 3.602 ± 0.805
LysGln: 4.433 ± 1.246
LysArg: 4.71 ± 1.167
LysSer: 5.542 ± 1.092
LysThr: 7.481 ± 0.981
LysVal: 5.265 ± 1.15
LysTrp: 0.831 ± 0.49
LysTyr: 2.494 ± 0.849
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.819 ± 1.361
LeuCys: 0.831 ± 0.579
LeuAsp: 8.035 ± 1.314
LeuGlu: 10.529 ± 1.887
LeuPhe: 3.325 ± 1.256
LeuGly: 6.373 ± 1.129
LeuHis: 1.663 ± 0.815
LeuIle: 6.65 ± 1.24
LeuLys: 9.698 ± 1.311
LeuLeu: 12.192 ± 2.208
LeuMet: 3.602 ± 0.77
LeuAsn: 5.542 ± 1.004
LeuPro: 3.879 ± 1.037
LeuGln: 5.819 ± 1.041
LeuArg: 1.108 ± 0.591
LeuSer: 5.542 ± 1.107
LeuThr: 5.265 ± 1.053
LeuVal: 4.433 ± 1.246
LeuTrp: 1.385 ± 0.633
LeuTyr: 3.048 ± 0.735
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.94 ± 0.644
MetCys: 0.0 ± 0.0
MetAsp: 1.385 ± 0.619
MetGlu: 3.048 ± 0.724
MetPhe: 0.831 ± 0.448
MetGly: 1.108 ± 0.526
MetHis: 0.554 ± 0.398
MetIle: 1.663 ± 0.556
MetLys: 2.771 ± 0.722
MetLeu: 1.663 ± 0.818
MetMet: 0.0 ± 0.0
MetAsn: 1.94 ± 0.813
MetPro: 0.277 ± 0.272
MetGln: 0.277 ± 0.262
MetArg: 1.663 ± 0.533
MetSer: 1.663 ± 0.649
MetThr: 3.879 ± 0.775
MetVal: 1.94 ± 0.813
MetTrp: 0.277 ± 0.324
MetTyr: 1.663 ± 0.577
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.048 ± 0.928
AsnCys: 0.277 ± 0.243
AsnAsp: 1.94 ± 0.618
AsnGlu: 3.879 ± 1.16
AsnPhe: 1.94 ± 0.71
AsnGly: 3.325 ± 0.88
AsnHis: 1.385 ± 0.571
AsnIle: 5.265 ± 1.156
AsnLys: 4.433 ± 1.035
AsnLeu: 6.373 ± 1.406
AsnMet: 0.831 ± 0.409
AsnAsn: 3.602 ± 0.885
AsnPro: 1.663 ± 0.612
AsnGln: 1.94 ± 0.591
AsnArg: 1.94 ± 0.582
AsnSer: 3.879 ± 0.924
AsnThr: 4.433 ± 1.05
AsnVal: 1.385 ± 0.644
AsnTrp: 1.663 ± 0.809
AsnTyr: 1.663 ± 0.585
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.217 ± 0.791
ProCys: 0.0 ± 0.0
ProAsp: 0.831 ± 0.467
ProGlu: 2.217 ± 0.784
ProPhe: 1.385 ± 0.662
ProGly: 1.108 ± 0.419
ProHis: 0.277 ± 0.281
ProIle: 1.663 ± 0.571
ProLys: 1.385 ± 0.884
ProLeu: 3.048 ± 0.852
ProMet: 0.277 ± 0.264
ProAsn: 1.663 ± 0.628
ProPro: 0.554 ± 0.359
ProGln: 0.554 ± 0.325
ProArg: 1.108 ± 0.682
ProSer: 1.108 ± 0.409
ProThr: 1.385 ± 0.506
ProVal: 1.94 ± 0.818
ProTrp: 0.0 ± 0.0
ProTyr: 0.831 ± 0.378
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.325 ± 1.221
GlnCys: 0.277 ± 0.281
GlnAsp: 2.217 ± 0.552
GlnGlu: 1.94 ± 0.569
GlnPhe: 2.217 ± 0.897
GlnGly: 1.663 ± 0.68
GlnHis: 1.94 ± 0.634
GlnIle: 3.602 ± 0.911
GlnLys: 3.602 ± 0.937
GlnLeu: 3.879 ± 0.735
GlnMet: 1.663 ± 0.659
GlnAsn: 3.325 ± 0.96
GlnPro: 0.277 ± 0.283
GlnGln: 2.217 ± 1.071
GlnArg: 1.94 ± 0.624
GlnSer: 2.771 ± 0.99
GlnThr: 3.048 ± 0.978
GlnVal: 3.325 ± 0.948
GlnTrp: 0.277 ± 0.258
GlnTyr: 2.217 ± 0.686
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.385 ± 0.584
ArgCys: 0.831 ± 0.417
ArgAsp: 1.663 ± 0.502
ArgGlu: 3.879 ± 0.813
ArgPhe: 0.831 ± 0.444
ArgGly: 1.385 ± 0.607
ArgHis: 1.385 ± 0.555
ArgIle: 5.542 ± 1.144
ArgLys: 3.879 ± 0.936
ArgLeu: 5.819 ± 1.292
ArgMet: 1.108 ± 0.536
ArgAsn: 1.663 ± 0.817
ArgPro: 1.108 ± 0.461
ArgGln: 2.494 ± 1.23
ArgArg: 1.94 ± 0.579
ArgSer: 1.94 ± 0.746
ArgThr: 2.494 ± 0.753
ArgVal: 1.385 ± 0.469
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.94 ± 0.62
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.494 ± 0.887
SerCys: 0.277 ± 0.277
SerAsp: 4.156 ± 0.802
SerGlu: 4.71 ± 1.224
SerPhe: 2.771 ± 0.915
SerGly: 2.217 ± 0.619
SerHis: 0.831 ± 0.354
SerIle: 4.988 ± 0.745
SerLys: 5.542 ± 0.95
SerLeu: 6.927 ± 1.118
SerMet: 1.94 ± 0.748
SerAsn: 3.048 ± 1.219
SerPro: 1.108 ± 0.47
SerGln: 1.385 ± 0.547
SerArg: 3.325 ± 0.983
SerSer: 2.494 ± 0.597
SerThr: 4.71 ± 0.854
SerVal: 3.879 ± 0.99
SerTrp: 0.277 ± 0.281
SerTyr: 4.156 ± 1.038
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.771 ± 0.744
ThrCys: 0.554 ± 0.421
ThrAsp: 3.048 ± 0.944
ThrGlu: 5.542 ± 1.372
ThrPhe: 3.048 ± 0.906
ThrGly: 4.433 ± 0.987
ThrHis: 1.108 ± 0.409
ThrIle: 6.373 ± 0.901
ThrLys: 5.265 ± 1.088
ThrLeu: 4.433 ± 1.046
ThrMet: 2.494 ± 0.882
ThrAsn: 2.217 ± 0.755
ThrPro: 1.108 ± 0.522
ThrGln: 2.771 ± 0.896
ThrArg: 1.94 ± 0.884
ThrSer: 3.879 ± 0.846
ThrThr: 3.048 ± 1.019
ThrVal: 5.265 ± 1.273
ThrTrp: 0.277 ± 0.289
ThrTyr: 2.494 ± 0.826
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.494 ± 1.001
ValCys: 0.0 ± 0.0
ValAsp: 3.048 ± 0.753
ValGlu: 1.94 ± 0.744
ValPhe: 3.325 ± 0.725
ValGly: 1.385 ± 0.715
ValHis: 0.0 ± 0.0
ValIle: 4.156 ± 0.95
ValLys: 5.265 ± 1.736
ValLeu: 4.988 ± 1.253
ValMet: 1.663 ± 0.702
ValAsn: 1.663 ± 0.791
ValPro: 1.385 ± 0.568
ValGln: 3.048 ± 1.051
ValArg: 2.494 ± 0.739
ValSer: 3.879 ± 0.776
ValThr: 1.94 ± 0.596
ValVal: 2.771 ± 1.002
ValTrp: 1.108 ± 0.529
ValTyr: 3.325 ± 0.862
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.94 ± 0.545
TrpCys: 0.0 ± 0.0
TrpAsp: 0.554 ± 0.421
TrpGlu: 0.831 ± 0.558
TrpPhe: 0.554 ± 0.378
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.831 ± 0.475
TrpLys: 0.277 ± 0.281
TrpLeu: 1.108 ± 0.482
TrpMet: 0.277 ± 0.254
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.554 ± 0.363
TrpArg: 0.554 ± 0.364
TrpSer: 1.108 ± 0.605
TrpThr: 0.277 ± 0.289
TrpVal: 0.277 ± 0.258
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.277 ± 0.296
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.554 ± 0.363
TyrCys: 0.831 ± 0.434
TyrAsp: 1.663 ± 0.726
TyrGlu: 1.94 ± 0.624
TyrPhe: 2.494 ± 0.951
TyrGly: 1.663 ± 0.846
TyrHis: 1.94 ± 0.586
TyrIle: 3.325 ± 1.139
TyrLys: 4.156 ± 0.807
TyrLeu: 7.481 ± 1.297
TyrMet: 1.385 ± 0.642
TyrAsn: 2.217 ± 0.598
TyrPro: 0.554 ± 0.354
TyrGln: 2.217 ± 0.73
TyrArg: 2.771 ± 0.709
TyrSer: 2.771 ± 0.725
TyrThr: 3.325 ± 0.789
TyrVal: 1.663 ± 0.9
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.663 ± 0.686
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (3610 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
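A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the function names `pair_per_mille` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the sample standard deviation over the resamples. The source does not say which spread measure the database actually uses.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling.

    Assumption: the error is the sample standard deviation of the
    frequency across the resampled proteomes.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.stdev(replicates)


# Toy input (hypothetical sequences, one-letter codes).
toy_proteome = ["MKLLE", "ALLK", "GGSLL"]
error_ll = bootstrap_error(toy_proteome, "LL")
```

Under this scheme, a dipeptide that occurs in no protein is also absent from every resample, which is consistent with the 0.0 ± 0.0 entries in the table.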

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski