Amino acid dipeptide frequency for Streptococcus satellite phage Javan574

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
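The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the one-letter toy sequences, and the choice to count pairs within each protein separately are all hypothetical, and the exact normalization used by the database may differ.

```python
from collections import Counter

# Expected frequency if all 400 standard dipeptides were equally likely,
# expressed in per mille like the table values (0.25% == 2.5 per mille).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
toy_proteins = ["MKLLE", "LLEK"]
freqs = dipeptide_per_mille(toy_proteins)

# Label each observed dipeptide relative to the uniform expectation.
labels = {
    dp: "over" if f > UNIFORM_EXPECTATION else "under"
    for dp, f in freqs.items()
}
```

On such a tiny input every observed dipeptide is far above 2.5‰; the comparison only becomes informative on a whole proteome.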

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.864 ± 0.522
AlaAsp: 4.32 ± 1.85
AlaGlu: 5.616 ± 1.498
AlaPhe: 2.592 ± 1.081
AlaGly: 3.024 ± 0.97
AlaHis: 0.432 ± 0.461
AlaIle: 3.888 ± 1.116
AlaLys: 3.888 ± 1.305
AlaLeu: 7.775 ± 1.506
AlaMet: 2.592 ± 1.205
AlaAsn: 2.592 ± 0.915
AlaPro: 2.592 ± 1.046
AlaGln: 3.456 ± 1.549
AlaArg: 2.592 ± 0.84
AlaSer: 4.32 ± 1.237
AlaThr: 4.752 ± 1.844
AlaVal: 3.888 ± 1.441
AlaTrp: 0.432 ± 0.344
AlaTyr: 3.456 ± 1.192
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.432 ± 0.461
CysCys: 0.0 ± 0.0
CysAsp: 0.864 ± 0.603
CysGlu: 0.432 ± 0.45
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.432 ± 0.349
CysLeu: 0.432 ± 0.405
CysMet: 0.0 ± 0.0
CysAsn: 0.864 ± 0.698
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.432 ± 0.344
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 0.432 ± 0.405
AspAsp: 4.32 ± 2.493
AspGlu: 2.592 ± 0.971
AspPhe: 3.888 ± 1.397
AspGly: 2.16 ± 0.867
AspHis: 0.0 ± 0.0
AspIle: 5.184 ± 1.46
AspLys: 6.479 ± 1.722
AspLeu: 6.048 ± 1.473
AspMet: 3.024 ± 0.841
AspAsn: 3.888 ± 0.917
AspPro: 1.296 ± 0.502
AspGln: 1.728 ± 1.156
AspArg: 1.728 ± 0.594
AspSer: 3.024 ± 1.277
AspThr: 3.888 ± 1.321
AspVal: 3.888 ± 0.89
AspTrp: 0.864 ± 0.462
AspTyr: 5.184 ± 1.56
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.343 ± 1.19
GluCys: 0.432 ± 0.461
GluAsp: 5.184 ± 2.162
GluGlu: 6.048 ± 2.384
GluPhe: 3.024 ± 1.032
GluGly: 3.888 ± 1.096
GluHis: 0.864 ± 0.444
GluIle: 6.048 ± 1.648
GluLys: 5.616 ± 1.489
GluLeu: 11.231 ± 2.288
GluMet: 1.296 ± 0.545
GluAsn: 6.911 ± 1.789
GluPro: 3.024 ± 0.84
GluGln: 2.16 ± 1.029
GluArg: 3.456 ± 1.173
GluSer: 4.32 ± 1.215
GluThr: 3.888 ± 1.065
GluVal: 3.024 ± 1.004
GluTrp: 0.432 ± 0.349
GluTyr: 3.888 ± 1.502
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.296 ± 0.701
PheCys: 0.432 ± 0.349
PheAsp: 2.16 ± 0.939
PheGlu: 4.32 ± 1.114
PhePhe: 1.296 ± 0.676
PheGly: 3.024 ± 1.162
PheHis: 0.432 ± 0.344
PheIle: 3.024 ± 0.878
PheLys: 3.024 ± 0.844
PheLeu: 3.456 ± 0.801
PheMet: 0.432 ± 0.422
PheAsn: 3.888 ± 1.358
PhePro: 0.432 ± 0.487
PheGln: 2.592 ± 0.873
PheArg: 2.592 ± 1.219
PheSer: 1.728 ± 0.648
PheThr: 3.024 ± 1.152
PheVal: 0.432 ± 0.405
PheTrp: 0.432 ± 0.349
PheTyr: 2.592 ± 0.774
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.888 ± 1.251
GlyCys: 0.432 ± 0.344
GlyAsp: 3.456 ± 1.547
GlyGlu: 3.024 ± 1.007
GlyPhe: 3.888 ± 1.452
GlyGly: 1.728 ± 0.482
GlyHis: 0.432 ± 0.344
GlyIle: 3.456 ± 0.968
GlyLys: 5.184 ± 1.073
GlyLeu: 6.479 ± 2.031
GlyMet: 0.864 ± 0.623
GlyAsn: 1.728 ± 1.069
GlyPro: 0.0 ± 0.0
GlyGln: 3.888 ± 1.257
GlyArg: 3.024 ± 0.703
GlySer: 0.864 ± 0.556
GlyThr: 0.864 ± 0.522
GlyVal: 4.32 ± 1.443
GlyTrp: 0.864 ± 0.698
GlyTyr: 0.864 ± 0.617
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.024 ± 0.822
HisCys: 0.0 ± 0.0
HisAsp: 1.296 ± 0.523
HisGlu: 1.296 ± 0.724
HisPhe: 1.296 ± 0.708
HisGly: 1.728 ± 0.675
HisHis: 0.432 ± 0.461
HisIle: 0.0 ± 0.0
HisLys: 1.296 ± 0.923
HisLeu: 2.16 ± 0.606
HisMet: 0.0 ± 0.0
HisAsn: 0.432 ± 0.349
HisPro: 0.0 ± 0.0
HisGln: 0.432 ± 0.45
HisArg: 0.864 ± 0.584
HisSer: 0.864 ± 0.49
HisThr: 2.592 ± 0.684
HisVal: 0.432 ± 0.344
HisTrp: 0.0 ± 0.0
HisTyr: 0.864 ± 0.439
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.184 ± 1.196
IleCys: 0.432 ± 0.45
IleAsp: 4.752 ± 0.918
IleGlu: 6.048 ± 2.191
IlePhe: 3.456 ± 1.17
IleGly: 2.16 ± 0.604
IleHis: 1.728 ± 0.837
IleIle: 3.024 ± 1.069
IleLys: 6.048 ± 1.494
IleLeu: 3.456 ± 0.871
IleMet: 1.728 ± 0.72
IleAsn: 3.024 ± 1.315
IlePro: 2.592 ± 1.252
IleGln: 1.296 ± 0.83
IleArg: 1.728 ± 1.017
IleSer: 3.024 ± 1.217
IleThr: 7.343 ± 1.077
IleVal: 2.16 ± 0.963
IleTrp: 0.432 ± 0.487
IleTyr: 2.16 ± 0.604
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.911 ± 1.826
LysCys: 0.0 ± 0.0
LysAsp: 2.16 ± 1.257
LysGlu: 7.343 ± 1.918
LysPhe: 1.296 ± 0.708
LysGly: 4.752 ± 1.288
LysHis: 3.456 ± 0.613
LysIle: 6.048 ± 1.397
LysLys: 9.503 ± 2.389
LysLeu: 10.367 ± 1.936
LysMet: 1.728 ± 0.669
LysAsn: 3.024 ± 0.937
LysPro: 6.048 ± 1.355
LysGln: 4.752 ± 1.827
LysArg: 6.479 ± 1.422
LysSer: 2.592 ± 1.029
LysThr: 6.479 ± 0.972
LysVal: 5.184 ± 1.174
LysTrp: 0.864 ± 0.569
LysTyr: 4.32 ± 1.206
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.616 ± 2.269
LeuCys: 0.0 ± 0.0
LeuAsp: 10.799 ± 1.834
LeuGlu: 12.527 ± 2.532
LeuPhe: 4.32 ± 1.502
LeuGly: 5.616 ± 2.077
LeuHis: 0.864 ± 0.49
LeuIle: 6.048 ± 1.716
LeuLys: 9.503 ± 1.992
LeuLeu: 10.367 ± 1.834
LeuMet: 2.16 ± 0.704
LeuAsn: 5.184 ± 1.608
LeuPro: 3.456 ± 1.221
LeuGln: 1.728 ± 0.649
LeuArg: 3.888 ± 1.338
LeuSer: 4.752 ± 1.138
LeuThr: 6.048 ± 1.411
LeuVal: 7.775 ± 2.142
LeuTrp: 1.296 ± 0.639
LeuTyr: 3.888 ± 0.934
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.888 ± 1.678
MetCys: 0.0 ± 0.0
MetAsp: 0.864 ± 0.49
MetGlu: 2.16 ± 0.752
MetPhe: 0.864 ± 0.597
MetGly: 0.432 ± 0.349
MetHis: 0.0 ± 0.0
MetIle: 1.296 ± 0.652
MetLys: 2.592 ± 0.797
MetLeu: 0.864 ± 0.439
MetMet: 0.0 ± 0.0
MetAsn: 2.16 ± 0.969
MetPro: 0.432 ± 0.349
MetGln: 0.432 ± 0.344
MetArg: 0.864 ± 0.619
MetSer: 1.296 ± 0.639
MetThr: 2.16 ± 0.84
MetVal: 0.864 ± 0.497
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.024 ± 1.067
AsnCys: 0.0 ± 0.0
AsnAsp: 3.456 ± 1.26
AsnGlu: 3.888 ± 0.727
AsnPhe: 0.432 ± 0.382
AsnGly: 3.888 ± 1.31
AsnHis: 2.16 ± 0.794
AsnIle: 2.592 ± 0.815
AsnLys: 5.616 ± 1.285
AsnLeu: 6.048 ± 1.477
AsnMet: 0.432 ± 0.596
AsnAsn: 2.592 ± 0.924
AsnPro: 4.32 ± 1.013
AsnGln: 2.16 ± 0.655
AsnArg: 1.728 ± 0.632
AsnSer: 3.024 ± 1.022
AsnThr: 2.16 ± 0.859
AsnVal: 3.888 ± 1.315
AsnTrp: 0.432 ± 0.461
AsnTyr: 0.864 ± 0.516
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.024 ± 0.902
ProCys: 0.0 ± 0.0
ProAsp: 2.16 ± 0.952
ProGlu: 1.296 ± 1.216
ProPhe: 2.16 ± 1.104
ProGly: 1.296 ± 0.689
ProHis: 0.0 ± 0.0
ProIle: 0.864 ± 0.617
ProLys: 5.184 ± 2.226
ProLeu: 3.024 ± 0.886
ProMet: 0.432 ± 0.465
ProAsn: 3.456 ± 1.687
ProPro: 1.296 ± 0.871
ProGln: 3.456 ± 1.106
ProArg: 3.456 ± 0.848
ProSer: 1.296 ± 0.502
ProThr: 1.728 ± 0.627
ProVal: 2.16 ± 0.918
ProTrp: 0.432 ± 0.349
ProTyr: 1.296 ± 0.577
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.752 ± 1.231
GlnCys: 0.0 ± 0.0
GlnAsp: 1.728 ± 1.002
GlnGlu: 3.456 ± 1.355
GlnPhe: 0.432 ± 0.349
GlnGly: 2.592 ± 1.293
GlnHis: 0.864 ± 0.49
GlnIle: 2.16 ± 0.835
GlnLys: 3.456 ± 1.294
GlnLeu: 3.456 ± 0.936
GlnMet: 0.432 ± 0.382
GlnAsn: 1.296 ± 0.482
GlnPro: 1.296 ± 0.582
GlnGln: 3.456 ± 0.834
GlnArg: 2.16 ± 0.911
GlnSer: 2.592 ± 0.997
GlnThr: 3.024 ± 1.064
GlnVal: 2.592 ± 0.729
GlnTrp: 0.432 ± 0.349
GlnTyr: 1.728 ± 0.828
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.592 ± 0.934
ArgCys: 0.0 ± 0.0
ArgAsp: 1.728 ± 0.721
ArgGlu: 3.888 ± 1.414
ArgPhe: 3.888 ± 1.106
ArgGly: 2.16 ± 0.923
ArgHis: 1.728 ± 0.77
ArgIle: 4.752 ± 1.251
ArgLys: 2.592 ± 0.924
ArgLeu: 5.184 ± 2.197
ArgMet: 0.432 ± 0.473
ArgAsn: 3.024 ± 0.891
ArgPro: 1.296 ± 0.893
ArgGln: 1.296 ± 0.523
ArgArg: 1.296 ± 0.679
ArgSer: 1.296 ± 0.688
ArgThr: 3.888 ± 1.198
ArgVal: 2.592 ± 1.028
ArgTrp: 0.432 ± 0.454
ArgTyr: 2.592 ± 1.348
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.728 ± 0.632
SerCys: 0.0 ± 0.0
SerAsp: 3.024 ± 0.906
SerGlu: 3.888 ± 1.11
SerPhe: 0.864 ± 0.453
SerGly: 3.888 ± 1.116
SerHis: 0.864 ± 0.455
SerIle: 3.456 ± 0.941
SerLys: 4.752 ± 1.712
SerLeu: 6.048 ± 1.34
SerMet: 0.432 ± 0.344
SerAsn: 3.888 ± 0.81
SerPro: 1.296 ± 0.437
SerGln: 1.296 ± 0.725
SerArg: 0.432 ± 0.45
SerSer: 1.296 ± 0.694
SerThr: 3.456 ± 1.021
SerVal: 1.728 ± 0.609
SerTrp: 0.864 ± 0.439
SerTyr: 3.456 ± 1.098
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.888 ± 1.057
ThrCys: 0.0 ± 0.0
ThrAsp: 0.864 ± 0.486
ThrGlu: 5.616 ± 1.921
ThrPhe: 4.752 ± 1.928
ThrGly: 3.024 ± 0.962
ThrHis: 2.592 ± 0.738
ThrIle: 3.888 ± 0.895
ThrLys: 4.752 ± 1.449
ThrLeu: 8.207 ± 1.268
ThrMet: 2.16 ± 0.942
ThrAsn: 1.728 ± 0.874
ThrPro: 3.024 ± 0.997
ThrGln: 1.728 ± 0.998
ThrArg: 3.456 ± 0.855
ThrSer: 3.888 ± 0.879
ThrThr: 3.456 ± 1.279
ThrVal: 3.888 ± 1.111
ThrTrp: 0.864 ± 0.569
ThrTyr: 3.456 ± 1.482
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.32 ± 1.436
ValCys: 0.432 ± 0.349
ValAsp: 3.024 ± 0.866
ValGlu: 3.888 ± 1.119
ValPhe: 2.16 ± 0.684
ValGly: 1.728 ± 0.616
ValHis: 0.432 ± 0.45
ValIle: 1.296 ± 0.502
ValLys: 6.479 ± 1.826
ValLeu: 5.616 ± 1.301
ValMet: 1.296 ± 0.772
ValAsn: 2.592 ± 0.8
ValPro: 3.024 ± 1.344
ValGln: 1.296 ± 0.634
ValArg: 2.592 ± 1.007
ValSer: 3.456 ± 0.955
ValThr: 3.456 ± 0.941
ValVal: 3.456 ± 1.161
ValTrp: 0.0 ± 0.0
ValTyr: 3.456 ± 1.103
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.728 ± 0.775
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.432 ± 0.349
TrpIle: 0.432 ± 0.349
TrpLys: 0.864 ± 0.477
TrpLeu: 1.296 ± 0.618
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.432 ± 0.349
TrpGln: 0.432 ± 0.349
TrpArg: 0.432 ± 0.349
TrpSer: 1.296 ± 0.613
TrpThr: 0.0 ± 0.0
TrpVal: 1.296 ± 0.502
TrpTrp: 0.432 ± 0.344
TrpTyr: 0.432 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.16 ± 0.995
TyrCys: 0.432 ± 0.405
TyrAsp: 3.456 ± 0.714
TyrGlu: 3.024 ± 1.156
TyrPhe: 0.0 ± 0.0
TyrGly: 2.16 ± 0.708
TyrHis: 1.296 ± 0.634
TyrIle: 4.32 ± 1.203
TyrLys: 6.048 ± 1.46
TyrLeu: 4.32 ± 1.122
TyrMet: 1.296 ± 0.634
TyrAsn: 0.864 ± 0.974
TyrPro: 2.16 ± 0.864
TyrGln: 4.32 ± 1.186
TyrArg: 3.456 ± 1.174
TyrSer: 1.728 ± 0.745
TyrThr: 3.024 ± 0.836
TyrVal: 0.432 ± 0.349
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.296 ± 0.702
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2316 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
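A protein-level bootstrap of this kind might look like the sketch below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates; the page does not state either detail, and the names `bootstrap_error` and `dipeptide_counts` as well as the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the spread of the replicate frequencies is the error.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy_proteins = ["MKLLE", "LLEK", "MKTAY", "GLLKE"]
mean_ll, err_ll = bootstrap_error(toy_proteins, "LL")
mean_ww, err_ww = bootstrap_error(toy_proteins, "WW")
```

A dipeptide that never occurs in any protein yields 0.0 ± 0.0 in every replicate, which is consistent with the all-zero entries in the table.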

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski