Amino acid dipepetide frequency for Streptococcus satellite phage Javan416

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.614AlaAla: 0.614 ± 0.585
0.922AlaCys: 0.922 ± 0.614
4.301AlaAsp: 4.301 ± 1.517
4.301AlaGlu: 4.301 ± 1.217
1.229AlaPhe: 1.229 ± 0.552
2.765AlaGly: 2.765 ± 0.881
0.307AlaHis: 0.307 ± 0.235
5.837AlaIle: 5.837 ± 1.258
4.916AlaLys: 4.916 ± 1.147
7.066AlaLeu: 7.066 ± 1.818
1.843AlaMet: 1.843 ± 0.685
3.379AlaAsn: 3.379 ± 0.886
2.151AlaPro: 2.151 ± 0.658
2.151AlaGln: 2.151 ± 0.86
2.151AlaArg: 2.151 ± 0.678
1.843AlaSer: 1.843 ± 0.713
4.608AlaThr: 4.608 ± 1.689
2.151AlaVal: 2.151 ± 1.062
1.229AlaTrp: 1.229 ± 0.647
2.765AlaTyr: 2.765 ± 1.1
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.922CysGlu: 0.922 ± 0.391
0.614CysPhe: 0.614 ± 0.399
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.614CysLys: 0.614 ± 0.547
0.614CysLeu: 0.614 ± 0.42
0.0CysMet: 0.0 ± 0.0
0.307CysAsn: 0.307 ± 0.281
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.307CysArg: 0.307 ± 0.281
0.614CysSer: 0.614 ± 0.47
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.307CysTyr: 0.307 ± 0.293
0.0CysXaa: 0.0 ± 0.0
Asp
1.229AspAla: 1.229 ± 0.511
0.307AspCys: 0.307 ± 0.245
3.994AspAsp: 3.994 ± 1.198
5.837AspGlu: 5.837 ± 1.565
3.994AspPhe: 3.994 ± 1.103
1.536AspGly: 1.536 ± 0.703
0.922AspHis: 0.922 ± 0.585
4.916AspIle: 4.916 ± 1.028
7.066AspLys: 7.066 ± 1.545
4.608AspLeu: 4.608 ± 1.146
0.922AspMet: 0.922 ± 0.596
6.759AspAsn: 6.759 ± 1.012
1.843AspPro: 1.843 ± 0.548
0.614AspGln: 0.614 ± 0.489
2.765AspArg: 2.765 ± 1.008
2.151AspSer: 2.151 ± 0.925
2.458AspThr: 2.458 ± 1.019
2.765AspVal: 2.765 ± 0.83
0.614AspTrp: 0.614 ± 0.443
3.687AspTyr: 3.687 ± 0.869
0.0AspXaa: 0.0 ± 0.0
Glu
3.379GluAla: 3.379 ± 0.892
0.307GluCys: 0.307 ± 0.273
5.223GluAsp: 5.223 ± 1.024
7.68GluGlu: 7.68 ± 2.384
3.994GluPhe: 3.994 ± 1.323
1.843GluGly: 1.843 ± 0.773
1.843GluHis: 1.843 ± 0.608
6.144GluIle: 6.144 ± 1.932
7.988GluLys: 7.988 ± 1.671
14.132GluLeu: 14.132 ± 3.206
1.229GluMet: 1.229 ± 0.454
7.988GluAsn: 7.988 ± 1.168
0.922GluPro: 0.922 ± 0.584
4.301GluGln: 4.301 ± 1.265
4.608GluArg: 4.608 ± 1.249
3.994GluSer: 3.994 ± 0.981
3.072GluThr: 3.072 ± 1.097
5.837GluVal: 5.837 ± 1.54
0.922GluTrp: 0.922 ± 0.411
4.916GluTyr: 4.916 ± 1.328
0.0GluXaa: 0.0 ± 0.0
Phe
1.229PheAla: 1.229 ± 0.478
0.0PheCys: 0.0 ± 0.0
2.458PheAsp: 2.458 ± 0.9
2.458PheGlu: 2.458 ± 0.993
1.229PhePhe: 1.229 ± 0.55
2.151PheGly: 2.151 ± 0.797
1.229PheHis: 1.229 ± 0.381
1.536PheIle: 1.536 ± 0.767
4.301PheLys: 4.301 ± 0.937
4.301PheLeu: 4.301 ± 1.27
1.536PheMet: 1.536 ± 0.749
4.301PheAsn: 4.301 ± 1.508
0.922PhePro: 0.922 ± 0.395
0.614PheGln: 0.614 ± 0.318
3.379PheArg: 3.379 ± 0.895
4.608PheSer: 4.608 ± 0.915
1.229PheThr: 1.229 ± 0.504
1.843PheVal: 1.843 ± 0.526
0.614PheTrp: 0.614 ± 0.382
2.151PheTyr: 2.151 ± 0.592
0.0PheXaa: 0.0 ± 0.0
Gly
2.458GlyAla: 2.458 ± 0.706
0.307GlyCys: 0.307 ± 0.281
1.536GlyAsp: 1.536 ± 0.635
3.994GlyGlu: 3.994 ± 0.948
1.229GlyPhe: 1.229 ± 0.532
1.843GlyGly: 1.843 ± 0.611
0.307GlyHis: 0.307 ± 0.281
3.994GlyIle: 3.994 ± 0.99
3.687GlyLys: 3.687 ± 1.532
4.916GlyLeu: 4.916 ± 1.405
0.922GlyMet: 0.922 ± 0.454
1.536GlyAsn: 1.536 ± 0.818
0.0GlyPro: 0.0 ± 0.0
0.922GlyGln: 0.922 ± 0.492
2.151GlyArg: 2.151 ± 0.683
1.229GlySer: 1.229 ± 0.699
5.223GlyThr: 5.223 ± 1.2
3.687GlyVal: 3.687 ± 0.751
0.614GlyTrp: 0.614 ± 0.399
2.765GlyTyr: 2.765 ± 0.771
0.0GlyXaa: 0.0 ± 0.0
His
1.229HisAla: 1.229 ± 1.123
0.0HisCys: 0.0 ± 0.0
0.614HisAsp: 0.614 ± 0.411
1.229HisGlu: 1.229 ± 0.605
1.229HisPhe: 1.229 ± 0.627
0.307HisGly: 0.307 ± 0.273
0.614HisHis: 0.614 ± 0.425
1.536HisIle: 1.536 ± 0.723
1.229HisLys: 1.229 ± 0.381
2.458HisLeu: 2.458 ± 0.879
0.307HisMet: 0.307 ± 0.369
1.843HisAsn: 1.843 ± 1.008
0.614HisPro: 0.614 ± 0.318
0.307HisGln: 0.307 ± 0.273
0.307HisArg: 0.307 ± 0.281
1.536HisSer: 1.536 ± 0.588
0.614HisThr: 0.614 ± 0.346
0.922HisVal: 0.922 ± 0.598
0.307HisTrp: 0.307 ± 0.372
0.922HisTyr: 0.922 ± 0.391
0.0HisXaa: 0.0 ± 0.0
Ile
6.759IleAla: 6.759 ± 1.744
0.614IleCys: 0.614 ± 0.412
7.373IleAsp: 7.373 ± 1.001
6.452IleGlu: 6.452 ± 1.435
3.072IlePhe: 3.072 ± 0.813
3.379IleGly: 3.379 ± 0.717
1.843IleHis: 1.843 ± 0.433
3.379IleIle: 3.379 ± 0.879
10.753IleLys: 10.753 ± 1.514
6.144IleLeu: 6.144 ± 1.267
1.843IleMet: 1.843 ± 0.782
6.144IleAsn: 6.144 ± 1.648
2.458IlePro: 2.458 ± 0.862
2.765IleGln: 2.765 ± 0.686
2.458IleArg: 2.458 ± 0.689
4.301IleSer: 4.301 ± 1.253
4.301IleThr: 4.301 ± 1.193
2.458IleVal: 2.458 ± 0.842
0.307IleTrp: 0.307 ± 0.322
2.458IleTyr: 2.458 ± 0.844
0.0IleXaa: 0.0 ± 0.0
Lys
6.452LysAla: 6.452 ± 2.15
0.0LysCys: 0.0 ± 0.0
5.53LysAsp: 5.53 ± 0.859
13.21LysGlu: 13.21 ± 2.133
0.614LysPhe: 0.614 ± 0.402
5.53LysGly: 5.53 ± 1.252
1.229LysHis: 1.229 ± 0.782
8.295LysIle: 8.295 ± 1.876
5.223LysLys: 5.223 ± 1.008
6.759LysLeu: 6.759 ± 1.39
1.843LysMet: 1.843 ± 0.531
5.53LysAsn: 5.53 ± 1.38
3.379LysPro: 3.379 ± 0.903
4.916LysGln: 4.916 ± 1.876
5.223LysArg: 5.223 ± 1.119
6.452LysSer: 6.452 ± 1.468
4.301LysThr: 4.301 ± 1.21
3.994LysVal: 3.994 ± 0.994
0.614LysTrp: 0.614 ± 0.522
3.994LysTyr: 3.994 ± 1.059
0.0LysXaa: 0.0 ± 0.0
Leu
6.759LeuAla: 6.759 ± 1.393
0.307LeuCys: 0.307 ± 0.235
7.066LeuAsp: 7.066 ± 1.633
11.06LeuGlu: 11.06 ± 1.628
4.608LeuPhe: 4.608 ± 1.358
3.994LeuGly: 3.994 ± 1.352
1.229LeuHis: 1.229 ± 0.655
6.144LeuIle: 6.144 ± 1.499
9.217LeuLys: 9.217 ± 1.481
8.909LeuLeu: 8.909 ± 1.812
1.843LeuMet: 1.843 ± 0.699
7.373LeuAsn: 7.373 ± 1.702
2.765LeuPro: 2.765 ± 0.754
4.916LeuGln: 4.916 ± 1.383
4.608LeuArg: 4.608 ± 1.265
5.53LeuSer: 5.53 ± 1.334
4.608LeuThr: 4.608 ± 1.18
3.687LeuVal: 3.687 ± 0.711
0.614LeuTrp: 0.614 ± 0.346
3.994LeuTyr: 3.994 ± 0.758
0.0LeuXaa: 0.0 ± 0.0
Met
3.072MetAla: 3.072 ± 1.32
0.0MetCys: 0.0 ± 0.0
0.922MetAsp: 0.922 ± 0.635
0.922MetGlu: 0.922 ± 0.529
0.307MetPhe: 0.307 ± 0.307
0.614MetGly: 0.614 ± 0.441
0.0MetHis: 0.0 ± 0.0
0.307MetIle: 0.307 ± 0.322
1.843MetLys: 1.843 ± 0.738
2.765MetLeu: 2.765 ± 0.901
0.614MetMet: 0.614 ± 0.487
2.151MetAsn: 2.151 ± 0.526
1.229MetPro: 1.229 ± 0.576
1.229MetGln: 1.229 ± 0.633
0.307MetArg: 0.307 ± 0.235
0.614MetSer: 0.614 ± 0.405
2.458MetThr: 2.458 ± 0.893
0.922MetVal: 0.922 ± 0.481
0.0MetTrp: 0.0 ± 0.0
0.922MetTyr: 0.922 ± 0.584
0.0MetXaa: 0.0 ± 0.0
Asn
5.53AsnAla: 5.53 ± 0.924
0.307AsnCys: 0.307 ± 0.322
2.765AsnAsp: 2.765 ± 0.536
4.916AsnGlu: 4.916 ± 1.219
3.687AsnPhe: 3.687 ± 0.85
5.223AsnGly: 5.223 ± 1.131
1.229AsnHis: 1.229 ± 0.551
7.988AsnIle: 7.988 ± 1.629
7.68AsnLys: 7.68 ± 1.023
5.223AsnLeu: 5.223 ± 1.277
1.229AsnMet: 1.229 ± 0.736
5.223AsnAsn: 5.223 ± 2.127
3.687AsnPro: 3.687 ± 1.339
3.379AsnGln: 3.379 ± 1.069
5.223AsnArg: 5.223 ± 1.056
3.379AsnSer: 3.379 ± 0.89
3.072AsnThr: 3.072 ± 0.905
2.458AsnVal: 2.458 ± 0.716
0.0AsnTrp: 0.0 ± 0.0
1.536AsnTyr: 1.536 ± 0.839
0.0AsnXaa: 0.0 ± 0.0
Pro
0.922ProAla: 0.922 ± 0.423
0.0ProCys: 0.0 ± 0.0
1.229ProAsp: 1.229 ± 0.806
1.843ProGlu: 1.843 ± 0.752
0.922ProPhe: 0.922 ± 0.545
0.0ProGly: 0.0 ± 0.0
0.614ProHis: 0.614 ± 0.34
2.765ProIle: 2.765 ± 0.895
4.301ProLys: 4.301 ± 1.164
2.765ProLeu: 2.765 ± 1.091
0.307ProMet: 0.307 ± 0.322
2.458ProAsn: 2.458 ± 1.277
0.614ProPro: 0.614 ± 0.493
0.614ProGln: 0.614 ± 0.346
2.765ProArg: 2.765 ± 1.116
0.614ProSer: 0.614 ± 0.561
2.151ProThr: 2.151 ± 0.526
1.229ProVal: 1.229 ± 0.455
0.307ProTrp: 0.307 ± 0.273
0.922ProTyr: 0.922 ± 0.423
0.0ProXaa: 0.0 ± 0.0
Gln
3.072GlnAla: 3.072 ± 0.901
0.307GlnCys: 0.307 ± 0.273
2.458GlnAsp: 2.458 ± 0.636
3.994GlnGlu: 3.994 ± 1.015
0.922GlnPhe: 0.922 ± 0.514
1.843GlnGly: 1.843 ± 0.949
1.536GlnHis: 1.536 ± 0.74
4.301GlnIle: 4.301 ± 1.244
2.765GlnLys: 2.765 ± 0.829
3.072GlnLeu: 3.072 ± 1.039
0.614GlnMet: 0.614 ± 0.443
2.458GlnAsn: 2.458 ± 1.038
1.229GlnPro: 1.229 ± 0.658
1.229GlnGln: 1.229 ± 0.685
2.151GlnArg: 2.151 ± 0.672
1.536GlnSer: 1.536 ± 0.43
2.151GlnThr: 2.151 ± 0.639
1.843GlnVal: 1.843 ± 1.039
0.0GlnTrp: 0.0 ± 0.0
2.765GlnTyr: 2.765 ± 1.08
0.0GlnXaa: 0.0 ± 0.0
Arg
2.765ArgAla: 2.765 ± 0.857
0.307ArgCys: 0.307 ± 0.235
1.843ArgAsp: 1.843 ± 0.559
5.53ArgGlu: 5.53 ± 1.161
3.379ArgPhe: 3.379 ± 1.055
2.151ArgGly: 2.151 ± 0.858
1.229ArgHis: 1.229 ± 0.72
3.687ArgIle: 3.687 ± 0.802
2.458ArgLys: 2.458 ± 0.607
4.608ArgLeu: 4.608 ± 1.308
0.614ArgMet: 0.614 ± 0.33
3.072ArgAsn: 3.072 ± 0.813
1.229ArgPro: 1.229 ± 0.809
3.379ArgGln: 3.379 ± 0.774
1.229ArgArg: 1.229 ± 0.613
0.922ArgSer: 0.922 ± 0.396
2.458ArgThr: 2.458 ± 0.641
2.458ArgVal: 2.458 ± 0.777
0.614ArgTrp: 0.614 ± 0.382
2.765ArgTyr: 2.765 ± 0.993
0.0ArgXaa: 0.0 ± 0.0
Ser
2.458SerAla: 2.458 ± 1.176
0.0SerCys: 0.0 ± 0.0
4.608SerAsp: 4.608 ± 1.28
3.994SerGlu: 3.994 ± 1.101
3.072SerPhe: 3.072 ± 0.832
2.151SerGly: 2.151 ± 0.992
1.229SerHis: 1.229 ± 0.566
4.608SerIle: 4.608 ± 1.161
4.916SerLys: 4.916 ± 1.072
5.53SerLeu: 5.53 ± 0.621
0.922SerMet: 0.922 ± 0.516
2.151SerAsn: 2.151 ± 0.648
0.307SerPro: 0.307 ± 0.372
0.614SerGln: 0.614 ± 0.34
1.536SerArg: 1.536 ± 0.683
1.229SerSer: 1.229 ± 0.638
1.536SerThr: 1.536 ± 0.6
3.687SerVal: 3.687 ± 0.782
0.0SerTrp: 0.0 ± 0.0
3.994SerTyr: 3.994 ± 1.212
0.0SerXaa: 0.0 ± 0.0
Thr
2.458ThrAla: 2.458 ± 0.821
0.307ThrCys: 0.307 ± 0.322
1.229ThrAsp: 1.229 ± 0.551
5.223ThrGlu: 5.223 ± 1.664
1.843ThrPhe: 1.843 ± 0.644
4.301ThrGly: 4.301 ± 1.13
1.229ThrHis: 1.229 ± 0.589
4.916ThrIle: 4.916 ± 1.142
2.765ThrLys: 2.765 ± 0.962
4.608ThrLeu: 4.608 ± 1.107
2.151ThrMet: 2.151 ± 1.012
2.458ThrAsn: 2.458 ± 0.836
1.536ThrPro: 1.536 ± 0.819
2.458ThrGln: 2.458 ± 0.766
3.379ThrArg: 3.379 ± 1.045
1.843ThrSer: 1.843 ± 0.683
3.379ThrThr: 3.379 ± 0.986
4.608ThrVal: 4.608 ± 1.188
0.614ThrTrp: 0.614 ± 0.384
1.229ThrTyr: 1.229 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
3.687ValAla: 3.687 ± 0.983
0.0ValCys: 0.0 ± 0.0
2.458ValAsp: 2.458 ± 0.703
3.687ValGlu: 3.687 ± 1.121
3.687ValPhe: 3.687 ± 0.986
1.229ValGly: 1.229 ± 0.587
0.307ValHis: 0.307 ± 0.281
5.53ValIle: 5.53 ± 1.424
5.837ValLys: 5.837 ± 1.134
3.994ValLeu: 3.994 ± 0.989
0.922ValMet: 0.922 ± 0.566
4.301ValAsn: 4.301 ± 1.509
1.536ValPro: 1.536 ± 0.774
1.536ValGln: 1.536 ± 0.728
0.614ValArg: 0.614 ± 0.441
3.072ValSer: 3.072 ± 1.271
3.379ValThr: 3.379 ± 0.857
3.072ValVal: 3.072 ± 1.503
0.307ValTrp: 0.307 ± 0.363
1.536ValTyr: 1.536 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
0.307TrpAla: 0.307 ± 0.322
0.0TrpCys: 0.0 ± 0.0
0.614TrpAsp: 0.614 ± 0.383
0.922TrpGlu: 0.922 ± 0.498
0.307TrpPhe: 0.307 ± 0.363
0.307TrpGly: 0.307 ± 0.235
0.307TrpHis: 0.307 ± 0.273
0.307TrpIle: 0.307 ± 0.372
0.614TrpLys: 0.614 ± 0.405
1.229TrpLeu: 1.229 ± 0.567
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.307TrpGln: 0.307 ± 0.372
0.307TrpArg: 0.307 ± 0.293
0.614TrpSer: 0.614 ± 0.458
0.0TrpThr: 0.0 ± 0.0
1.229TrpVal: 1.229 ± 0.62
0.0TrpTrp: 0.0 ± 0.0
0.307TrpTyr: 0.307 ± 0.293
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.151TyrAla: 2.151 ± 0.906
0.307TyrCys: 0.307 ± 0.273
2.765TyrAsp: 2.765 ± 0.569
2.151TyrGlu: 2.151 ± 0.613
2.151TyrPhe: 2.151 ± 0.693
2.151TyrGly: 2.151 ± 1.063
0.922TyrHis: 0.922 ± 0.417
3.072TyrIle: 3.072 ± 0.605
4.916TyrLys: 4.916 ± 0.818
5.53TyrLeu: 5.53 ± 1.408
1.229TyrMet: 1.229 ± 0.619
4.301TyrAsn: 4.301 ± 0.916
0.922TyrPro: 0.922 ± 0.422
3.994TyrGln: 3.994 ± 1.468
1.229TyrArg: 1.229 ± 0.618
2.458TyrSer: 2.458 ± 0.966
1.843TyrThr: 1.843 ± 0.916
1.843TyrVal: 1.843 ± 0.679
0.0TyrTrp: 0.0 ± 0.0
2.151TyrTyr: 2.151 ± 0.654
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (3256 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski