Amino acid dipepetide frequency for Streptococcus satellite phage Javan500

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.282AlaAla: 0.282 ± 0.277
1.128AlaCys: 1.128 ± 0.474
2.539AlaAsp: 2.539 ± 0.877
4.231AlaGlu: 4.231 ± 1.137
3.667AlaPhe: 3.667 ± 0.866
1.693AlaGly: 1.693 ± 0.589
0.0AlaHis: 0.0 ± 0.0
7.052AlaIle: 7.052 ± 1.181
4.231AlaLys: 4.231 ± 1.076
4.513AlaLeu: 4.513 ± 1.117
1.693AlaMet: 1.693 ± 0.661
3.385AlaAsn: 3.385 ± 0.691
1.975AlaPro: 1.975 ± 0.774
1.693AlaGln: 1.693 ± 0.472
2.539AlaArg: 2.539 ± 0.587
4.513AlaSer: 4.513 ± 1.486
4.513AlaThr: 4.513 ± 0.674
3.949AlaVal: 3.949 ± 0.905
1.128AlaTrp: 1.128 ± 0.707
2.821AlaTyr: 2.821 ± 0.841
0.0AlaXaa: 0.0 ± 0.0
Cys
0.846CysAla: 0.846 ± 0.452
0.282CysCys: 0.282 ± 0.248
0.564CysAsp: 0.564 ± 0.373
0.282CysGlu: 0.282 ± 0.248
0.0CysPhe: 0.0 ± 0.0
0.282CysGly: 0.282 ± 0.248
0.282CysHis: 0.282 ± 0.263
0.0CysIle: 0.0 ± 0.0
0.564CysLys: 0.564 ± 0.351
1.128CysLeu: 1.128 ± 0.538
0.0CysMet: 0.0 ± 0.0
0.282CysAsn: 0.282 ± 0.256
0.846CysPro: 0.846 ± 0.457
0.564CysGln: 0.564 ± 0.319
0.282CysArg: 0.282 ± 0.248
0.564CysSer: 0.564 ± 0.416
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.564CysTyr: 0.564 ± 0.364
0.0CysXaa: 0.0 ± 0.0
Asp
1.975AspAla: 1.975 ± 0.683
1.128AspCys: 1.128 ± 0.622
2.821AspAsp: 2.821 ± 0.672
4.231AspGlu: 4.231 ± 1.157
2.539AspPhe: 2.539 ± 0.784
2.257AspGly: 2.257 ± 0.768
1.128AspHis: 1.128 ± 0.585
7.052AspIle: 7.052 ± 1.251
6.206AspLys: 6.206 ± 1.217
4.795AspLeu: 4.795 ± 1.051
1.975AspMet: 1.975 ± 0.747
1.693AspAsn: 1.693 ± 0.659
0.846AspPro: 0.846 ± 0.456
1.41AspGln: 1.41 ± 0.622
2.539AspArg: 2.539 ± 0.577
3.103AspSer: 3.103 ± 1.081
5.36AspThr: 5.36 ± 1.138
1.41AspVal: 1.41 ± 0.719
0.282AspTrp: 0.282 ± 0.277
3.667AspTyr: 3.667 ± 1.154
0.0AspXaa: 0.0 ± 0.0
Glu
6.77GluAla: 6.77 ± 1.02
1.693GluCys: 1.693 ± 0.744
7.052GluAsp: 7.052 ± 1.698
5.642GluGlu: 5.642 ± 1.186
2.257GluPhe: 2.257 ± 0.815
1.975GluGly: 1.975 ± 0.7
2.257GluHis: 2.257 ± 0.759
5.642GluIle: 5.642 ± 1.379
5.36GluLys: 5.36 ± 0.989
9.027GluLeu: 9.027 ± 1.217
1.975GluMet: 1.975 ± 0.591
3.103GluAsn: 3.103 ± 0.771
0.846GluPro: 0.846 ± 0.352
3.667GluGln: 3.667 ± 1.223
4.231GluArg: 4.231 ± 1.274
1.128GluSer: 1.128 ± 0.548
3.667GluThr: 3.667 ± 1.071
3.103GluVal: 3.103 ± 0.727
0.846GluTrp: 0.846 ± 0.435
4.231GluTyr: 4.231 ± 1.17
0.0GluXaa: 0.0 ± 0.0
Phe
1.41PheAla: 1.41 ± 0.432
0.0PheCys: 0.0 ± 0.0
3.667PheAsp: 3.667 ± 0.639
3.103PheGlu: 3.103 ± 0.818
2.539PhePhe: 2.539 ± 1.076
1.41PheGly: 1.41 ± 0.46
1.975PheHis: 1.975 ± 0.459
2.257PheIle: 2.257 ± 0.594
5.078PheLys: 5.078 ± 0.944
3.103PheLeu: 3.103 ± 0.903
0.0PheMet: 0.0 ± 0.0
1.975PheAsn: 1.975 ± 0.767
1.128PhePro: 1.128 ± 0.658
0.846PheGln: 0.846 ± 0.377
1.693PheArg: 1.693 ± 0.645
2.539PheSer: 2.539 ± 0.644
3.103PheThr: 3.103 ± 0.714
2.257PheVal: 2.257 ± 0.575
0.282PheTrp: 0.282 ± 0.235
1.693PheTyr: 1.693 ± 0.675
0.0PheXaa: 0.0 ± 0.0
Gly
2.257GlyAla: 2.257 ± 1.067
0.0GlyCys: 0.0 ± 0.0
4.231GlyAsp: 4.231 ± 1.033
1.975GlyGlu: 1.975 ± 0.515
1.975GlyPhe: 1.975 ± 0.764
3.385GlyGly: 3.385 ± 0.993
1.128GlyHis: 1.128 ± 0.613
2.257GlyIle: 2.257 ± 0.945
3.667GlyLys: 3.667 ± 0.876
6.206GlyLeu: 6.206 ± 1.429
1.128GlyMet: 1.128 ± 0.43
2.257GlyAsn: 2.257 ± 0.688
0.0GlyPro: 0.0 ± 0.0
2.821GlyGln: 2.821 ± 1.308
2.539GlyArg: 2.539 ± 0.826
1.693GlySer: 1.693 ± 0.744
2.539GlyThr: 2.539 ± 0.661
3.667GlyVal: 3.667 ± 0.788
0.282GlyTrp: 0.282 ± 0.235
3.103GlyTyr: 3.103 ± 0.727
0.0GlyXaa: 0.0 ± 0.0
His
3.103HisAla: 3.103 ± 1.217
0.0HisCys: 0.0 ± 0.0
0.564HisAsp: 0.564 ± 0.408
0.564HisGlu: 0.564 ± 0.329
0.282HisPhe: 0.282 ± 0.256
0.846HisGly: 0.846 ± 0.418
0.282HisHis: 0.282 ± 0.235
1.693HisIle: 1.693 ± 0.567
1.975HisLys: 1.975 ± 0.78
2.257HisLeu: 2.257 ± 0.934
0.282HisMet: 0.282 ± 0.26
1.41HisAsn: 1.41 ± 0.658
0.846HisPro: 0.846 ± 0.369
1.41HisGln: 1.41 ± 0.551
0.564HisArg: 0.564 ± 0.423
0.564HisSer: 0.564 ± 0.386
1.128HisThr: 1.128 ± 0.344
0.564HisVal: 0.564 ± 0.324
0.564HisTrp: 0.564 ± 0.319
1.975HisTyr: 1.975 ± 0.622
0.0HisXaa: 0.0 ± 0.0
Ile
6.77IleAla: 6.77 ± 1.396
0.282IleCys: 0.282 ± 0.263
6.206IleAsp: 6.206 ± 1.358
4.795IleGlu: 4.795 ± 1.153
2.539IlePhe: 2.539 ± 0.734
1.975IleGly: 1.975 ± 0.53
1.41IleHis: 1.41 ± 0.799
4.231IleIle: 4.231 ± 1.156
9.309IleLys: 9.309 ± 1.539
4.231IleLeu: 4.231 ± 0.819
1.693IleMet: 1.693 ± 0.628
3.667IleAsn: 3.667 ± 0.822
2.821IlePro: 2.821 ± 0.976
2.539IleGln: 2.539 ± 0.763
3.385IleArg: 3.385 ± 0.761
5.36IleSer: 5.36 ± 1.235
5.078IleThr: 5.078 ± 1.238
2.821IleVal: 2.821 ± 0.655
0.282IleTrp: 0.282 ± 0.256
1.975IleTyr: 1.975 ± 0.771
0.0IleXaa: 0.0 ± 0.0
Lys
7.334LysAla: 7.334 ± 1.318
0.282LysCys: 0.282 ± 0.283
4.513LysAsp: 4.513 ± 1.045
10.719LysGlu: 10.719 ± 1.772
2.821LysPhe: 2.821 ± 0.751
4.231LysGly: 4.231 ± 1.495
2.821LysHis: 2.821 ± 0.682
4.795LysIle: 4.795 ± 1.105
7.052LysLys: 7.052 ± 1.618
7.616LysLeu: 7.616 ± 1.736
1.975LysMet: 1.975 ± 0.796
5.078LysAsn: 5.078 ± 1.225
5.642LysPro: 5.642 ± 1.299
3.667LysGln: 3.667 ± 1.346
4.795LysArg: 4.795 ± 1.005
3.949LysSer: 3.949 ± 0.92
6.488LysThr: 6.488 ± 1.439
6.488LysVal: 6.488 ± 1.778
0.282LysTrp: 0.282 ± 0.263
2.821LysTyr: 2.821 ± 0.779
0.0LysXaa: 0.0 ± 0.0
Leu
4.231LeuAla: 4.231 ± 1.13
0.282LeuCys: 0.282 ± 0.256
5.642LeuAsp: 5.642 ± 1.125
10.437LeuGlu: 10.437 ± 1.164
4.513LeuPhe: 4.513 ± 1.028
5.642LeuGly: 5.642 ± 1.466
1.41LeuHis: 1.41 ± 0.644
7.898LeuIle: 7.898 ± 1.455
10.437LeuLys: 10.437 ± 1.429
9.591LeuLeu: 9.591 ± 1.705
2.257LeuMet: 2.257 ± 0.691
5.642LeuAsn: 5.642 ± 1.521
4.513LeuPro: 4.513 ± 1.129
2.821LeuGln: 2.821 ± 0.638
1.975LeuArg: 1.975 ± 0.585
8.181LeuSer: 8.181 ± 1.938
4.795LeuThr: 4.795 ± 0.809
3.667LeuVal: 3.667 ± 1.08
1.128LeuTrp: 1.128 ± 0.413
5.36LeuTyr: 5.36 ± 1.026
0.0LeuXaa: 0.0 ± 0.0
Met
2.257MetAla: 2.257 ± 1.054
0.0MetCys: 0.0 ± 0.0
1.128MetAsp: 1.128 ± 0.48
0.564MetGlu: 0.564 ± 0.438
0.564MetPhe: 0.564 ± 0.409
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.41MetIle: 1.41 ± 0.528
2.821MetLys: 2.821 ± 0.716
2.821MetLeu: 2.821 ± 0.723
0.282MetMet: 0.282 ± 0.319
1.975MetAsn: 1.975 ± 0.557
0.282MetPro: 0.282 ± 0.245
0.282MetGln: 0.282 ± 0.319
1.975MetArg: 1.975 ± 0.745
1.693MetSer: 1.693 ± 0.766
3.667MetThr: 3.667 ± 0.951
0.564MetVal: 0.564 ± 0.33
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.667AsnAla: 3.667 ± 0.967
0.0AsnCys: 0.0 ± 0.0
1.41AsnAsp: 1.41 ± 0.636
3.667AsnGlu: 3.667 ± 0.932
0.564AsnPhe: 0.564 ± 0.374
4.513AsnGly: 4.513 ± 1.025
1.128AsnHis: 1.128 ± 0.429
2.821AsnIle: 2.821 ± 0.977
4.231AsnLys: 4.231 ± 0.902
5.642AsnLeu: 5.642 ± 1.026
1.128AsnMet: 1.128 ± 0.494
2.539AsnAsn: 2.539 ± 0.8
2.539AsnPro: 2.539 ± 0.685
2.821AsnGln: 2.821 ± 1.031
3.949AsnArg: 3.949 ± 0.872
2.257AsnSer: 2.257 ± 0.697
2.257AsnThr: 2.257 ± 0.756
2.539AsnVal: 2.539 ± 0.749
0.564AsnTrp: 0.564 ± 0.369
2.257AsnTyr: 2.257 ± 0.662
0.0AsnXaa: 0.0 ± 0.0
Pro
0.282ProAla: 0.282 ± 0.301
0.282ProCys: 0.282 ± 0.312
1.975ProAsp: 1.975 ± 0.751
3.949ProGlu: 3.949 ± 0.923
1.693ProPhe: 1.693 ± 0.678
0.282ProGly: 0.282 ± 0.248
0.0ProHis: 0.0 ± 0.0
1.693ProIle: 1.693 ± 0.741
5.924ProLys: 5.924 ± 1.112
2.539ProLeu: 2.539 ± 0.798
0.282ProMet: 0.282 ± 0.248
2.257ProAsn: 2.257 ± 0.675
0.564ProPro: 0.564 ± 0.361
0.846ProGln: 0.846 ± 0.474
2.257ProArg: 2.257 ± 0.869
1.128ProSer: 1.128 ± 0.462
3.385ProThr: 3.385 ± 0.717
1.975ProVal: 1.975 ± 0.651
0.0ProTrp: 0.0 ± 0.0
1.693ProTyr: 1.693 ± 0.684
0.0ProXaa: 0.0 ± 0.0
Gln
2.539GlnAla: 2.539 ± 0.664
0.282GlnCys: 0.282 ± 0.27
1.41GlnAsp: 1.41 ± 0.487
2.821GlnGlu: 2.821 ± 0.833
1.41GlnPhe: 1.41 ± 0.484
2.539GlnGly: 2.539 ± 0.729
0.846GlnHis: 0.846 ± 0.37
2.821GlnIle: 2.821 ± 0.676
3.667GlnLys: 3.667 ± 1.081
6.77GlnLeu: 6.77 ± 1.193
1.41GlnMet: 1.41 ± 0.89
1.41GlnAsn: 1.41 ± 0.671
1.693GlnPro: 1.693 ± 0.728
2.257GlnGln: 2.257 ± 0.499
3.385GlnArg: 3.385 ± 0.727
2.539GlnSer: 2.539 ± 0.792
1.41GlnThr: 1.41 ± 0.551
3.103GlnVal: 3.103 ± 1.036
0.0GlnTrp: 0.0 ± 0.0
0.846GlnTyr: 0.846 ± 0.404
0.0GlnXaa: 0.0 ± 0.0
Arg
1.693ArgAla: 1.693 ± 0.43
0.564ArgCys: 0.564 ± 0.335
2.539ArgAsp: 2.539 ± 0.757
2.257ArgGlu: 2.257 ± 0.693
1.975ArgPhe: 1.975 ± 0.712
2.821ArgGly: 2.821 ± 0.835
1.41ArgHis: 1.41 ± 0.64
2.257ArgIle: 2.257 ± 0.659
6.206ArgLys: 6.206 ± 1.287
5.078ArgLeu: 5.078 ± 1.001
0.846ArgMet: 0.846 ± 0.446
1.41ArgAsn: 1.41 ± 0.624
1.128ArgPro: 1.128 ± 0.416
3.949ArgGln: 3.949 ± 0.853
2.539ArgArg: 2.539 ± 0.9
2.257ArgSer: 2.257 ± 0.8
2.821ArgThr: 2.821 ± 0.736
4.231ArgVal: 4.231 ± 0.839
0.564ArgTrp: 0.564 ± 0.388
3.949ArgTyr: 3.949 ± 1.021
0.0ArgXaa: 0.0 ± 0.0
Ser
3.385SerAla: 3.385 ± 0.957
0.282SerCys: 0.282 ± 0.248
3.385SerAsp: 3.385 ± 0.779
3.949SerGlu: 3.949 ± 1.151
2.257SerPhe: 2.257 ± 0.647
2.257SerGly: 2.257 ± 0.792
0.846SerHis: 0.846 ± 0.361
4.513SerIle: 4.513 ± 0.972
5.36SerLys: 5.36 ± 1.031
6.488SerLeu: 6.488 ± 1.13
0.564SerMet: 0.564 ± 0.408
1.975SerAsn: 1.975 ± 0.798
1.41SerPro: 1.41 ± 0.835
3.103SerGln: 3.103 ± 0.773
1.975SerArg: 1.975 ± 0.685
1.975SerSer: 1.975 ± 0.542
3.385SerThr: 3.385 ± 0.92
3.667SerVal: 3.667 ± 1.083
0.564SerTrp: 0.564 ± 0.423
2.257SerTyr: 2.257 ± 0.769
0.0SerXaa: 0.0 ± 0.0
Thr
4.231ThrAla: 4.231 ± 0.892
0.0ThrCys: 0.0 ± 0.0
1.975ThrAsp: 1.975 ± 0.549
4.513ThrGlu: 4.513 ± 1.023
3.103ThrPhe: 3.103 ± 1.117
5.924ThrGly: 5.924 ± 0.953
1.693ThrHis: 1.693 ± 0.587
3.949ThrIle: 3.949 ± 1.067
2.539ThrLys: 2.539 ± 0.8
7.334ThrLeu: 7.334 ± 1.627
1.41ThrMet: 1.41 ± 0.609
1.41ThrAsn: 1.41 ± 0.646
3.385ThrPro: 3.385 ± 0.948
2.539ThrGln: 2.539 ± 0.887
3.385ThrArg: 3.385 ± 1.042
3.385ThrSer: 3.385 ± 0.901
4.795ThrThr: 4.795 ± 1.399
4.231ThrVal: 4.231 ± 0.941
0.846ThrTrp: 0.846 ± 0.462
4.231ThrTyr: 4.231 ± 1.323
0.0ThrXaa: 0.0 ± 0.0
Val
2.821ValAla: 2.821 ± 0.967
0.564ValCys: 0.564 ± 0.374
1.975ValAsp: 1.975 ± 0.667
2.539ValGlu: 2.539 ± 1.003
2.539ValPhe: 2.539 ± 0.618
1.975ValGly: 1.975 ± 0.622
0.564ValHis: 0.564 ± 0.387
7.052ValIle: 7.052 ± 1.184
4.513ValLys: 4.513 ± 1.058
5.642ValLeu: 5.642 ± 1.363
1.693ValMet: 1.693 ± 0.669
3.385ValAsn: 3.385 ± 0.95
0.846ValPro: 0.846 ± 0.469
2.257ValGln: 2.257 ± 0.81
1.975ValArg: 1.975 ± 0.633
3.949ValSer: 3.949 ± 0.943
3.385ValThr: 3.385 ± 1.441
2.539ValVal: 2.539 ± 0.776
0.564ValTrp: 0.564 ± 0.469
3.103ValTyr: 3.103 ± 0.964
0.0ValXaa: 0.0 ± 0.0
Trp
0.282TrpAla: 0.282 ± 0.264
0.0TrpCys: 0.0 ± 0.0
0.846TrpAsp: 0.846 ± 0.465
0.846TrpGlu: 0.846 ± 0.447
0.0TrpPhe: 0.0 ± 0.0
0.282TrpGly: 0.282 ± 0.245
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.846TrpLys: 0.846 ± 0.518
1.693TrpLeu: 1.693 ± 0.747
0.0TrpMet: 0.0 ± 0.0
0.282TrpAsn: 0.282 ± 0.264
0.282TrpPro: 0.282 ± 0.235
0.564TrpGln: 0.564 ± 0.364
0.564TrpArg: 0.564 ± 0.361
0.564TrpSer: 0.564 ± 0.358
0.0TrpThr: 0.0 ± 0.0
1.41TrpVal: 1.41 ± 0.567
0.282TrpTrp: 0.282 ± 0.264
0.282TrpTyr: 0.282 ± 0.283
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.41TyrAla: 1.41 ± 0.511
0.282TyrCys: 0.282 ± 0.263
1.975TyrAsp: 1.975 ± 0.621
3.385TyrGlu: 3.385 ± 0.913
2.821TyrPhe: 2.821 ± 0.734
2.821TyrGly: 2.821 ± 0.793
1.693TyrHis: 1.693 ± 0.488
1.975TyrIle: 1.975 ± 0.742
3.385TyrLys: 3.385 ± 0.815
3.949TyrLeu: 3.949 ± 0.839
1.41TyrMet: 1.41 ± 0.694
5.36TyrAsn: 5.36 ± 1.117
1.41TyrPro: 1.41 ± 0.775
3.103TyrGln: 3.103 ± 0.944
3.949TyrArg: 3.949 ± 1.197
2.539TyrSer: 2.539 ± 0.766
3.103TyrThr: 3.103 ± 0.656
1.693TyrVal: 1.693 ± 0.592
0.564TyrTrp: 0.564 ± 0.496
3.385TyrTyr: 3.385 ± 0.93
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3546 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski