Amino acid dipeptide frequency for Streptococcus satellite phage Javan109

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred in proteins randomly and with equal probability, each of the 400 would therefore be present at a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
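
As a rough illustration of how such a table can be computed, the sketch below counts adjacent residue pairs in one-letter protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: frequencies are normalized by the total number of adjacent
    pairs; pairs are never counted across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not sequences from this proteome.
toy = ["MKKLLE", "MKELLK"]
freqs = dipeptide_permille(toy)

# Dipeptides above / below the uniform expectation (cf. red / blue marking).
over = sorted(d for d, f in freqs.items() if f > UNIFORM_EXPECTATION)
under = sorted(d for d, f in freqs.items() if f < UNIFORM_EXPECTATION)
print(len(freqs), "dipeptides,", len(over), "over,", len(under), "under")
```

Comparing against a fixed 2.5‰ threshold follows the uniform expectation described above. Whether the page applies exactly this cut-off for its colouring is not stated.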

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.648 ± 0.708
AlaCys: 0.663 ± 0.474
AlaAsp: 5.307 ± 1.1
AlaGlu: 5.97 ± 1.098
AlaPhe: 1.99 ± 0.756
AlaGly: 1.99 ± 0.582
AlaHis: 0.663 ± 0.488
AlaIle: 6.633 ± 1.829
AlaLys: 2.985 ± 0.734
AlaLeu: 2.653 ± 0.732
AlaMet: 1.99 ± 0.946
AlaAsn: 2.322 ± 1.072
AlaPro: 0.995 ± 0.598
AlaGln: 1.99 ± 0.844
AlaArg: 4.643 ± 1.177
AlaSer: 3.317 ± 0.84
AlaThr: 2.322 ± 1.084
AlaVal: 2.322 ± 0.814
AlaTrp: 0.332 ± 0.313
AlaTyr: 2.985 ± 1.223
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.332 ± 0.332
CysGly: 0.332 ± 0.283
CysHis: 0.0 ± 0.0
CysIle: 0.663 ± 0.448
CysLys: 0.332 ± 0.298
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.663 ± 0.366
CysGln: 0.332 ± 0.335
CysArg: 0.332 ± 0.252
CysSer: 0.332 ± 0.298
CysThr: 0.663 ± 0.418
CysVal: 0.332 ± 0.326
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.99 ± 0.635
AspCys: 0.663 ± 0.378
AspAsp: 2.985 ± 0.774
AspGlu: 2.322 ± 0.9
AspPhe: 1.99 ± 1.341
AspGly: 2.653 ± 1.192
AspHis: 0.332 ± 0.252
AspIle: 7.96 ± 1.66
AspLys: 8.624 ± 1.036
AspLeu: 5.307 ± 1.06
AspMet: 0.995 ± 0.478
AspAsn: 3.317 ± 0.993
AspPro: 0.663 ± 0.418
AspGln: 0.995 ± 0.581
AspArg: 1.327 ± 0.52
AspSer: 2.653 ± 1.147
AspThr: 2.985 ± 0.915
AspVal: 3.98 ± 0.967
AspTrp: 0.332 ± 0.39
AspTyr: 3.98 ± 1.232
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.98 ± 1.394
GluCys: 0.663 ± 0.468
GluAsp: 2.985 ± 0.701
GluGlu: 5.638 ± 1.8
GluPhe: 2.322 ± 0.869
GluGly: 2.653 ± 0.867
GluHis: 0.332 ± 0.303
GluIle: 7.96 ± 1.514
GluLys: 7.96 ± 1.34
GluLeu: 12.604 ± 2.35
GluMet: 1.99 ± 0.908
GluAsn: 5.97 ± 1.308
GluPro: 2.322 ± 0.907
GluGln: 4.975 ± 1.004
GluArg: 3.648 ± 1.295
GluSer: 3.317 ± 1.112
GluThr: 4.975 ± 1.012
GluVal: 1.99 ± 0.767
GluTrp: 1.327 ± 0.771
GluTyr: 3.648 ± 1.185
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.658 ± 0.827
PheCys: 0.0 ± 0.0
PheAsp: 3.317 ± 1.29
PheGlu: 3.98 ± 1.253
PhePhe: 1.99 ± 0.935
PheGly: 2.653 ± 0.918
PheHis: 0.663 ± 0.376
PheIle: 2.985 ± 0.983
PheLys: 2.985 ± 0.886
PheLeu: 3.317 ± 1.063
PheMet: 1.99 ± 0.616
PheAsn: 0.332 ± 0.285
PhePro: 0.332 ± 0.298
PheGln: 0.663 ± 0.409
PheArg: 1.658 ± 0.699
PheSer: 4.312 ± 1.087
PheThr: 1.327 ± 0.584
PheVal: 1.327 ± 0.77
PheTrp: 0.663 ± 0.397
PheTyr: 0.663 ± 0.438
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.317 ± 0.916
GlyCys: 0.332 ± 0.252
GlyAsp: 1.99 ± 0.756
GlyGlu: 3.98 ± 1.405
GlyPhe: 1.658 ± 0.741
GlyGly: 2.985 ± 1.039
GlyHis: 0.995 ± 0.513
GlyIle: 4.643 ± 0.999
GlyLys: 4.312 ± 1.198
GlyLeu: 4.643 ± 1.422
GlyMet: 1.327 ± 0.54
GlyAsn: 2.985 ± 0.939
GlyPro: 0.663 ± 0.595
GlyGln: 3.648 ± 0.845
GlyArg: 0.995 ± 0.511
GlySer: 2.985 ± 0.806
GlyThr: 1.327 ± 0.539
GlyVal: 3.648 ± 0.985
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.322 ± 1.108
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.663 ± 0.504
HisCys: 0.0 ± 0.0
HisAsp: 0.663 ± 0.421
HisGlu: 0.332 ± 0.303
HisPhe: 0.663 ± 0.384
HisGly: 1.658 ± 0.706
HisHis: 0.663 ± 0.398
HisIle: 0.995 ± 0.576
HisLys: 1.99 ± 0.723
HisLeu: 1.99 ± 0.752
HisMet: 0.332 ± 0.39
HisAsn: 1.327 ± 0.556
HisPro: 0.0 ± 0.0
HisGln: 0.995 ± 0.571
HisArg: 0.663 ± 0.437
HisSer: 0.995 ± 0.549
HisThr: 0.332 ± 0.252
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.995 ± 0.513
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.99 ± 0.65
IleCys: 0.0 ± 0.0
IleAsp: 6.302 ± 1.798
IleGlu: 6.965 ± 1.185
IlePhe: 3.317 ± 0.939
IleGly: 6.302 ± 1.426
IleHis: 1.327 ± 0.844
IleIle: 7.297 ± 2.089
IleLys: 9.619 ± 1.87
IleLeu: 8.955 ± 1.585
IleMet: 1.99 ± 0.602
IleAsn: 4.643 ± 0.941
IlePro: 1.99 ± 0.658
IleGln: 2.985 ± 1.172
IleArg: 2.322 ± 0.747
IleSer: 9.619 ± 2.596
IleThr: 4.312 ± 1.066
IleVal: 4.975 ± 1.399
IleTrp: 1.658 ± 0.775
IleTyr: 2.322 ± 0.739
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.965 ± 1.743
LysCys: 0.332 ± 0.283
LysAsp: 4.312 ± 1.291
LysGlu: 10.945 ± 2.384
LysPhe: 2.985 ± 0.803
LysGly: 2.653 ± 0.904
LysHis: 1.99 ± 0.652
LysIle: 6.633 ± 1.457
LysLys: 11.609 ± 2.357
LysLeu: 10.614 ± 2.02
LysMet: 2.322 ± 0.83
LysAsn: 6.633 ± 1.276
LysPro: 1.99 ± 0.933
LysGln: 6.302 ± 1.104
LysArg: 7.629 ± 1.269
LysSer: 4.643 ± 1.104
LysThr: 6.302 ± 1.33
LysVal: 3.317 ± 0.769
LysTrp: 1.327 ± 0.844
LysTyr: 3.98 ± 1.39
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.624 ± 1.389
LeuCys: 0.332 ± 0.326
LeuAsp: 7.96 ± 1.541
LeuGlu: 10.614 ± 2.678
LeuPhe: 4.312 ± 1.572
LeuGly: 4.643 ± 0.832
LeuHis: 0.995 ± 0.533
LeuIle: 4.643 ± 1.41
LeuLys: 9.619 ± 1.493
LeuLeu: 9.95 ± 1.544
LeuMet: 1.99 ± 0.655
LeuAsn: 5.638 ± 1.138
LeuPro: 3.317 ± 1.013
LeuGln: 2.653 ± 0.638
LeuArg: 3.648 ± 1.008
LeuSer: 6.302 ± 1.573
LeuThr: 6.965 ± 1.233
LeuVal: 2.985 ± 1.077
LeuTrp: 0.995 ± 0.519
LeuTyr: 3.648 ± 0.968
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.995 ± 0.509
MetCys: 0.332 ± 0.332
MetAsp: 0.995 ± 0.618
MetGlu: 1.327 ± 0.566
MetPhe: 0.0 ± 0.0
MetGly: 0.995 ± 0.624
MetHis: 0.332 ± 0.313
MetIle: 1.99 ± 0.951
MetLys: 1.99 ± 0.731
MetLeu: 3.648 ± 1.007
MetMet: 0.332 ± 0.332
MetAsn: 1.99 ± 1.023
MetPro: 0.332 ± 0.303
MetGln: 1.99 ± 0.858
MetArg: 1.327 ± 0.586
MetSer: 1.99 ± 0.871
MetThr: 2.322 ± 0.696
MetVal: 0.663 ± 0.366
MetTrp: 0.0 ± 0.0
MetTyr: 0.663 ± 0.474
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.653 ± 0.957
AsnCys: 0.0 ± 0.0
AsnAsp: 2.985 ± 0.884
AsnGlu: 4.975 ± 1.451
AsnPhe: 0.995 ± 0.486
AsnGly: 3.317 ± 1.178
AsnHis: 0.332 ± 0.39
AsnIle: 4.975 ± 1.386
AsnLys: 5.638 ± 1.135
AsnLeu: 5.638 ± 1.428
AsnMet: 1.327 ± 0.676
AsnAsn: 3.317 ± 1.155
AsnPro: 1.658 ± 0.737
AsnGln: 2.653 ± 0.744
AsnArg: 2.322 ± 0.654
AsnSer: 5.638 ± 1.151
AsnThr: 2.322 ± 1.048
AsnVal: 2.653 ± 0.89
AsnTrp: 1.327 ± 0.765
AsnTyr: 1.327 ± 0.524
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.995 ± 0.48
ProCys: 0.0 ± 0.0
ProAsp: 0.0 ± 0.0
ProGlu: 1.99 ± 0.831
ProPhe: 1.658 ± 0.645
ProGly: 0.663 ± 0.418
ProHis: 0.663 ± 0.398
ProIle: 2.322 ± 0.915
ProLys: 3.648 ± 0.897
ProLeu: 2.322 ± 0.914
ProMet: 0.332 ± 0.283
ProAsn: 1.327 ± 0.786
ProPro: 0.663 ± 0.47
ProGln: 0.995 ± 0.573
ProArg: 0.995 ± 0.439
ProSer: 1.327 ± 0.615
ProThr: 0.995 ± 0.689
ProVal: 2.322 ± 0.772
ProTrp: 0.332 ± 0.298
ProTyr: 0.663 ± 0.432
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.98 ± 1.247
GlnCys: 0.0 ± 0.0
GlnAsp: 0.663 ± 0.503
GlnGlu: 4.975 ± 1.443
GlnPhe: 1.327 ± 0.511
GlnGly: 3.648 ± 1.195
GlnHis: 0.995 ± 0.554
GlnIle: 4.312 ± 0.663
GlnLys: 5.97 ± 1.215
GlnLeu: 2.985 ± 0.787
GlnMet: 0.995 ± 0.518
GlnAsn: 2.653 ± 1.043
GlnPro: 1.99 ± 0.814
GlnGln: 2.653 ± 0.849
GlnArg: 1.99 ± 0.697
GlnSer: 4.975 ± 1.116
GlnThr: 1.99 ± 0.814
GlnVal: 0.995 ± 0.517
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.658 ± 0.795
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.99 ± 0.66
ArgCys: 0.0 ± 0.0
ArgAsp: 2.653 ± 0.66
ArgGlu: 4.312 ± 1.353
ArgPhe: 0.663 ± 0.468
ArgGly: 1.658 ± 0.889
ArgHis: 0.995 ± 0.473
ArgIle: 3.98 ± 1.02
ArgLys: 4.975 ± 1.259
ArgLeu: 4.643 ± 0.985
ArgMet: 0.995 ± 0.623
ArgAsn: 2.653 ± 0.997
ArgPro: 0.663 ± 0.461
ArgGln: 4.312 ± 1.021
ArgArg: 1.99 ± 0.674
ArgSer: 1.327 ± 0.568
ArgThr: 2.653 ± 1.03
ArgVal: 1.327 ± 0.884
ArgTrp: 0.332 ± 0.356
ArgTyr: 0.995 ± 0.556
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.985 ± 0.758
SerCys: 0.332 ± 0.283
SerAsp: 3.98 ± 1.009
SerGlu: 4.312 ± 1.367
SerPhe: 2.653 ± 1.165
SerGly: 3.317 ± 0.921
SerHis: 1.327 ± 0.774
SerIle: 6.633 ± 1.459
SerLys: 8.624 ± 1.562
SerLeu: 6.965 ± 1.526
SerMet: 0.663 ± 0.431
SerAsn: 3.317 ± 0.795
SerPro: 2.653 ± 0.935
SerGln: 2.985 ± 1.109
SerArg: 2.653 ± 1.178
SerSer: 2.653 ± 0.884
SerThr: 2.985 ± 0.93
SerVal: 3.317 ± 0.713
SerTrp: 0.332 ± 0.252
SerTyr: 4.312 ± 1.01
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.322 ± 0.7
ThrCys: 0.0 ± 0.0
ThrAsp: 3.317 ± 1.005
ThrGlu: 3.317 ± 1.13
ThrPhe: 1.99 ± 0.883
ThrGly: 3.317 ± 0.724
ThrHis: 0.995 ± 0.465
ThrIle: 5.307 ± 1.418
ThrLys: 5.307 ± 1.248
ThrLeu: 4.643 ± 1.299
ThrMet: 1.327 ± 0.585
ThrAsn: 2.653 ± 0.631
ThrPro: 0.995 ± 0.528
ThrGln: 1.327 ± 0.789
ThrArg: 1.99 ± 0.909
ThrSer: 2.322 ± 0.822
ThrThr: 2.985 ± 1.557
ThrVal: 3.98 ± 0.765
ThrTrp: 0.332 ± 0.33
ThrTyr: 1.99 ± 0.585
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.648 ± 1.045
ValCys: 0.0 ± 0.0
ValAsp: 3.648 ± 0.694
ValGlu: 2.653 ± 0.707
ValPhe: 0.995 ± 0.57
ValGly: 1.327 ± 0.753
ValHis: 0.995 ± 0.483
ValIle: 4.312 ± 1.286
ValLys: 1.658 ± 0.846
ValLeu: 2.985 ± 0.989
ValMet: 1.327 ± 0.568
ValAsn: 2.653 ± 1.176
ValPro: 1.658 ± 0.563
ValGln: 1.99 ± 0.819
ValArg: 1.658 ± 0.619
ValSer: 4.975 ± 1.26
ValThr: 1.327 ± 0.567
ValVal: 1.99 ± 0.952
ValTrp: 0.0 ± 0.0
ValTyr: 3.98 ± 1.282
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.995 ± 0.503
TrpCys: 0.0 ± 0.0
TrpAsp: 0.663 ± 0.491
TrpGlu: 0.663 ± 0.398
TrpPhe: 0.663 ± 0.437
TrpGly: 0.332 ± 0.356
TrpHis: 0.0 ± 0.0
TrpIle: 1.99 ± 0.829
TrpLys: 0.332 ± 0.341
TrpLeu: 0.995 ± 0.461
TrpMet: 0.332 ± 0.39
TrpAsn: 0.332 ± 0.303
TrpPro: 0.0 ± 0.0
TrpGln: 0.663 ± 0.402
TrpArg: 0.332 ± 0.283
TrpSer: 0.663 ± 0.376
TrpThr: 0.0 ± 0.0
TrpVal: 0.332 ± 0.285
TrpTrp: 0.332 ± 0.382
TrpTyr: 0.332 ± 0.335
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.322 ± 0.76
TyrCys: 0.663 ± 0.595
TyrAsp: 1.327 ± 0.664
TyrGlu: 1.99 ± 0.491
TyrPhe: 4.312 ± 1.015
TyrGly: 1.658 ± 0.705
TyrHis: 0.663 ± 0.437
TyrIle: 2.653 ± 0.871
TyrLys: 5.307 ± 1.337
TyrLeu: 4.975 ± 1.222
TyrMet: 1.327 ± 0.657
TyrAsn: 2.322 ± 0.757
TyrPro: 0.663 ± 0.664
TyrGln: 3.648 ± 0.99
TyrArg: 0.995 ± 0.548
TyrSer: 2.653 ± 1.049
TyrThr: 1.327 ± 0.63
TyrVal: 1.327 ± 0.681
TyrTrp: 0.332 ± 0.303
TyrTyr: 1.658 ± 0.704
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 23 proteins (3016 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
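
Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The helper names `pair_permille` and `bootstrap_error`, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are choices made here, since the page does not specify the exact estimator.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so residues within a protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not sequences from this proteome.
toy = ["MKKLLE", "MKELLK", "MEELLKK", "MKLLEE"]
point = pair_permille(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LeuLeu: {point:.3f} ± {error:.3f}")
```

A dipeptide that never occurs has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.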

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski