Amino acid dipepetide frequency for Streptococcus satellite phage Javan631

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.284AlaAla: 0.284 ± 0.238
1.137AlaCys: 1.137 ± 0.827
3.128AlaAsp: 3.128 ± 0.729
5.118AlaGlu: 5.118 ± 1.901
1.99AlaPhe: 1.99 ± 0.47
4.834AlaGly: 4.834 ± 1.85
0.569AlaHis: 0.569 ± 0.29
4.549AlaIle: 4.549 ± 1.461
6.54AlaLys: 6.54 ± 1.54
5.687AlaLeu: 5.687 ± 1.52
2.843AlaMet: 2.843 ± 0.96
3.128AlaAsn: 3.128 ± 1.038
0.0AlaPro: 0.0 ± 0.0
2.275AlaGln: 2.275 ± 1.06
0.853AlaArg: 0.853 ± 0.399
5.687AlaSer: 5.687 ± 2.144
3.412AlaThr: 3.412 ± 0.967
2.843AlaVal: 2.843 ± 0.833
0.853AlaTrp: 0.853 ± 0.493
2.559AlaTyr: 2.559 ± 0.826
0.0AlaXaa: 0.0 ± 0.0
Cys
0.284CysAla: 0.284 ± 0.349
0.284CysCys: 0.284 ± 0.309
0.284CysAsp: 0.284 ± 0.349
0.853CysGlu: 0.853 ± 0.494
0.284CysPhe: 0.284 ± 0.227
0.0CysGly: 0.0 ± 0.0
0.284CysHis: 0.284 ± 0.306
0.853CysIle: 0.853 ± 0.681
0.0CysLys: 0.0 ± 0.0
0.853CysLeu: 0.853 ± 0.635
0.0CysMet: 0.0 ± 0.0
0.284CysAsn: 0.284 ± 0.227
0.0CysPro: 0.0 ± 0.0
0.284CysGln: 0.284 ± 0.309
0.569CysArg: 0.569 ± 0.351
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.284CysVal: 0.284 ± 0.227
0.0CysTrp: 0.0 ± 0.0
0.569CysTyr: 0.569 ± 0.393
0.0CysXaa: 0.0 ± 0.0
Asp
3.128AspAla: 3.128 ± 1.07
0.569AspCys: 0.569 ± 0.387
3.981AspAsp: 3.981 ± 1.492
3.412AspGlu: 3.412 ± 1.197
3.128AspPhe: 3.128 ± 0.931
2.559AspGly: 2.559 ± 0.709
0.284AspHis: 0.284 ± 0.238
6.54AspIle: 6.54 ± 1.752
6.255AspLys: 6.255 ± 2.69
5.687AspLeu: 5.687 ± 1.654
1.706AspMet: 1.706 ± 0.755
5.971AspAsn: 5.971 ± 1.102
0.853AspPro: 0.853 ± 0.5
1.422AspGln: 1.422 ± 0.497
1.99AspArg: 1.99 ± 0.983
3.412AspSer: 3.412 ± 1.181
3.128AspThr: 3.128 ± 0.777
3.696AspVal: 3.696 ± 0.953
0.284AspTrp: 0.284 ± 0.306
2.843AspTyr: 2.843 ± 0.771
0.0AspXaa: 0.0 ± 0.0
Glu
3.412GluAla: 3.412 ± 1.023
0.569GluCys: 0.569 ± 0.293
5.118GluAsp: 5.118 ± 1.58
4.265GluGlu: 4.265 ± 1.401
4.265GluPhe: 4.265 ± 1.017
1.422GluGly: 1.422 ± 0.577
0.853GluHis: 0.853 ± 0.452
5.687GluIle: 5.687 ± 1.32
6.824GluLys: 6.824 ± 2.216
10.236GluLeu: 10.236 ± 2.036
2.559GluMet: 2.559 ± 0.741
5.118GluAsn: 5.118 ± 1.341
2.275GluPro: 2.275 ± 0.929
1.99GluGln: 1.99 ± 0.704
3.981GluArg: 3.981 ± 1.966
4.265GluSer: 4.265 ± 0.935
3.981GluThr: 3.981 ± 1.143
4.549GluVal: 4.549 ± 1.489
0.284GluTrp: 0.284 ± 0.309
3.128GluTyr: 3.128 ± 1.048
0.0GluXaa: 0.0 ± 0.0
Phe
2.275PheAla: 2.275 ± 0.747
0.284PheCys: 0.284 ± 0.309
3.696PheAsp: 3.696 ± 1.118
5.118PheGlu: 5.118 ± 1.029
2.559PhePhe: 2.559 ± 0.649
4.265PheGly: 4.265 ± 0.983
0.853PheHis: 0.853 ± 0.444
2.843PheIle: 2.843 ± 0.693
3.412PheLys: 3.412 ± 0.625
4.549PheLeu: 4.549 ± 1.073
0.569PheMet: 0.569 ± 0.413
1.422PheAsn: 1.422 ± 0.583
1.422PhePro: 1.422 ± 0.604
1.137PheGln: 1.137 ± 0.535
1.99PheArg: 1.99 ± 1.128
2.559PheSer: 2.559 ± 0.872
1.706PheThr: 1.706 ± 0.727
2.559PheVal: 2.559 ± 0.623
0.284PheTrp: 0.284 ± 0.227
0.853PheTyr: 0.853 ± 0.672
0.0PheXaa: 0.0 ± 0.0
Gly
4.549GlyAla: 4.549 ± 1.029
0.853GlyCys: 0.853 ± 0.5
1.706GlyAsp: 1.706 ± 0.489
2.559GlyGlu: 2.559 ± 1.012
3.128GlyPhe: 3.128 ± 1.03
2.275GlyGly: 2.275 ± 1.218
0.284GlyHis: 0.284 ± 0.244
6.255GlyIle: 6.255 ± 1.885
3.412GlyLys: 3.412 ± 0.767
7.108GlyLeu: 7.108 ± 1.383
1.706GlyMet: 1.706 ± 0.624
3.412GlyAsn: 3.412 ± 1.743
0.569GlyPro: 0.569 ± 0.476
2.559GlyGln: 2.559 ± 1.022
0.569GlyArg: 0.569 ± 0.307
3.696GlySer: 3.696 ± 1.666
2.559GlyThr: 2.559 ± 0.728
4.265GlyVal: 4.265 ± 1.157
0.284GlyTrp: 0.284 ± 0.331
1.99GlyTyr: 1.99 ± 1.158
0.0GlyXaa: 0.0 ± 0.0
His
0.569HisAla: 0.569 ± 0.488
0.0HisCys: 0.0 ± 0.0
0.284HisAsp: 0.284 ± 0.306
0.569HisGlu: 0.569 ± 0.293
1.422HisPhe: 1.422 ± 0.88
0.853HisGly: 0.853 ± 0.566
1.137HisHis: 1.137 ± 0.566
1.99HisIle: 1.99 ± 0.949
1.137HisLys: 1.137 ± 0.393
1.422HisLeu: 1.422 ± 0.6
0.284HisMet: 0.284 ± 0.339
0.853HisAsn: 0.853 ± 0.542
0.284HisPro: 0.284 ± 0.309
0.0HisGln: 0.0 ± 0.0
0.569HisArg: 0.569 ± 0.396
1.422HisSer: 1.422 ± 0.669
1.137HisThr: 1.137 ± 0.628
0.0HisVal: 0.0 ± 0.0
0.284HisTrp: 0.284 ± 0.331
0.284HisTyr: 0.284 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
5.687IleAla: 5.687 ± 1.673
0.0IleCys: 0.0 ± 0.0
5.402IleAsp: 5.402 ± 1.339
6.255IleGlu: 6.255 ± 2.066
3.696IlePhe: 3.696 ± 1.218
3.981IleGly: 3.981 ± 1.429
1.137IleHis: 1.137 ± 0.518
5.971IleIle: 5.971 ± 1.262
7.677IleLys: 7.677 ± 1.746
4.549IleLeu: 4.549 ± 1.045
1.422IleMet: 1.422 ± 0.748
3.981IleAsn: 3.981 ± 1.325
1.99IlePro: 1.99 ± 0.709
5.118IleGln: 5.118 ± 1.128
2.559IleArg: 2.559 ± 0.527
7.961IleSer: 7.961 ± 1.591
4.265IleThr: 4.265 ± 1.33
3.412IleVal: 3.412 ± 0.862
0.284IleTrp: 0.284 ± 0.238
1.422IleTyr: 1.422 ± 0.627
0.0IleXaa: 0.0 ± 0.0
Lys
6.54LysAla: 6.54 ± 1.615
0.0LysCys: 0.0 ± 0.0
4.265LysAsp: 4.265 ± 1.402
10.236LysGlu: 10.236 ± 2.079
2.275LysPhe: 2.275 ± 1.044
2.843LysGly: 2.843 ± 0.939
1.422LysHis: 1.422 ± 0.768
7.108LysIle: 7.108 ± 1.113
9.383LysLys: 9.383 ± 2.317
8.53LysLeu: 8.53 ± 2.16
2.559LysMet: 2.559 ± 0.824
6.255LysAsn: 6.255 ± 1.296
1.99LysPro: 1.99 ± 0.667
5.402LysGln: 5.402 ± 1.656
3.412LysArg: 3.412 ± 1.243
6.54LysSer: 6.54 ± 1.405
6.255LysThr: 6.255 ± 1.082
5.971LysVal: 5.971 ± 1.077
0.853LysTrp: 0.853 ± 0.332
4.265LysTyr: 4.265 ± 1.289
0.0LysXaa: 0.0 ± 0.0
Leu
6.824LeuAla: 6.824 ± 1.364
0.569LeuCys: 0.569 ± 0.348
7.961LeuAsp: 7.961 ± 1.405
6.54LeuGlu: 6.54 ± 1.656
3.696LeuPhe: 3.696 ± 1.103
7.393LeuGly: 7.393 ± 2.618
1.137LeuHis: 1.137 ± 0.375
7.677LeuIle: 7.677 ± 1.422
8.814LeuLys: 8.814 ± 1.882
11.373LeuLeu: 11.373 ± 1.877
2.275LeuMet: 2.275 ± 0.838
8.246LeuAsn: 8.246 ± 1.36
3.128LeuPro: 3.128 ± 1.287
5.687LeuGln: 5.687 ± 1.561
3.412LeuArg: 3.412 ± 1.024
6.255LeuSer: 6.255 ± 1.172
5.402LeuThr: 5.402 ± 1.486
5.118LeuVal: 5.118 ± 1.48
0.284LeuTrp: 0.284 ± 0.244
2.559LeuTyr: 2.559 ± 1.145
0.0LeuXaa: 0.0 ± 0.0
Met
1.99MetAla: 1.99 ± 0.686
0.0MetCys: 0.0 ± 0.0
2.275MetAsp: 2.275 ± 0.919
2.843MetGlu: 2.843 ± 0.775
0.569MetPhe: 0.569 ± 0.307
1.706MetGly: 1.706 ± 0.926
0.0MetHis: 0.0 ± 0.0
0.853MetIle: 0.853 ± 0.622
2.843MetLys: 2.843 ± 0.803
2.559MetLeu: 2.559 ± 0.747
1.137MetMet: 1.137 ± 0.525
2.559MetAsn: 2.559 ± 0.841
0.569MetPro: 0.569 ± 0.35
1.422MetGln: 1.422 ± 0.451
1.137MetArg: 1.137 ± 0.729
0.853MetSer: 0.853 ± 0.513
1.99MetThr: 1.99 ± 0.78
1.706MetVal: 1.706 ± 0.622
0.284MetTrp: 0.284 ± 0.349
0.284MetTyr: 0.284 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
4.834AsnAla: 4.834 ± 0.843
0.0AsnCys: 0.0 ± 0.0
3.412AsnAsp: 3.412 ± 1.015
6.824AsnGlu: 6.824 ± 1.496
4.549AsnPhe: 4.549 ± 1.681
4.834AsnGly: 4.834 ± 1.051
1.706AsnHis: 1.706 ± 0.48
2.559AsnIle: 2.559 ± 0.765
8.246AsnLys: 8.246 ± 1.947
8.246AsnLeu: 8.246 ± 2.008
2.275AsnMet: 2.275 ± 1.1
3.412AsnAsn: 3.412 ± 0.904
1.422AsnPro: 1.422 ± 0.463
1.422AsnGln: 1.422 ± 0.851
1.99AsnArg: 1.99 ± 0.628
2.843AsnSer: 2.843 ± 0.645
2.559AsnThr: 2.559 ± 0.933
2.843AsnVal: 2.843 ± 0.813
0.284AsnTrp: 0.284 ± 0.269
3.128AsnTyr: 3.128 ± 1.111
0.0AsnXaa: 0.0 ± 0.0
Pro
0.284ProAla: 0.284 ± 0.244
0.0ProCys: 0.0 ± 0.0
1.99ProAsp: 1.99 ± 0.55
2.275ProGlu: 2.275 ± 0.685
1.422ProPhe: 1.422 ± 0.721
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
1.137ProIle: 1.137 ± 0.41
1.706ProLys: 1.706 ± 0.819
2.559ProLeu: 2.559 ± 0.803
0.569ProMet: 0.569 ± 0.318
1.706ProAsn: 1.706 ± 0.678
0.853ProPro: 0.853 ± 0.494
0.0ProGln: 0.0 ± 0.0
1.137ProArg: 1.137 ± 0.479
1.706ProSer: 1.706 ± 0.734
2.559ProThr: 2.559 ± 0.817
1.422ProVal: 1.422 ± 0.662
0.0ProTrp: 0.0 ± 0.0
0.284ProTyr: 0.284 ± 0.371
0.0ProXaa: 0.0 ± 0.0
Gln
3.981GlnAla: 3.981 ± 1.107
0.284GlnCys: 0.284 ± 0.269
2.275GlnAsp: 2.275 ± 0.848
2.275GlnGlu: 2.275 ± 0.703
1.422GlnPhe: 1.422 ± 0.594
2.559GlnGly: 2.559 ± 0.877
0.284GlnHis: 0.284 ± 0.244
3.128GlnIle: 3.128 ± 1.057
3.696GlnLys: 3.696 ± 1.064
5.118GlnLeu: 5.118 ± 1.174
0.284GlnMet: 0.284 ± 0.331
2.275GlnAsn: 2.275 ± 0.933
1.137GlnPro: 1.137 ± 0.574
1.137GlnGln: 1.137 ± 0.579
1.137GlnArg: 1.137 ± 0.348
2.275GlnSer: 2.275 ± 0.734
1.137GlnThr: 1.137 ± 0.76
1.706GlnVal: 1.706 ± 0.547
0.853GlnTrp: 0.853 ± 0.502
1.99GlnTyr: 1.99 ± 0.678
0.0GlnXaa: 0.0 ± 0.0
Arg
2.275ArgAla: 2.275 ± 0.858
0.284ArgCys: 0.284 ± 0.309
1.706ArgAsp: 1.706 ± 0.58
2.559ArgGlu: 2.559 ± 0.835
1.422ArgPhe: 1.422 ± 0.772
1.706ArgGly: 1.706 ± 0.6
1.706ArgHis: 1.706 ± 0.667
2.843ArgIle: 2.843 ± 1.308
4.265ArgLys: 4.265 ± 1.21
3.696ArgLeu: 3.696 ± 0.73
0.853ArgMet: 0.853 ± 0.728
2.559ArgAsn: 2.559 ± 0.88
0.569ArgPro: 0.569 ± 0.454
2.559ArgGln: 2.559 ± 0.692
0.284ArgArg: 0.284 ± 0.227
0.853ArgSer: 0.853 ± 0.46
2.559ArgThr: 2.559 ± 0.978
1.99ArgVal: 1.99 ± 0.715
0.569ArgTrp: 0.569 ± 0.348
0.853ArgTyr: 0.853 ± 0.401
0.0ArgXaa: 0.0 ± 0.0
Ser
3.981SerAla: 3.981 ± 1.749
0.569SerCys: 0.569 ± 0.407
4.834SerAsp: 4.834 ± 1.083
3.981SerGlu: 3.981 ± 0.931
2.559SerPhe: 2.559 ± 0.593
4.265SerGly: 4.265 ± 1.616
0.853SerHis: 0.853 ± 0.375
5.118SerIle: 5.118 ± 1.147
6.824SerLys: 6.824 ± 2.1
6.255SerLeu: 6.255 ± 2.322
2.559SerMet: 2.559 ± 0.719
3.128SerAsn: 3.128 ± 0.757
1.422SerPro: 1.422 ± 0.781
1.99SerGln: 1.99 ± 0.724
1.706SerArg: 1.706 ± 0.681
3.696SerSer: 3.696 ± 1.461
3.981SerThr: 3.981 ± 1.432
4.265SerVal: 4.265 ± 1.078
0.284SerTrp: 0.284 ± 0.244
4.265SerTyr: 4.265 ± 0.783
0.0SerXaa: 0.0 ± 0.0
Thr
2.559ThrAla: 2.559 ± 1.601
0.0ThrCys: 0.0 ± 0.0
3.128ThrAsp: 3.128 ± 0.612
3.128ThrGlu: 3.128 ± 0.877
1.99ThrPhe: 1.99 ± 0.637
3.128ThrGly: 3.128 ± 0.779
1.137ThrHis: 1.137 ± 0.602
4.265ThrIle: 4.265 ± 1.084
3.412ThrLys: 3.412 ± 0.672
6.824ThrLeu: 6.824 ± 1.749
0.853ThrMet: 0.853 ± 0.556
4.265ThrAsn: 4.265 ± 1.102
1.422ThrPro: 1.422 ± 0.768
1.137ThrGln: 1.137 ± 0.57
2.559ThrArg: 2.559 ± 0.99
3.696ThrSer: 3.696 ± 1.141
1.706ThrThr: 1.706 ± 0.767
3.696ThrVal: 3.696 ± 1.194
0.284ThrTrp: 0.284 ± 0.385
3.412ThrTyr: 3.412 ± 0.595
0.0ThrXaa: 0.0 ± 0.0
Val
3.696ValAla: 3.696 ± 0.805
0.284ValCys: 0.284 ± 0.227
1.706ValAsp: 1.706 ± 0.617
3.981ValGlu: 3.981 ± 1.318
1.706ValPhe: 1.706 ± 0.486
2.843ValGly: 2.843 ± 1.111
0.569ValHis: 0.569 ± 0.351
4.549ValIle: 4.549 ± 1.146
5.971ValLys: 5.971 ± 1.018
5.971ValLeu: 5.971 ± 1.249
0.853ValMet: 0.853 ± 0.463
4.549ValAsn: 4.549 ± 0.761
1.137ValPro: 1.137 ± 0.604
2.559ValGln: 2.559 ± 0.907
1.706ValArg: 1.706 ± 0.729
4.834ValSer: 4.834 ± 1.074
3.696ValThr: 3.696 ± 0.851
1.99ValVal: 1.99 ± 0.73
0.284ValTrp: 0.284 ± 0.227
1.422ValTyr: 1.422 ± 0.747
0.0ValXaa: 0.0 ± 0.0
Trp
0.284TrpAla: 0.284 ± 0.244
0.284TrpCys: 0.284 ± 0.309
0.569TrpAsp: 0.569 ± 0.404
0.853TrpGlu: 0.853 ± 0.449
0.284TrpPhe: 0.284 ± 0.306
0.284TrpGly: 0.284 ± 0.227
0.0TrpHis: 0.0 ± 0.0
0.853TrpIle: 0.853 ± 0.467
0.284TrpLys: 0.284 ± 0.331
0.284TrpLeu: 0.284 ± 0.349
0.284TrpMet: 0.284 ± 0.299
0.569TrpAsn: 0.569 ± 0.39
0.284TrpPro: 0.284 ± 0.306
0.0TrpGln: 0.0 ± 0.0
0.284TrpArg: 0.284 ± 0.227
0.853TrpSer: 0.853 ± 0.337
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.284TrpTrp: 0.284 ± 0.244
0.569TrpTyr: 0.569 ± 0.327
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.853TyrAla: 0.853 ± 0.561
0.0TyrCys: 0.0 ± 0.0
3.128TyrAsp: 3.128 ± 1.066
1.422TyrGlu: 1.422 ± 0.585
1.99TyrPhe: 1.99 ± 0.878
1.99TyrGly: 1.99 ± 0.85
0.284TyrHis: 0.284 ± 0.386
1.99TyrIle: 1.99 ± 0.733
5.402TyrLys: 5.402 ± 1.252
2.843TyrLeu: 2.843 ± 0.983
1.706TyrMet: 1.706 ± 0.481
3.696TyrAsn: 3.696 ± 1.293
0.284TyrPro: 0.284 ± 0.227
0.853TyrGln: 0.853 ± 0.412
4.265TyrArg: 4.265 ± 1.38
3.128TyrSer: 3.128 ± 1.142
0.569TyrThr: 0.569 ± 0.351
1.99TyrVal: 1.99 ± 0.626
0.284TyrTrp: 0.284 ± 0.331
1.422TyrTyr: 1.422 ± 0.668
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski