Amino acid dipeptide frequency for Streptococcus satellite phage Javan277

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
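As a rough illustration of what such a calculation can look like, here is a minimal sketch. It is not the code used by Proteome-pI. The function name `dipeptide_permille`, the use of overlapping windows that never span two proteins, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: the denominator (total number of dipeptides)
    is an assumption and may differ from the one used by the database.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short made-up sequences, not taken from this proteome.
freqs = dipeptide_permille(["MKLLE", "LLK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting within each protein separately means the last residue of one protein and the first residue of the next are never treated as a dipeptide.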

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred at random with equal probability, each would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
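The uniform baseline mentioned above can be worked out in a few lines. The sketch below is only an illustration: the function names are hypothetical, and the colouring rule (a plain comparison with the baseline, ignoring the error estimate) is an assumption, since the page does not state the exact rule or whether the 400 or 441 baseline is used.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def colour(observed_permille, expected):
    """Assumed rule: red above the baseline, blue below it."""
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "none"


baseline_400 = expected_permille(20)   # 1000 / 400 = 2.5 per mille (0.25%)
baseline_441 = expected_permille(21)   # 1000 / 441, roughly 2.27 per mille

print(baseline_400, round(baseline_441, 3))
print("LeuLeu", colour(12.212, baseline_400))  # value taken from the table
print("CysGly", colour(0.284, baseline_400))   # value taken from the table
```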

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
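For example, the unit conversion can be sketched as follows. The helper names are hypothetical, and the input value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_leu = 12.212  # LeuLeu frequency in per mille, from the table
print(f"{permille_to_fraction(leu_leu):.6f}")  # 0.012212
print(f"{permille_to_percent(leu_leu):.4f}")   # 1.2212
```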

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error, in ‰, grouped by first residue (second residue in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa)

Ala
AlaAla: 1.704 ± 0.916
AlaCys: 0.568 ± 0.356
AlaAsp: 2.556 ± 0.888
AlaGlu: 4.26 ± 1.179
AlaPhe: 2.84 ± 0.785
AlaGly: 1.988 ± 0.839
AlaHis: 0.568 ± 0.348
AlaIle: 3.976 ± 0.902
AlaLys: 4.544 ± 0.984
AlaLeu: 5.396 ± 1.066
AlaMet: 0.852 ± 0.508
AlaAsn: 2.84 ± 0.773
AlaPro: 1.704 ± 0.747
AlaGln: 1.988 ± 0.77
AlaArg: 2.272 ± 0.811
AlaSer: 3.692 ± 0.912
AlaThr: 1.988 ± 0.838
AlaVal: 2.84 ± 0.851
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.84 ± 0.799
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.852 ± 0.446
CysCys: 0.0 ± 0.0
CysAsp: 0.568 ± 0.506
CysGlu: 0.568 ± 0.555
CysPhe: 0.568 ± 0.552
CysGly: 0.284 ± 0.278
CysHis: 0.568 ± 0.356
CysIle: 0.568 ± 0.404
CysLys: 0.852 ± 0.422
CysLeu: 0.284 ± 0.292
CysMet: 0.284 ± 0.292
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.568 ± 0.398
CysSer: 0.0 ± 0.0
CysThr: 0.284 ± 0.292
CysVal: 0.0 ± 0.0
CysTrp: 0.284 ± 0.317
CysTyr: 0.852 ± 0.469
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.42 ± 0.511
AspCys: 0.852 ± 0.441
AspAsp: 2.556 ± 1.078
AspGlu: 3.124 ± 1.084
AspPhe: 3.124 ± 0.763
AspGly: 2.84 ± 1.037
AspHis: 0.284 ± 0.277
AspIle: 5.68 ± 1.162
AspLys: 5.964 ± 1.411
AspLeu: 5.396 ± 1.639
AspMet: 2.556 ± 0.744
AspAsn: 3.408 ± 0.899
AspPro: 0.852 ± 0.525
AspGln: 1.988 ± 0.888
AspArg: 1.704 ± 0.949
AspSer: 3.408 ± 1.031
AspThr: 3.124 ± 1.18
AspVal: 1.988 ± 0.718
AspTrp: 0.852 ± 0.503
AspTyr: 3.408 ± 0.866
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.976 ± 1.099
GluCys: 0.568 ± 0.434
GluAsp: 4.544 ± 1.432
GluGlu: 6.532 ± 1.942
GluPhe: 3.124 ± 1.105
GluGly: 2.84 ± 1.115
GluHis: 0.852 ± 0.447
GluIle: 7.384 ± 1.696
GluLys: 8.236 ± 1.635
GluLeu: 9.656 ± 1.885
GluMet: 1.704 ± 0.785
GluAsn: 4.544 ± 1.044
GluPro: 1.136 ± 0.429
GluGln: 5.112 ± 1.467
GluArg: 3.976 ± 1.253
GluSer: 4.26 ± 1.184
GluThr: 4.26 ± 0.898
GluVal: 3.408 ± 0.869
GluTrp: 0.568 ± 0.357
GluTyr: 2.84 ± 0.709
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.272 ± 0.805
PheCys: 0.284 ± 0.324
PheAsp: 3.408 ± 0.942
PheGlu: 3.124 ± 0.88
PhePhe: 1.704 ± 0.665
PheGly: 1.136 ± 0.592
PheHis: 0.568 ± 0.337
PheIle: 3.408 ± 0.904
PheLys: 5.112 ± 1.164
PheLeu: 4.26 ± 0.931
PheMet: 1.704 ± 0.57
PheAsn: 2.556 ± 0.881
PhePro: 1.136 ± 0.523
PheGln: 1.42 ± 0.493
PheArg: 1.136 ± 0.6
PheSer: 2.556 ± 0.857
PheThr: 2.84 ± 0.771
PheVal: 0.852 ± 0.524
PheTrp: 0.284 ± 0.278
PheTyr: 1.988 ± 0.711
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.988 ± 0.759
GlyCys: 0.0 ± 0.0
GlyAsp: 1.42 ± 0.528
GlyGlu: 3.692 ± 1.074
GlyPhe: 1.136 ± 0.551
GlyGly: 1.704 ± 0.578
GlyHis: 0.568 ± 0.365
GlyIle: 3.976 ± 1.154
GlyLys: 4.26 ± 1.171
GlyLeu: 5.112 ± 1.357
GlyMet: 1.704 ± 0.775
GlyAsn: 2.272 ± 0.828
GlyPro: 0.284 ± 0.254
GlyGln: 1.704 ± 0.679
GlyArg: 1.988 ± 0.731
GlySer: 2.84 ± 0.823
GlyThr: 3.124 ± 0.834
GlyVal: 3.124 ± 0.989
GlyTrp: 0.568 ± 0.402
GlyTyr: 2.272 ± 0.835
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.136 ± 0.777
HisCys: 0.0 ± 0.0
HisAsp: 1.42 ± 0.694
HisGlu: 0.852 ± 0.424
HisPhe: 0.568 ± 0.341
HisGly: 0.852 ± 0.46
HisHis: 0.852 ± 0.581
HisIle: 3.408 ± 1.07
HisLys: 1.136 ± 0.657
HisLeu: 2.272 ± 0.597
HisMet: 0.0 ± 0.0
HisAsn: 1.136 ± 0.617
HisPro: 0.852 ± 0.49
HisGln: 0.568 ± 0.351
HisArg: 0.852 ± 0.438
HisSer: 1.42 ± 0.514
HisThr: 0.284 ± 0.26
HisVal: 0.852 ± 0.489
HisTrp: 0.0 ± 0.0
HisTyr: 1.42 ± 0.568
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.272 ± 0.839
IleCys: 0.568 ± 0.355
IleAsp: 6.816 ± 1.188
IleGlu: 6.816 ± 1.795
IlePhe: 3.124 ± 1.307
IleGly: 3.976 ± 0.833
IleHis: 1.136 ± 0.554
IleIle: 6.248 ± 1.362
IleLys: 7.952 ± 1.364
IleLeu: 7.668 ± 1.706
IleMet: 2.84 ± 0.994
IleAsn: 4.828 ± 1.007
IlePro: 1.704 ± 0.59
IleGln: 5.112 ± 1.081
IleArg: 4.544 ± 0.737
IleSer: 6.248 ± 1.269
IleThr: 5.112 ± 1.288
IleVal: 3.124 ± 1.136
IleTrp: 0.852 ± 0.445
IleTyr: 3.124 ± 0.977
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.248 ± 1.43
LysCys: 0.568 ± 0.556
LysAsp: 3.408 ± 0.759
LysGlu: 9.656 ± 1.211
LysPhe: 2.272 ± 0.759
LysGly: 3.124 ± 0.987
LysHis: 3.408 ± 1.023
LysIle: 5.964 ± 0.898
LysLys: 9.372 ± 1.551
LysLeu: 7.1 ± 1.508
LysMet: 1.988 ± 0.623
LysAsn: 5.68 ± 1.529
LysPro: 3.692 ± 0.783
LysGln: 4.544 ± 1.339
LysArg: 4.828 ± 1.359
LysSer: 5.68 ± 1.125
LysThr: 7.668 ± 0.937
LysVal: 5.396 ± 1.119
LysTrp: 0.852 ± 0.461
LysTyr: 2.556 ± 0.823
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.964 ± 1.339
LeuCys: 0.852 ± 0.582
LeuAsp: 7.952 ± 1.424
LeuGlu: 10.792 ± 2.026
LeuPhe: 3.124 ± 1.374
LeuGly: 6.532 ± 1.235
LeuHis: 1.704 ± 0.806
LeuIle: 6.248 ± 1.12
LeuLys: 9.656 ± 1.442
LeuLeu: 12.212 ± 1.648
LeuMet: 3.692 ± 0.809
LeuAsn: 5.68 ± 1.033
LeuPro: 3.692 ± 1.052
LeuGln: 5.396 ± 0.978
LeuArg: 1.136 ± 0.603
LeuSer: 5.68 ± 1.262
LeuThr: 5.112 ± 1.026
LeuVal: 4.544 ± 1.38
LeuTrp: 1.42 ± 0.654
LeuTyr: 2.84 ± 0.828
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.988 ± 0.739
MetCys: 0.0 ± 0.0
MetAsp: 1.42 ± 0.671
MetGlu: 3.124 ± 0.707
MetPhe: 1.136 ± 0.511
MetGly: 1.136 ± 0.631
MetHis: 0.568 ± 0.42
MetIle: 1.704 ± 0.633
MetLys: 2.84 ± 0.873
MetLeu: 1.704 ± 0.769
MetMet: 0.0 ± 0.0
MetAsn: 1.988 ± 0.914
MetPro: 0.284 ± 0.278
MetGln: 0.284 ± 0.276
MetArg: 1.704 ± 0.617
MetSer: 1.704 ± 0.832
MetThr: 3.976 ± 0.988
MetVal: 1.704 ± 0.876
MetTrp: 0.284 ± 0.324
MetTyr: 1.704 ± 0.657
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.124 ± 0.928
AsnCys: 0.284 ± 0.26
AsnAsp: 1.988 ± 0.637
AsnGlu: 3.976 ± 1.052
AsnPhe: 1.988 ± 0.779
AsnGly: 3.408 ± 0.941
AsnHis: 1.42 ± 0.548
AsnIle: 4.828 ± 1.127
AsnLys: 4.544 ± 1.116
AsnLeu: 5.396 ± 1.12
AsnMet: 0.852 ± 0.449
AsnAsn: 3.692 ± 0.825
AsnPro: 1.704 ± 0.718
AsnGln: 1.988 ± 0.618
AsnArg: 1.988 ± 0.561
AsnSer: 3.976 ± 0.926
AsnThr: 4.26 ± 1.245
AsnVal: 1.42 ± 0.688
AsnTrp: 1.704 ± 0.699
AsnTyr: 1.704 ± 0.717
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.42 ± 0.517
ProCys: 0.0 ± 0.0
ProAsp: 0.852 ± 0.56
ProGlu: 2.272 ± 0.782
ProPhe: 1.42 ± 0.713
ProGly: 1.136 ± 0.523
ProHis: 0.284 ± 0.314
ProIle: 1.704 ± 0.71
ProLys: 1.42 ± 0.953
ProLeu: 3.124 ± 0.739
ProMet: 0.284 ± 0.273
ProAsn: 1.704 ± 0.649
ProPro: 0.568 ± 0.409
ProGln: 0.568 ± 0.355
ProArg: 1.136 ± 0.642
ProSer: 1.136 ± 0.536
ProThr: 1.42 ± 0.545
ProVal: 1.988 ± 0.886
ProTrp: 0.0 ± 0.0
ProTyr: 0.852 ± 0.518
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.408 ± 1.275
GlnCys: 0.284 ± 0.314
GlnAsp: 2.272 ± 0.644
GlnGlu: 1.704 ± 0.626
GlnPhe: 2.272 ± 0.98
GlnGly: 1.42 ± 0.764
GlnHis: 1.988 ± 0.71
GlnIle: 3.692 ± 1.03
GlnLys: 3.692 ± 1.095
GlnLeu: 3.692 ± 0.83
GlnMet: 1.42 ± 0.595
GlnAsn: 3.408 ± 1.093
GlnPro: 0.284 ± 0.297
GlnGln: 2.272 ± 1.181
GlnArg: 1.988 ± 0.676
GlnSer: 2.84 ± 1.015
GlnThr: 2.84 ± 0.933
GlnVal: 3.408 ± 0.952
GlnTrp: 0.284 ± 0.267
GlnTyr: 2.272 ± 0.763
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.42 ± 0.631
ArgCys: 0.852 ± 0.418
ArgAsp: 1.42 ± 0.566
ArgGlu: 3.976 ± 1.049
ArgPhe: 0.852 ± 0.412
ArgGly: 1.42 ± 0.642
ArgHis: 1.136 ± 0.536
ArgIle: 5.396 ± 1.278
ArgLys: 3.976 ± 0.879
ArgLeu: 5.964 ± 1.394
ArgMet: 1.136 ± 0.658
ArgAsn: 1.704 ± 0.776
ArgPro: 1.136 ± 0.463
ArgGln: 2.556 ± 1.308
ArgArg: 1.988 ± 0.713
ArgSer: 1.988 ± 0.782
ArgThr: 2.556 ± 0.713
ArgVal: 1.136 ± 0.585
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.988 ± 0.757
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.556 ± 0.915
SerCys: 0.284 ± 0.317
SerAsp: 3.692 ± 0.647
SerGlu: 4.828 ± 1.119
SerPhe: 2.84 ± 0.939
SerGly: 2.272 ± 0.609
SerHis: 0.852 ± 0.441
SerIle: 5.112 ± 0.947
SerLys: 5.68 ± 0.995
SerLeu: 6.816 ± 1.236
SerMet: 1.988 ± 0.834
SerAsn: 3.124 ± 1.297
SerPro: 1.136 ± 0.537
SerGln: 1.42 ± 0.548
SerArg: 3.408 ± 1.07
SerSer: 2.556 ± 0.773
SerThr: 4.828 ± 0.869
SerVal: 3.976 ± 1.123
SerTrp: 0.284 ± 0.314
SerTyr: 3.976 ± 1.012
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.556 ± 0.858
ThrCys: 0.568 ± 0.417
ThrAsp: 3.124 ± 0.986
ThrGlu: 5.68 ± 1.428
ThrPhe: 3.124 ± 0.94
ThrGly: 4.544 ± 1.011
ThrHis: 1.136 ± 0.459
ThrIle: 6.248 ± 1.066
ThrLys: 5.396 ± 1.323
ThrLeu: 4.26 ± 1.042
ThrMet: 2.556 ± 0.904
ThrAsn: 2.272 ± 0.791
ThrPro: 1.136 ± 0.517
ThrGln: 2.84 ± 1.025
ThrArg: 1.704 ± 0.87
ThrSer: 3.976 ± 0.861
ThrThr: 2.84 ± 1.051
ThrVal: 5.112 ± 1.512
ThrTrp: 0.284 ± 0.289
ThrTyr: 2.272 ± 0.764
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.556 ± 1.03
ValCys: 0.0 ± 0.0
ValAsp: 2.84 ± 0.723
ValGlu: 1.704 ± 0.684
ValPhe: 3.408 ± 0.737
ValGly: 1.42 ± 0.59
ValHis: 0.0 ± 0.0
ValIle: 4.26 ± 1.134
ValLys: 5.396 ± 1.782
ValLeu: 5.112 ± 1.112
ValMet: 1.704 ± 0.865
ValAsn: 1.136 ± 0.59
ValPro: 1.42 ± 0.605
ValGln: 3.124 ± 0.852
ValArg: 2.272 ± 0.623
ValSer: 3.692 ± 0.904
ValThr: 1.988 ± 0.69
ValVal: 2.84 ± 0.958
ValTrp: 1.136 ± 0.634
ValTyr: 3.408 ± 1.017
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.988 ± 0.595
TrpCys: 0.0 ± 0.0
TrpAsp: 0.568 ± 0.417
TrpGlu: 0.852 ± 0.582
TrpPhe: 0.568 ± 0.355
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.852 ± 0.486
TrpLys: 0.284 ± 0.314
TrpLeu: 1.136 ± 0.502
TrpMet: 0.284 ± 0.262
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.568 ± 0.393
TrpArg: 0.568 ± 0.426
TrpSer: 1.136 ± 0.602
TrpThr: 0.284 ± 0.289
TrpVal: 0.284 ± 0.267
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.284 ± 0.278
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.568 ± 0.384
TyrCys: 0.852 ± 0.469
TyrAsp: 1.704 ± 0.671
TyrGlu: 1.988 ± 0.583
TyrPhe: 2.556 ± 0.949
TyrGly: 1.704 ± 0.833
TyrHis: 1.988 ± 0.659
TyrIle: 3.408 ± 1.288
TyrLys: 3.692 ± 0.947
TyrLeu: 7.668 ± 1.145
TyrMet: 1.42 ± 0.625
TyrAsn: 2.272 ± 0.61
TyrPro: 0.568 ± 0.392
TyrGln: 2.272 ± 0.994
TyrArg: 2.84 ± 0.827
TyrSer: 2.84 ± 0.823
TyrThr: 2.84 ± 0.877
TyrVal: 1.42 ± 0.816
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.704 ± 0.691
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 24 proteins (3522 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
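To make the idea of protein-level bootstrapping concrete, here is a minimal sketch. It is not the database's actual procedure: resampling whole proteins with replacement, using the population standard deviation across replicates as the error, the fixed seed, and the names `dipeptide_permille` and `bootstrap_error` are all assumptions made for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, rounds=100, seed=0):
    """Estimate the spread of each dipeptide frequency.

    Each round draws len(proteins) whole proteins with replacement,
    recomputes the frequencies, and the error is taken to be the
    standard deviation over all rounds (an assumed definition).
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(dipeptide_permille(resampled))
    # Every dipeptide seen in at least one replicate.
    pairs = set().union(*samples)
    return {
        pair: statistics.pstdev([s.get(pair, 0.0) for s in samples])
        for pair in pairs
    }


# Toy input: made-up sequences, not taken from this proteome.
errors = bootstrap_error(["MKLLE", "LLK", "AAAA"])
for pair in sorted(errors):
    print(pair, round(errors[pair], 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the set.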

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski