Amino acid dipepetide frequency for Streptococcus satellite phage Javan171

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.294AlaAla: 0.294 ± 0.371
1.174AlaCys: 1.174 ± 0.49
3.229AlaAsp: 3.229 ± 1.348
4.696AlaGlu: 4.696 ± 1.397
3.229AlaPhe: 3.229 ± 0.773
1.761AlaGly: 1.761 ± 0.732
0.0AlaHis: 0.0 ± 0.0
6.164AlaIle: 6.164 ± 1.277
4.99AlaLys: 4.99 ± 1.089
4.403AlaLeu: 4.403 ± 1.115
1.174AlaMet: 1.174 ± 0.476
4.403AlaAsn: 4.403 ± 0.946
0.587AlaPro: 0.587 ± 0.366
2.642AlaGln: 2.642 ± 0.66
2.348AlaArg: 2.348 ± 0.594
5.577AlaSer: 5.577 ± 1.73
4.109AlaThr: 4.109 ± 0.528
2.935AlaVal: 2.935 ± 0.966
0.881AlaTrp: 0.881 ± 0.555
1.468AlaTyr: 1.468 ± 0.615
0.0AlaXaa: 0.0 ± 0.0
Cys
0.881CysAla: 0.881 ± 0.474
0.294CysCys: 0.294 ± 0.294
0.587CysAsp: 0.587 ± 0.426
0.294CysGlu: 0.294 ± 0.294
0.0CysPhe: 0.0 ± 0.0
0.587CysGly: 0.587 ± 0.463
0.294CysHis: 0.294 ± 0.253
0.0CysIle: 0.0 ± 0.0
0.294CysLys: 0.294 ± 0.271
1.174CysLeu: 1.174 ± 0.584
0.0CysMet: 0.0 ± 0.0
0.587CysAsn: 0.587 ± 0.495
0.587CysPro: 0.587 ± 0.463
0.587CysGln: 0.587 ± 0.344
0.587CysArg: 0.587 ± 0.41
0.587CysSer: 0.587 ± 0.432
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.587CysTyr: 0.587 ± 0.439
0.0CysXaa: 0.0 ± 0.0
Asp
2.642AspAla: 2.642 ± 0.958
0.881AspCys: 0.881 ± 0.668
3.229AspAsp: 3.229 ± 0.868
4.99AspGlu: 4.99 ± 1.408
1.761AspPhe: 1.761 ± 0.513
2.055AspGly: 2.055 ± 0.76
1.174AspHis: 1.174 ± 0.598
7.338AspIle: 7.338 ± 1.228
6.457AspLys: 6.457 ± 1.064
6.751AspLeu: 6.751 ± 0.848
1.174AspMet: 1.174 ± 0.641
1.761AspAsn: 1.761 ± 0.542
0.881AspPro: 0.881 ± 0.562
1.761AspGln: 1.761 ± 0.658
2.935AspArg: 2.935 ± 0.716
2.348AspSer: 2.348 ± 0.811
3.229AspThr: 3.229 ± 0.934
0.881AspVal: 0.881 ± 0.651
0.294AspTrp: 0.294 ± 0.246
4.109AspTyr: 4.109 ± 1.456
0.0AspXaa: 0.0 ± 0.0
Glu
6.164GluAla: 6.164 ± 1.121
1.174GluCys: 1.174 ± 0.673
4.99GluAsp: 4.99 ± 1.299
5.87GluGlu: 5.87 ± 1.8
3.522GluPhe: 3.522 ± 1.483
2.935GluGly: 2.935 ± 0.949
1.761GluHis: 1.761 ± 0.727
6.164GluIle: 6.164 ± 1.287
5.577GluLys: 5.577 ± 0.874
9.392GluLeu: 9.392 ± 1.246
2.055GluMet: 2.055 ± 0.697
2.935GluAsn: 2.935 ± 0.712
1.174GluPro: 1.174 ± 0.468
4.696GluGln: 4.696 ± 1.711
3.816GluArg: 3.816 ± 1.08
1.468GluSer: 1.468 ± 0.87
4.99GluThr: 4.99 ± 1.28
4.696GluVal: 4.696 ± 1.162
0.587GluTrp: 0.587 ± 0.372
4.696GluTyr: 4.696 ± 1.053
0.0GluXaa: 0.0 ± 0.0
Phe
2.055PheAla: 2.055 ± 0.566
0.0PheCys: 0.0 ± 0.0
3.229PheAsp: 3.229 ± 0.622
2.348PheGlu: 2.348 ± 0.674
1.761PhePhe: 1.761 ± 0.609
1.174PheGly: 1.174 ± 0.46
2.055PheHis: 2.055 ± 0.552
2.348PheIle: 2.348 ± 0.517
4.403PheLys: 4.403 ± 1.023
2.642PheLeu: 2.642 ± 1.052
0.294PheMet: 0.294 ± 0.27
2.348PheAsn: 2.348 ± 0.904
0.587PhePro: 0.587 ± 0.344
1.468PheGln: 1.468 ± 0.497
2.055PheArg: 2.055 ± 0.616
3.229PheSer: 3.229 ± 0.713
3.522PheThr: 3.522 ± 0.823
2.642PheVal: 2.642 ± 0.621
0.294PheTrp: 0.294 ± 0.246
1.761PheTyr: 1.761 ± 0.753
0.0PheXaa: 0.0 ± 0.0
Gly
2.348GlyAla: 2.348 ± 1.081
0.0GlyCys: 0.0 ± 0.0
4.109GlyAsp: 4.109 ± 1.195
1.468GlyGlu: 1.468 ± 0.465
2.642GlyPhe: 2.642 ± 0.786
1.761GlyGly: 1.761 ± 0.573
1.468GlyHis: 1.468 ± 0.61
2.642GlyIle: 2.642 ± 1.069
3.816GlyLys: 3.816 ± 0.915
5.87GlyLeu: 5.87 ± 1.448
1.174GlyMet: 1.174 ± 0.443
2.348GlyAsn: 2.348 ± 0.697
0.587GlyPro: 0.587 ± 0.388
2.935GlyGln: 2.935 ± 1.448
2.642GlyArg: 2.642 ± 0.894
1.761GlySer: 1.761 ± 0.853
2.935GlyThr: 2.935 ± 0.721
2.642GlyVal: 2.642 ± 1.115
0.587GlyTrp: 0.587 ± 0.332
2.642GlyTyr: 2.642 ± 0.759
0.0GlyXaa: 0.0 ± 0.0
His
2.055HisAla: 2.055 ± 1.113
0.0HisCys: 0.0 ± 0.0
0.294HisAsp: 0.294 ± 0.371
0.587HisGlu: 0.587 ± 0.356
0.0HisPhe: 0.0 ± 0.0
1.174HisGly: 1.174 ± 0.442
0.294HisHis: 0.294 ± 0.246
1.468HisIle: 1.468 ± 0.53
1.468HisLys: 1.468 ± 0.908
2.348HisLeu: 2.348 ± 0.911
0.0HisMet: 0.0 ± 0.0
1.174HisAsn: 1.174 ± 0.667
0.587HisPro: 0.587 ± 0.348
1.174HisGln: 1.174 ± 0.677
0.587HisArg: 0.587 ± 0.388
0.0HisSer: 0.0 ± 0.0
1.174HisThr: 1.174 ± 0.488
0.881HisVal: 0.881 ± 0.388
0.587HisTrp: 0.587 ± 0.344
2.055HisTyr: 2.055 ± 0.778
0.0HisXaa: 0.0 ± 0.0
Ile
5.87IleAla: 5.87 ± 1.262
0.587IleCys: 0.587 ± 0.388
5.283IleAsp: 5.283 ± 1.467
5.577IleGlu: 5.577 ± 1.042
2.642IlePhe: 2.642 ± 0.811
1.761IleGly: 1.761 ± 0.503
0.881IleHis: 0.881 ± 0.651
3.522IleIle: 3.522 ± 0.986
11.447IleLys: 11.447 ± 1.768
5.283IleLeu: 5.283 ± 1.026
1.174IleMet: 1.174 ± 0.578
3.229IleAsn: 3.229 ± 0.879
2.348IlePro: 2.348 ± 0.612
2.348IleGln: 2.348 ± 0.843
2.935IleArg: 2.935 ± 0.828
4.109IleSer: 4.109 ± 1.247
4.99IleThr: 4.99 ± 1.202
3.229IleVal: 3.229 ± 0.887
0.0IleTrp: 0.0 ± 0.0
2.642IleTyr: 2.642 ± 0.967
0.0IleXaa: 0.0 ± 0.0
Lys
8.218LysAla: 8.218 ± 1.57
0.294LysCys: 0.294 ± 0.371
3.816LysAsp: 3.816 ± 1.056
12.621LysGlu: 12.621 ± 1.667
2.935LysPhe: 2.935 ± 0.569
4.403LysGly: 4.403 ± 1.498
2.055LysHis: 2.055 ± 0.601
4.696LysIle: 4.696 ± 1.177
7.338LysLys: 7.338 ± 1.754
5.87LysLeu: 5.87 ± 1.368
2.348LysMet: 2.348 ± 1.034
5.87LysAsn: 5.87 ± 1.158
4.403LysPro: 4.403 ± 1.388
4.696LysGln: 4.696 ± 1.066
5.87LysArg: 5.87 ± 1.089
3.816LysSer: 3.816 ± 1.104
7.338LysThr: 7.338 ± 1.766
6.751LysVal: 6.751 ± 1.134
1.174LysTrp: 1.174 ± 0.701
3.229LysTyr: 3.229 ± 0.946
0.0LysXaa: 0.0 ± 0.0
Leu
6.164LeuAla: 6.164 ± 1.226
0.881LeuCys: 0.881 ± 0.625
5.87LeuAsp: 5.87 ± 1.276
10.86LeuGlu: 10.86 ± 1.269
4.403LeuPhe: 4.403 ± 1.209
4.696LeuGly: 4.696 ± 1.271
1.468LeuHis: 1.468 ± 0.686
8.805LeuIle: 8.805 ± 2.045
9.099LeuLys: 9.099 ± 1.791
8.218LeuLeu: 8.218 ± 1.464
1.174LeuMet: 1.174 ± 0.859
4.403LeuAsn: 4.403 ± 1.124
4.696LeuPro: 4.696 ± 1.262
2.642LeuGln: 2.642 ± 0.612
2.055LeuArg: 2.055 ± 0.741
8.218LeuSer: 8.218 ± 2.037
4.403LeuThr: 4.403 ± 0.853
3.522LeuVal: 3.522 ± 1.124
0.881LeuTrp: 0.881 ± 0.43
5.87LeuTyr: 5.87 ± 1.002
0.0LeuXaa: 0.0 ± 0.0
Met
0.881MetAla: 0.881 ± 0.441
0.294MetCys: 0.294 ± 0.246
1.174MetAsp: 1.174 ± 0.525
0.294MetGlu: 0.294 ± 0.308
0.587MetPhe: 0.587 ± 0.372
0.881MetGly: 0.881 ± 0.589
0.0MetHis: 0.0 ± 0.0
0.587MetIle: 0.587 ± 0.417
3.816MetLys: 3.816 ± 0.921
2.935MetLeu: 2.935 ± 0.697
0.0MetMet: 0.0 ± 0.0
1.468MetAsn: 1.468 ± 0.552
0.294MetPro: 0.294 ± 0.271
0.294MetGln: 0.294 ± 0.288
1.174MetArg: 1.174 ± 0.493
1.468MetSer: 1.468 ± 0.744
3.816MetThr: 3.816 ± 0.999
0.587MetVal: 0.587 ± 0.365
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.816AsnAla: 3.816 ± 0.883
0.0AsnCys: 0.0 ± 0.0
1.468AsnAsp: 1.468 ± 0.569
3.229AsnGlu: 3.229 ± 1.014
1.174AsnPhe: 1.174 ± 0.572
4.403AsnGly: 4.403 ± 1.018
1.468AsnHis: 1.468 ± 0.529
2.935AsnIle: 2.935 ± 1.324
4.696AsnLys: 4.696 ± 0.977
4.696AsnLeu: 4.696 ± 1.145
2.055AsnMet: 2.055 ± 0.726
2.055AsnAsn: 2.055 ± 0.925
2.935AsnPro: 2.935 ± 0.736
2.935AsnGln: 2.935 ± 0.847
4.109AsnArg: 4.109 ± 0.8
2.348AsnSer: 2.348 ± 0.508
2.935AsnThr: 2.935 ± 1.032
2.642AsnVal: 2.642 ± 0.911
0.587AsnTrp: 0.587 ± 0.493
2.348AsnTyr: 2.348 ± 0.703
0.0AsnXaa: 0.0 ± 0.0
Pro
0.294ProAla: 0.294 ± 0.303
0.294ProCys: 0.294 ± 0.271
1.468ProAsp: 1.468 ± 0.548
4.109ProGlu: 4.109 ± 1.053
1.468ProPhe: 1.468 ± 0.641
0.294ProGly: 0.294 ± 0.294
0.0ProHis: 0.0 ± 0.0
1.761ProIle: 1.761 ± 0.665
4.99ProLys: 4.99 ± 1.185
2.642ProLeu: 2.642 ± 1.103
0.587ProMet: 0.587 ± 0.427
2.055ProAsn: 2.055 ± 0.834
0.881ProPro: 0.881 ± 0.571
0.587ProGln: 0.587 ± 0.339
2.348ProArg: 2.348 ± 1.043
1.174ProSer: 1.174 ± 0.626
2.935ProThr: 2.935 ± 0.721
2.348ProVal: 2.348 ± 0.799
0.0ProTrp: 0.0 ± 0.0
1.761ProTyr: 1.761 ± 0.914
0.0ProXaa: 0.0 ± 0.0
Gln
2.055GlnAla: 2.055 ± 0.775
0.0GlnCys: 0.0 ± 0.0
2.055GlnAsp: 2.055 ± 0.832
4.109GlnGlu: 4.109 ± 1.051
1.174GlnPhe: 1.174 ± 0.486
3.522GlnGly: 3.522 ± 0.891
0.881GlnHis: 0.881 ± 0.421
2.348GlnIle: 2.348 ± 0.682
4.403GlnLys: 4.403 ± 1.268
6.751GlnLeu: 6.751 ± 1.105
1.174GlnMet: 1.174 ± 0.838
2.348GlnAsn: 2.348 ± 0.884
1.468GlnPro: 1.468 ± 0.617
3.229GlnGln: 3.229 ± 1.098
2.935GlnArg: 2.935 ± 0.646
2.935GlnSer: 2.935 ± 0.765
0.881GlnThr: 0.881 ± 0.439
2.642GlnVal: 2.642 ± 1.054
0.294GlnTrp: 0.294 ± 0.246
0.587GlnTyr: 0.587 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
1.468ArgAla: 1.468 ± 0.567
0.587ArgCys: 0.587 ± 0.371
3.522ArgAsp: 3.522 ± 0.973
2.055ArgGlu: 2.055 ± 0.724
2.055ArgPhe: 2.055 ± 0.734
2.642ArgGly: 2.642 ± 0.916
1.468ArgHis: 1.468 ± 0.613
2.348ArgIle: 2.348 ± 0.736
5.577ArgLys: 5.577 ± 1.351
5.87ArgLeu: 5.87 ± 1.118
0.881ArgMet: 0.881 ± 0.439
2.642ArgAsn: 2.642 ± 1.053
2.055ArgPro: 2.055 ± 0.765
3.522ArgGln: 3.522 ± 0.833
1.761ArgArg: 1.761 ± 0.726
2.642ArgSer: 2.642 ± 0.788
2.055ArgThr: 2.055 ± 0.716
3.522ArgVal: 3.522 ± 0.846
0.587ArgTrp: 0.587 ± 0.458
3.229ArgTyr: 3.229 ± 1.02
0.0ArgXaa: 0.0 ± 0.0
Ser
3.229SerAla: 3.229 ± 1.054
0.587SerCys: 0.587 ± 0.463
4.109SerAsp: 4.109 ± 1.085
4.403SerGlu: 4.403 ± 1.334
2.348SerPhe: 2.348 ± 0.828
2.055SerGly: 2.055 ± 0.908
0.587SerHis: 0.587 ± 0.371
4.403SerIle: 4.403 ± 1.13
6.457SerLys: 6.457 ± 1.765
6.457SerLeu: 6.457 ± 1.238
1.468SerMet: 1.468 ± 0.514
2.055SerAsn: 2.055 ± 0.548
1.468SerPro: 1.468 ± 0.649
2.348SerGln: 2.348 ± 0.934
1.761SerArg: 1.761 ± 0.567
1.761SerSer: 1.761 ± 0.641
3.522SerThr: 3.522 ± 1.064
3.229SerVal: 3.229 ± 1.031
0.587SerTrp: 0.587 ± 0.388
2.348SerTyr: 2.348 ± 0.865
0.0SerXaa: 0.0 ± 0.0
Thr
3.816ThrAla: 3.816 ± 1.22
0.0ThrCys: 0.0 ± 0.0
2.348ThrAsp: 2.348 ± 0.707
3.816ThrGlu: 3.816 ± 1.222
3.522ThrPhe: 3.522 ± 1.069
5.283ThrGly: 5.283 ± 1.136
0.587ThrHis: 0.587 ± 0.363
5.577ThrIle: 5.577 ± 1.501
2.642ThrLys: 2.642 ± 0.914
6.751ThrLeu: 6.751 ± 1.251
0.587ThrMet: 0.587 ± 0.364
2.055ThrAsn: 2.055 ± 1.146
3.522ThrPro: 3.522 ± 1.093
2.935ThrGln: 2.935 ± 0.826
3.522ThrArg: 3.522 ± 1.127
4.109ThrSer: 4.109 ± 1.005
3.522ThrThr: 3.522 ± 1.492
2.935ThrVal: 2.935 ± 0.819
0.881ThrTrp: 0.881 ± 0.438
3.816ThrTyr: 3.816 ± 1.148
0.0ThrXaa: 0.0 ± 0.0
Val
2.055ValAla: 2.055 ± 0.657
0.587ValCys: 0.587 ± 0.41
2.642ValAsp: 2.642 ± 0.95
2.642ValGlu: 2.642 ± 1.136
2.642ValPhe: 2.642 ± 0.747
2.642ValGly: 2.642 ± 0.569
0.294ValHis: 0.294 ± 0.294
4.696ValIle: 4.696 ± 1.398
4.403ValLys: 4.403 ± 0.99
5.87ValLeu: 5.87 ± 1.346
1.468ValMet: 1.468 ± 0.616
3.816ValAsn: 3.816 ± 1.035
1.174ValPro: 1.174 ± 0.608
1.468ValGln: 1.468 ± 0.674
2.055ValArg: 2.055 ± 0.824
4.99ValSer: 4.99 ± 1.151
3.229ValThr: 3.229 ± 1.183
3.229ValVal: 3.229 ± 0.866
0.587ValTrp: 0.587 ± 0.491
1.468ValTyr: 1.468 ± 0.461
0.0ValXaa: 0.0 ± 0.0
Trp
0.294TrpAla: 0.294 ± 0.277
0.0TrpCys: 0.0 ± 0.0
0.587TrpAsp: 0.587 ± 0.34
0.587TrpGlu: 0.587 ± 0.494
0.0TrpPhe: 0.0 ± 0.0
0.587TrpGly: 0.587 ± 0.339
0.0TrpHis: 0.0 ± 0.0
0.294TrpIle: 0.294 ± 0.297
0.881TrpLys: 0.881 ± 0.538
2.055TrpLeu: 2.055 ± 0.773
0.0TrpMet: 0.0 ± 0.0
0.587TrpAsn: 0.587 ± 0.345
0.294TrpPro: 0.294 ± 0.246
0.587TrpGln: 0.587 ± 0.439
0.294TrpArg: 0.294 ± 0.294
0.294TrpSer: 0.294 ± 0.325
0.0TrpThr: 0.0 ± 0.0
1.468TrpVal: 1.468 ± 0.652
0.881TrpTrp: 0.881 ± 0.531
0.587TrpTyr: 0.587 ± 0.417
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.468TyrAla: 1.468 ± 0.541
0.294TyrCys: 0.294 ± 0.325
2.642TyrAsp: 2.642 ± 0.708
4.109TyrGlu: 4.109 ± 1.205
2.348TyrPhe: 2.348 ± 0.691
1.761TyrGly: 1.761 ± 0.799
1.174TyrHis: 1.174 ± 0.467
1.761TyrIle: 1.761 ± 0.666
4.99TyrLys: 4.99 ± 1.405
2.642TyrLeu: 2.642 ± 0.574
1.468TyrMet: 1.468 ± 0.709
4.696TyrAsn: 4.696 ± 1.096
1.468TyrPro: 1.468 ± 0.951
2.935TyrGln: 2.935 ± 0.955
4.696TyrArg: 4.696 ± 1.46
2.348TyrSer: 2.348 ± 0.681
2.642TyrThr: 2.642 ± 0.51
1.174TyrVal: 1.174 ± 0.44
0.587TyrTrp: 0.587 ± 0.589
3.816TyrTyr: 3.816 ± 1.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 22 proteins (3408 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski