Amino acid dipepetide frequency for Streptococcus phage 53

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.696AlaAla: 2.696 ± 0.889
0.193AlaCys: 0.193 ± 0.125
3.948AlaAsp: 3.948 ± 0.794
3.274AlaGlu: 3.274 ± 0.502
2.311AlaPhe: 2.311 ± 0.819
3.659AlaGly: 3.659 ± 0.656
0.867AlaHis: 0.867 ± 0.283
4.911AlaIle: 4.911 ± 0.751
6.163AlaLys: 6.163 ± 1.285
6.066AlaLeu: 6.066 ± 0.716
1.252AlaMet: 1.252 ± 0.3
4.333AlaAsn: 4.333 ± 0.974
1.637AlaPro: 1.637 ± 0.381
2.407AlaGln: 2.407 ± 0.584
2.696AlaArg: 2.696 ± 0.455
4.718AlaSer: 4.718 ± 0.71
4.237AlaThr: 4.237 ± 0.946
3.659AlaVal: 3.659 ± 0.653
0.867AlaTrp: 0.867 ± 0.233
2.215AlaTyr: 2.215 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.193CysAla: 0.193 ± 0.116
0.0CysCys: 0.0 ± 0.0
0.674CysAsp: 0.674 ± 0.317
0.578CysGlu: 0.578 ± 0.217
0.385CysPhe: 0.385 ± 0.252
0.289CysGly: 0.289 ± 0.225
0.096CysHis: 0.096 ± 0.094
0.289CysIle: 0.289 ± 0.18
0.578CysLys: 0.578 ± 0.317
0.385CysLeu: 0.385 ± 0.203
0.0CysMet: 0.0 ± 0.0
0.578CysAsn: 0.578 ± 0.276
0.385CysPro: 0.385 ± 0.319
0.385CysGln: 0.385 ± 0.177
0.578CysArg: 0.578 ± 0.365
0.096CysSer: 0.096 ± 0.096
0.289CysThr: 0.289 ± 0.165
0.289CysVal: 0.289 ± 0.14
0.193CysTrp: 0.193 ± 0.132
0.289CysTyr: 0.289 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
3.178AspAla: 3.178 ± 0.546
0.289AspCys: 0.289 ± 0.165
3.852AspAsp: 3.852 ± 0.674
4.044AspGlu: 4.044 ± 0.528
3.755AspPhe: 3.755 ± 0.534
6.163AspGly: 6.163 ± 0.76
0.963AspHis: 0.963 ± 0.338
4.141AspIle: 4.141 ± 0.588
5.392AspLys: 5.392 ± 0.838
3.948AspLeu: 3.948 ± 0.793
2.311AspMet: 2.311 ± 0.479
4.429AspAsn: 4.429 ± 1.001
2.504AspPro: 2.504 ± 0.498
1.252AspGln: 1.252 ± 0.255
2.504AspArg: 2.504 ± 0.407
3.467AspSer: 3.467 ± 0.593
3.563AspThr: 3.563 ± 0.633
4.526AspVal: 4.526 ± 0.916
0.963AspTrp: 0.963 ± 0.242
2.696AspTyr: 2.696 ± 0.607
0.0AspXaa: 0.0 ± 0.0
Glu
4.237GluAla: 4.237 ± 0.551
0.289GluCys: 0.289 ± 0.141
3.274GluAsp: 3.274 ± 0.528
4.237GluGlu: 4.237 ± 0.917
2.6GluPhe: 2.6 ± 0.6
3.081GluGly: 3.081 ± 0.533
1.156GluHis: 1.156 ± 0.322
6.452GluIle: 6.452 ± 0.968
3.659GluLys: 3.659 ± 0.777
5.97GluLeu: 5.97 ± 0.823
2.118GluMet: 2.118 ± 0.443
4.141GluAsn: 4.141 ± 0.803
1.83GluPro: 1.83 ± 0.534
3.081GluGln: 3.081 ± 0.467
3.948GluArg: 3.948 ± 0.76
2.985GluSer: 2.985 ± 0.305
3.467GluThr: 3.467 ± 0.509
4.141GluVal: 4.141 ± 0.741
1.252GluTrp: 1.252 ± 0.218
3.659GluTyr: 3.659 ± 0.686
0.0GluXaa: 0.0 ± 0.0
Phe
3.081PheAla: 3.081 ± 0.506
0.289PheCys: 0.289 ± 0.22
3.948PheAsp: 3.948 ± 0.556
2.215PheGlu: 2.215 ± 0.52
2.022PhePhe: 2.022 ± 0.444
3.563PheGly: 3.563 ± 0.697
0.385PheHis: 0.385 ± 0.172
2.6PheIle: 2.6 ± 0.706
4.718PheLys: 4.718 ± 0.616
3.755PheLeu: 3.755 ± 0.643
0.674PheMet: 0.674 ± 0.257
2.792PheAsn: 2.792 ± 0.665
0.385PhePro: 0.385 ± 0.207
1.156PheGln: 1.156 ± 0.246
1.733PheArg: 1.733 ± 0.373
3.178PheSer: 3.178 ± 0.482
2.504PheThr: 2.504 ± 0.418
2.792PheVal: 2.792 ± 0.484
0.674PheTrp: 0.674 ± 0.256
1.541PheTyr: 1.541 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
2.407GlyAla: 2.407 ± 0.816
0.77GlyCys: 0.77 ± 0.296
4.237GlyAsp: 4.237 ± 0.448
4.237GlyGlu: 4.237 ± 0.548
2.985GlyPhe: 2.985 ± 0.49
4.718GlyGly: 4.718 ± 0.756
0.77GlyHis: 0.77 ± 0.263
5.874GlyIle: 5.874 ± 0.78
6.259GlyLys: 6.259 ± 0.989
6.355GlyLeu: 6.355 ± 0.759
1.444GlyMet: 1.444 ± 0.373
3.659GlyAsn: 3.659 ± 0.707
1.156GlyPro: 1.156 ± 0.334
2.696GlyGln: 2.696 ± 0.473
2.696GlyArg: 2.696 ± 0.531
4.526GlySer: 4.526 ± 0.561
5.007GlyThr: 5.007 ± 0.905
3.467GlyVal: 3.467 ± 0.593
1.252GlyTrp: 1.252 ± 0.363
2.985GlyTyr: 2.985 ± 0.464
0.0GlyXaa: 0.0 ± 0.0
His
0.289HisAla: 0.289 ± 0.166
0.096HisCys: 0.096 ± 0.105
0.963HisAsp: 0.963 ± 0.256
0.674HisGlu: 0.674 ± 0.295
0.674HisPhe: 0.674 ± 0.23
0.867HisGly: 0.867 ± 0.256
0.385HisHis: 0.385 ± 0.173
0.77HisIle: 0.77 ± 0.263
0.867HisLys: 0.867 ± 0.221
1.156HisLeu: 1.156 ± 0.263
0.578HisMet: 0.578 ± 0.2
0.867HisAsn: 0.867 ± 0.297
0.963HisPro: 0.963 ± 0.277
0.674HisGln: 0.674 ± 0.261
0.867HisArg: 0.867 ± 0.31
0.963HisSer: 0.963 ± 0.221
0.578HisThr: 0.578 ± 0.222
1.348HisVal: 1.348 ± 0.274
0.193HisTrp: 0.193 ± 0.145
0.867HisTyr: 0.867 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
5.296IleAla: 5.296 ± 0.986
0.481IleCys: 0.481 ± 0.233
5.2IleAsp: 5.2 ± 0.692
4.815IleGlu: 4.815 ± 0.719
2.118IlePhe: 2.118 ± 0.38
4.237IleGly: 4.237 ± 0.637
0.578IleHis: 0.578 ± 0.21
4.141IleIle: 4.141 ± 0.763
6.837IleLys: 6.837 ± 0.742
3.659IleLeu: 3.659 ± 0.719
1.83IleMet: 1.83 ± 0.496
3.948IleAsn: 3.948 ± 0.497
3.563IlePro: 3.563 ± 0.666
2.792IleGln: 2.792 ± 0.497
3.081IleArg: 3.081 ± 0.508
5.296IleSer: 5.296 ± 0.673
3.948IleThr: 3.948 ± 0.67
2.985IleVal: 2.985 ± 0.557
0.963IleTrp: 0.963 ± 0.27
2.696IleTyr: 2.696 ± 0.531
0.0IleXaa: 0.0 ± 0.0
Lys
5.874LysAla: 5.874 ± 0.54
0.578LysCys: 0.578 ± 0.341
4.237LysAsp: 4.237 ± 0.861
6.933LysGlu: 6.933 ± 0.898
3.563LysPhe: 3.563 ± 0.915
5.585LysGly: 5.585 ± 0.587
1.156LysHis: 1.156 ± 0.46
4.911LysIle: 4.911 ± 0.708
7.607LysLys: 7.607 ± 1.311
6.644LysLeu: 6.644 ± 0.875
2.215LysMet: 2.215 ± 0.533
5.681LysAsn: 5.681 ± 0.544
3.178LysPro: 3.178 ± 0.586
3.37LysGln: 3.37 ± 0.587
3.563LysArg: 3.563 ± 0.7
4.141LysSer: 4.141 ± 0.532
5.007LysThr: 5.007 ± 0.723
4.044LysVal: 4.044 ± 0.637
1.059LysTrp: 1.059 ± 0.281
3.081LysTyr: 3.081 ± 0.851
0.0LysXaa: 0.0 ± 0.0
Leu
6.644LeuAla: 6.644 ± 0.72
0.578LeuCys: 0.578 ± 0.25
5.007LeuAsp: 5.007 ± 0.653
6.259LeuGlu: 6.259 ± 0.896
2.889LeuPhe: 2.889 ± 0.406
5.778LeuGly: 5.778 ± 0.996
0.674LeuHis: 0.674 ± 0.306
4.718LeuIle: 4.718 ± 0.631
6.837LeuLys: 6.837 ± 0.688
5.296LeuLeu: 5.296 ± 0.693
2.118LeuMet: 2.118 ± 0.471
6.066LeuAsn: 6.066 ± 0.732
2.985LeuPro: 2.985 ± 0.483
2.504LeuGln: 2.504 ± 0.471
3.274LeuArg: 3.274 ± 0.782
5.778LeuSer: 5.778 ± 0.789
5.104LeuThr: 5.104 ± 0.63
3.948LeuVal: 3.948 ± 0.618
0.674LeuTrp: 0.674 ± 0.285
2.022LeuTyr: 2.022 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
2.022MetAla: 2.022 ± 0.456
0.096MetCys: 0.096 ± 0.101
0.963MetAsp: 0.963 ± 0.305
1.733MetGlu: 1.733 ± 0.404
1.637MetPhe: 1.637 ± 0.344
1.348MetGly: 1.348 ± 0.415
0.385MetHis: 0.385 ± 0.22
1.637MetIle: 1.637 ± 0.336
2.792MetLys: 2.792 ± 0.479
1.733MetLeu: 1.733 ± 0.317
0.289MetMet: 0.289 ± 0.214
1.444MetAsn: 1.444 ± 0.329
0.77MetPro: 0.77 ± 0.216
0.77MetGln: 0.77 ± 0.206
0.963MetArg: 0.963 ± 0.243
1.733MetSer: 1.733 ± 0.54
1.252MetThr: 1.252 ± 0.422
2.215MetVal: 2.215 ± 0.543
0.096MetTrp: 0.096 ± 0.079
0.77MetTyr: 0.77 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
5.104AsnAla: 5.104 ± 1.256
0.193AsnCys: 0.193 ± 0.13
3.852AsnAsp: 3.852 ± 0.425
4.044AsnGlu: 4.044 ± 0.741
2.6AsnPhe: 2.6 ± 0.674
7.126AsnGly: 7.126 ± 1.35
1.444AsnHis: 1.444 ± 0.273
4.333AsnIle: 4.333 ± 0.581
4.237AsnLys: 4.237 ± 0.558
4.911AsnLeu: 4.911 ± 0.51
1.156AsnMet: 1.156 ± 0.299
3.948AsnAsn: 3.948 ± 0.601
2.985AsnPro: 2.985 ± 0.553
2.022AsnGln: 2.022 ± 0.391
2.022AsnArg: 2.022 ± 0.622
3.755AsnSer: 3.755 ± 0.466
3.852AsnThr: 3.852 ± 0.527
3.755AsnVal: 3.755 ± 0.547
1.156AsnTrp: 1.156 ± 0.273
2.118AsnTyr: 2.118 ± 0.401
0.0AsnXaa: 0.0 ± 0.0
Pro
1.637ProAla: 1.637 ± 0.306
0.193ProCys: 0.193 ± 0.179
1.637ProAsp: 1.637 ± 0.474
2.504ProGlu: 2.504 ± 0.495
1.541ProPhe: 1.541 ± 0.332
1.059ProGly: 1.059 ± 0.292
0.674ProHis: 0.674 ± 0.226
1.926ProIle: 1.926 ± 0.321
3.274ProLys: 3.274 ± 0.696
2.889ProLeu: 2.889 ± 0.409
0.385ProMet: 0.385 ± 0.287
2.6ProAsn: 2.6 ± 0.487
0.867ProPro: 0.867 ± 0.364
1.348ProGln: 1.348 ± 0.376
1.348ProArg: 1.348 ± 0.34
2.311ProSer: 2.311 ± 0.415
2.696ProThr: 2.696 ± 0.459
1.637ProVal: 1.637 ± 0.493
0.385ProTrp: 0.385 ± 0.168
0.867ProTyr: 0.867 ± 0.351
0.0ProXaa: 0.0 ± 0.0
Gln
2.889GlnAla: 2.889 ± 0.595
0.193GlnCys: 0.193 ± 0.135
1.83GlnAsp: 1.83 ± 0.422
2.6GlnGlu: 2.6 ± 0.461
1.541GlnPhe: 1.541 ± 0.41
2.696GlnGly: 2.696 ± 0.727
0.578GlnHis: 0.578 ± 0.19
2.504GlnIle: 2.504 ± 0.584
2.889GlnLys: 2.889 ± 0.448
2.889GlnLeu: 2.889 ± 0.484
1.83GlnMet: 1.83 ± 0.356
2.504GlnAsn: 2.504 ± 0.514
0.481GlnPro: 0.481 ± 0.214
2.504GlnGln: 2.504 ± 0.514
1.637GlnArg: 1.637 ± 0.396
2.792GlnSer: 2.792 ± 0.487
2.407GlnThr: 2.407 ± 0.563
2.022GlnVal: 2.022 ± 0.51
0.481GlnTrp: 0.481 ± 0.166
2.118GlnTyr: 2.118 ± 0.387
0.0GlnXaa: 0.0 ± 0.0
Arg
2.022ArgAla: 2.022 ± 0.341
0.578ArgCys: 0.578 ± 0.535
2.696ArgAsp: 2.696 ± 0.395
2.311ArgGlu: 2.311 ± 0.574
2.215ArgPhe: 2.215 ± 0.425
2.6ArgGly: 2.6 ± 0.4
0.867ArgHis: 0.867 ± 0.272
3.274ArgIle: 3.274 ± 0.653
3.37ArgLys: 3.37 ± 0.806
3.274ArgLeu: 3.274 ± 0.605
1.059ArgMet: 1.059 ± 0.304
2.6ArgAsn: 2.6 ± 0.369
1.156ArgPro: 1.156 ± 0.255
2.215ArgGln: 2.215 ± 0.551
1.541ArgArg: 1.541 ± 0.342
1.926ArgSer: 1.926 ± 0.454
3.081ArgThr: 3.081 ± 0.87
2.6ArgVal: 2.6 ± 0.643
1.348ArgTrp: 1.348 ± 0.314
2.696ArgTyr: 2.696 ± 0.752
0.0ArgXaa: 0.0 ± 0.0
Ser
3.178SerAla: 3.178 ± 0.44
0.385SerCys: 0.385 ± 0.24
4.622SerAsp: 4.622 ± 0.506
4.429SerGlu: 4.429 ± 0.672
3.467SerPhe: 3.467 ± 0.638
4.333SerGly: 4.333 ± 0.522
0.481SerHis: 0.481 ± 0.221
4.237SerIle: 4.237 ± 0.544
4.815SerLys: 4.815 ± 0.788
4.718SerLeu: 4.718 ± 0.667
2.022SerMet: 2.022 ± 0.5
4.911SerAsn: 4.911 ± 0.869
1.83SerPro: 1.83 ± 0.423
2.985SerGln: 2.985 ± 0.577
3.081SerArg: 3.081 ± 0.722
4.237SerSer: 4.237 ± 0.573
4.044SerThr: 4.044 ± 0.564
5.2SerVal: 5.2 ± 0.772
0.674SerTrp: 0.674 ± 0.285
1.444SerTyr: 1.444 ± 0.404
0.0SerXaa: 0.0 ± 0.0
Thr
4.429ThrAla: 4.429 ± 0.659
0.289ThrCys: 0.289 ± 0.175
4.333ThrAsp: 4.333 ± 0.687
3.755ThrGlu: 3.755 ± 0.471
3.659ThrPhe: 3.659 ± 0.622
3.755ThrGly: 3.755 ± 0.554
0.77ThrHis: 0.77 ± 0.277
4.622ThrIle: 4.622 ± 0.829
4.622ThrLys: 4.622 ± 0.771
5.97ThrLeu: 5.97 ± 0.71
1.156ThrMet: 1.156 ± 0.272
3.563ThrAsn: 3.563 ± 0.666
1.926ThrPro: 1.926 ± 0.45
2.215ThrGln: 2.215 ± 0.452
1.444ThrArg: 1.444 ± 0.317
3.948ThrSer: 3.948 ± 0.595
2.792ThrThr: 2.792 ± 0.529
4.526ThrVal: 4.526 ± 0.568
1.059ThrTrp: 1.059 ± 0.317
3.178ThrTyr: 3.178 ± 0.595
0.0ThrXaa: 0.0 ± 0.0
Val
3.852ValAla: 3.852 ± 0.525
0.193ValCys: 0.193 ± 0.129
5.2ValAsp: 5.2 ± 0.565
3.755ValGlu: 3.755 ± 0.533
1.83ValPhe: 1.83 ± 0.405
3.948ValGly: 3.948 ± 0.571
0.867ValHis: 0.867 ± 0.242
3.755ValIle: 3.755 ± 0.556
4.526ValLys: 4.526 ± 0.633
4.622ValLeu: 4.622 ± 0.843
1.252ValMet: 1.252 ± 0.366
3.755ValAsn: 3.755 ± 0.665
1.926ValPro: 1.926 ± 0.437
1.733ValGln: 1.733 ± 0.43
2.985ValArg: 2.985 ± 0.744
4.911ValSer: 4.911 ± 0.696
4.815ValThr: 4.815 ± 0.834
3.755ValVal: 3.755 ± 0.742
0.963ValTrp: 0.963 ± 0.26
1.926ValTyr: 1.926 ± 0.332
0.0ValXaa: 0.0 ± 0.0
Trp
0.578TrpAla: 0.578 ± 0.242
0.193TrpCys: 0.193 ± 0.125
1.252TrpAsp: 1.252 ± 0.385
0.867TrpGlu: 0.867 ± 0.225
0.867TrpPhe: 0.867 ± 0.28
0.674TrpGly: 0.674 ± 0.254
0.481TrpHis: 0.481 ± 0.207
0.674TrpIle: 0.674 ± 0.217
0.674TrpLys: 0.674 ± 0.328
1.444TrpLeu: 1.444 ± 0.293
0.096TrpMet: 0.096 ± 0.098
0.867TrpAsn: 0.867 ± 0.391
0.096TrpPro: 0.096 ± 0.105
0.77TrpGln: 0.77 ± 0.27
0.867TrpArg: 0.867 ± 0.232
1.637TrpSer: 1.637 ± 0.604
1.059TrpThr: 1.059 ± 0.267
1.156TrpVal: 1.156 ± 0.263
0.289TrpTrp: 0.289 ± 0.192
0.193TrpTyr: 0.193 ± 0.116
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.331
0.674TyrCys: 0.674 ± 0.352
2.215TyrAsp: 2.215 ± 0.38
2.407TyrGlu: 2.407 ± 0.476
1.348TyrPhe: 1.348 ± 0.348
2.022TyrGly: 2.022 ± 0.434
1.059TyrHis: 1.059 ± 0.432
2.696TyrIle: 2.696 ± 0.465
2.311TyrLys: 2.311 ± 0.473
3.467TyrLeu: 3.467 ± 0.496
0.77TyrMet: 0.77 ± 0.311
1.926TyrAsn: 1.926 ± 0.46
1.156TyrPro: 1.156 ± 0.354
2.504TyrGln: 2.504 ± 0.393
2.6TyrArg: 2.6 ± 0.656
2.696TyrSer: 2.696 ± 0.518
2.311TyrThr: 2.311 ± 0.451
2.504TyrVal: 2.504 ± 0.481
0.193TyrTrp: 0.193 ± 0.134
2.022TyrTyr: 2.022 ± 0.638
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (10386 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski