Amino acid dipeptide frequency for Streptococcus satellite phage Javan614

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
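The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    so no dipeptide spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues into X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / len(STANDARD) ** 2
```

For example, `dipeptide_per_mille(["ACAC"])` counts the pairs AC, CA, AC, so AC gets two thirds and CA one third of the total.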

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
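As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and using 2.5‰ as the cut-off between over- and underrepresented dipeptides is an assumption based on the text above, not a documented rule of the database.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation for 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille value relative to the assumed expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: GluLeu (9.63) and AlaAla (2.485).
glu_leu = representation(9.63)
ala_ala = representation(2.485)
```

Under this assumption GluLeu is classified as overrepresented and AlaAla, at 2.485‰, falls just below the expected value.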

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.485 ± 1.47 | 0.311 ± 0.224 | 4.66 ± 1.13 | 2.485 ± 0.869 | 3.107 ± 1.067 | 6.524 ± 1.874 | 0.621 ± 0.399 | 6.834 ± 1.737 | 5.592 ± 1.693 | 5.281 ± 1.196 | 2.175 ± 1.079 | 4.97 ± 1.329 | 3.417 ± 1.121 | 2.796 ± 1.003 | 3.107 ± 0.652 | 3.728 ± 0.957 | 3.417 ± 0.704 | 2.796 ± 1.077 | 0.932 ± 0.425 | 3.728 ± 0.726 | 0.0 ± 0.0
Cys | 0.311 ± 0.32 | 0.0 ± 0.0 | 0.932 ± 0.609 | 0.311 ± 0.264 | 0.621 ± 0.367 | 0.621 ± 0.377 | 0.0 ± 0.0 | 0.311 ± 0.328 | 0.311 ± 0.224 | 0.932 ± 0.381 | 0.0 ± 0.0 | 0.311 ± 0.224 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.311 ± 0.281 | 0.311 ± 0.48 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.311 ± 0.339 | 0.0 ± 0.0
Asp | 2.175 ± 1.881 | 0.311 ± 0.339 | 1.864 ± 0.722 | 6.213 ± 1.759 | 3.107 ± 1.042 | 4.66 ± 1.563 | 0.621 ± 0.577 | 5.592 ± 1.367 | 5.592 ± 1.104 | 3.417 ± 1.132 | 2.485 ± 0.697 | 4.039 ± 0.964 | 1.243 ± 0.54 | 0.932 ± 0.425 | 2.175 ± 1.037 | 2.796 ± 1.048 | 3.728 ± 0.817 | 2.485 ± 0.713 | 0.932 ± 0.532 | 3.417 ± 1.486 | 0.0 ± 0.0
Glu | 5.281 ± 1.322 | 0.932 ± 0.502 | 2.175 ± 0.724 | 3.417 ± 1.155 | 3.417 ± 0.994 | 2.485 ± 0.651 | 1.243 ± 0.537 | 3.728 ± 1.429 | 6.524 ± 1.774 | 9.63 ± 2.081 | 0.621 ± 0.429 | 2.175 ± 0.639 | 2.796 ± 1.106 | 3.728 ± 2.483 | 1.864 ± 0.956 | 2.796 ± 0.958 | 4.97 ± 1.327 | 1.243 ± 0.554 | 0.621 ± 0.461 | 2.485 ± 0.992 | 0.0 ± 0.0
Phe | 3.728 ± 1.734 | 0.0 ± 0.0 | 3.417 ± 0.967 | 2.796 ± 1.397 | 1.243 ± 0.775 | 2.796 ± 0.909 | 0.311 ± 0.281 | 3.107 ± 0.663 | 2.485 ± 0.799 | 4.97 ± 1.193 | 0.932 ± 0.605 | 4.349 ± 1.36 | 1.553 ± 0.742 | 1.243 ± 0.531 | 1.243 ± 0.644 | 2.175 ± 0.595 | 1.864 ± 0.762 | 2.175 ± 0.77 | 0.311 ± 0.224 | 1.864 ± 0.894 | 0.0 ± 0.0
Gly | 3.728 ± 1.141 | 1.243 ± 0.642 | 3.417 ± 1.245 | 3.728 ± 0.774 | 3.728 ± 0.845 | 3.107 ± 1.098 | 0.621 ± 0.359 | 5.902 ± 1.912 | 4.349 ± 1.005 | 5.592 ± 1.52 | 0.311 ± 0.264 | 3.728 ± 1.391 | 0.311 ± 0.32 | 2.485 ± 0.915 | 2.485 ± 0.756 | 1.864 ± 0.731 | 3.417 ± 0.956 | 5.902 ± 1.608 | 0.621 ± 0.449 | 5.902 ± 1.856 | 0.0 ± 0.0
His | 2.175 ± 0.654 | 0.0 ± 0.0 | 0.621 ± 0.502 | 1.553 ± 0.78 | 1.243 ± 0.391 | 0.932 ± 0.676 | 0.0 ± 0.0 | 0.621 ± 0.383 | 0.932 ± 0.391 | 1.243 ± 0.525 | 0.0 ± 0.0 | 0.311 ± 0.32 | 0.311 ± 0.26 | 0.621 ± 0.44 | 0.621 ± 0.383 | 0.621 ± 0.419 | 1.553 ± 0.832 | 1.243 ± 0.505 | 0.0 ± 0.0 | 0.621 ± 0.4 | 0.0 ± 0.0
Ile | 4.97 ± 1.592 | 0.621 ± 0.367 | 3.107 ± 0.979 | 3.728 ± 0.989 | 2.485 ± 0.793 | 4.039 ± 1.238 | 1.243 ± 0.528 | 4.039 ± 1.422 | 5.592 ± 1.251 | 7.456 ± 1.978 | 1.553 ± 0.599 | 4.97 ± 0.989 | 4.039 ± 1.044 | 1.553 ± 0.58 | 0.621 ± 0.376 | 5.281 ± 1.332 | 5.592 ± 0.975 | 4.039 ± 1.005 | 0.311 ± 0.264 | 3.417 ± 0.864 | 0.0 ± 0.0
Lys | 7.145 ± 1.58 | 0.0 ± 0.0 | 4.349 ± 1.224 | 6.524 ± 1.823 | 3.107 ± 1.184 | 5.281 ± 1.338 | 2.175 ± 0.776 | 4.66 ± 0.907 | 7.145 ± 1.856 | 6.834 ± 1.746 | 1.553 ± 0.545 | 6.213 ± 1.205 | 5.281 ± 1.495 | 2.175 ± 0.711 | 3.417 ± 1.011 | 4.349 ± 1.196 | 4.97 ± 0.839 | 3.417 ± 1.106 | 0.932 ± 0.418 | 3.728 ± 0.762 | 0.0 ± 0.0
Leu | 7.145 ± 1.362 | 0.621 ± 0.353 | 7.145 ± 1.334 | 6.213 ± 2.179 | 3.417 ± 0.744 | 7.456 ± 2.081 | 1.243 ± 0.984 | 4.66 ± 1.524 | 7.456 ± 1.61 | 9.32 ± 2.719 | 3.107 ± 1.216 | 4.039 ± 1.284 | 5.281 ± 0.849 | 4.039 ± 1.245 | 3.728 ± 1.272 | 6.213 ± 1.12 | 4.97 ± 1.045 | 4.349 ± 0.926 | 1.553 ± 0.69 | 2.175 ± 1.031 | 0.0 ± 0.0
Met | 1.864 ± 0.868 | 0.0 ± 0.0 | 1.864 ± 0.623 | 1.243 ± 0.512 | 0.311 ± 0.32 | 0.621 ± 0.402 | 0.0 ± 0.0 | 1.553 ± 0.576 | 1.864 ± 0.682 | 3.417 ± 1.482 | 0.932 ± 0.424 | 1.553 ± 0.761 | 0.621 ± 0.376 | 0.621 ± 0.44 | 0.621 ± 0.42 | 1.864 ± 0.632 | 3.107 ± 1.006 | 2.485 ± 0.872 | 0.311 ± 0.339 | 0.311 ± 0.224 | 0.0 ± 0.0
Asn | 4.039 ± 1.151 | 0.311 ± 0.281 | 3.728 ± 1.11 | 3.728 ± 0.935 | 0.932 ± 0.506 | 4.039 ± 0.899 | 0.621 ± 0.419 | 3.107 ± 0.694 | 3.728 ± 0.753 | 4.66 ± 1.596 | 2.796 ± 0.787 | 4.349 ± 1.351 | 2.796 ± 0.792 | 1.553 ± 0.442 | 1.553 ± 0.697 | 3.107 ± 1.302 | 4.97 ± 1.453 | 3.728 ± 1.021 | 0.0 ± 0.0 | 2.175 ± 0.647 | 0.0 ± 0.0
Pro | 2.175 ± 0.574 | 0.0 ± 0.0 | 1.864 ± 0.8 | 0.621 ± 0.419 | 2.175 ± 0.812 | 1.864 ± 1.105 | 0.621 ± 0.46 | 1.243 ± 0.512 | 4.349 ± 1.705 | 3.107 ± 0.701 | 0.311 ± 0.264 | 3.107 ± 1.498 | 2.485 ± 0.62 | 2.485 ± 0.851 | 1.243 ± 0.984 | 4.039 ± 1.699 | 2.796 ± 1.079 | 2.796 ± 1.021 | 0.311 ± 0.224 | 2.175 ± 0.623 | 0.0 ± 0.0
Gln | 4.349 ± 1.682 | 0.0 ± 0.0 | 2.175 ± 0.739 | 3.417 ± 1.266 | 1.864 ± 0.733 | 2.175 ± 0.718 | 0.932 ± 0.591 | 4.66 ± 1.275 | 1.864 ± 0.65 | 1.864 ± 0.648 | 0.311 ± 0.26 | 0.932 ± 0.582 | 1.243 ± 0.954 | 1.864 ± 0.944 | 1.864 ± 0.639 | 3.107 ± 1.214 | 2.796 ± 1.327 | 1.864 ± 0.844 | 0.311 ± 0.32 | 0.621 ± 0.419 | 0.0 ± 0.0
Arg | 2.485 ± 0.689 | 0.0 ± 0.0 | 1.864 ± 0.881 | 2.175 ± 0.766 | 0.932 ± 0.426 | 1.864 ± 0.824 | 1.864 ± 0.635 | 2.796 ± 0.814 | 3.417 ± 0.743 | 2.485 ± 0.65 | 0.0 ± 0.0 | 2.796 ± 0.808 | 1.243 ± 0.701 | 0.932 ± 0.398 | 1.243 ± 0.74 | 1.553 ± 0.827 | 2.175 ± 0.999 | 2.796 ± 1.194 | 0.932 ± 0.581 | 2.485 ± 0.775 | 0.0 ± 0.0
Ser | 2.485 ± 0.662 | 0.0 ± 0.0 | 4.349 ± 1.165 | 4.66 ± 1.805 | 3.107 ± 0.759 | 2.175 ± 0.756 | 0.621 ± 0.41 | 2.796 ± 0.929 | 6.524 ± 1.749 | 6.213 ± 1.506 | 1.243 ± 0.592 | 1.864 ± 0.888 | 1.553 ± 0.689 | 3.417 ± 1.446 | 1.864 ± 0.729 | 3.107 ± 1.083 | 5.281 ± 1.611 | 3.107 ± 1.043 | 2.175 ± 1.047 | 3.107 ± 0.708 | 0.0 ± 0.0
Thr | 4.66 ± 1.11 | 0.0 ± 0.0 | 4.039 ± 1.156 | 2.485 ± 0.705 | 3.728 ± 1.574 | 3.728 ± 1.066 | 0.621 ± 0.319 | 4.349 ± 1.782 | 4.97 ± 0.964 | 5.902 ± 0.839 | 3.107 ± 0.846 | 1.553 ± 0.686 | 1.864 ± 0.661 | 4.66 ± 2.362 | 3.417 ± 0.91 | 4.039 ± 1.185 | 3.107 ± 0.858 | 6.834 ± 1.078 | 0.932 ± 0.561 | 3.417 ± 1.201 | 0.0 ± 0.0
Val | 4.97 ± 1.016 | 0.932 ± 0.532 | 2.796 ± 1.079 | 3.107 ± 1.238 | 1.864 ± 0.623 | 3.728 ± 0.837 | 0.311 ± 0.281 | 3.728 ± 0.988 | 4.97 ± 1.338 | 5.902 ± 1.314 | 1.553 ± 0.796 | 3.417 ± 1.169 | 1.864 ± 0.712 | 1.243 ± 0.629 | 1.553 ± 0.762 | 4.039 ± 1.48 | 5.902 ± 1.12 | 4.66 ± 0.987 | 0.621 ± 0.641 | 1.864 ± 0.97 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.621 ± 0.41 | 0.621 ± 0.483 | 0.311 ± 0.264 | 0.932 ± 0.485 | 0.621 ± 0.337 | 0.932 ± 0.582 | 1.243 ± 0.414 | 1.864 ± 0.792 | 0.311 ± 0.339 | 0.0 ± 0.0 | 0.311 ± 0.264 | 0.311 ± 0.224 | 0.311 ± 0.224 | 1.553 ± 0.514 | 0.621 ± 0.367 | 0.932 ± 0.561 | 0.311 ± 0.281 | 0.621 ± 0.337 | 0.0 ± 0.0
Tyr | 2.796 ± 1.643 | 0.311 ± 0.328 | 2.796 ± 0.894 | 3.107 ± 1.687 | 1.864 ± 0.694 | 3.417 ± 0.821 | 0.932 ± 0.607 | 4.039 ± 1.37 | 4.349 ± 1.462 | 4.039 ± 0.887 | 1.553 ± 0.812 | 1.553 ± 1.062 | 1.553 ± 0.77 | 1.553 ± 0.805 | 2.796 ± 1.072 | 3.107 ± 1.323 | 2.175 ± 0.741 | 2.175 ± 0.837 | 0.311 ± 0.264 | 1.243 ± 0.562 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 17 proteins (3220 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
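Bootstrapping at the protein level can be illustrated as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is reported as the error. The sketch below is an assumption about how such an estimate could be made, not the database's actual procedure; in particular, the function name `bootstrap_error`, the use of the sample standard deviation as the error, and the within-protein counting of pairs are hypothetical choices.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumptions: proteins are resampled with replacement n_boot times,
    and the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so a dipeptide concentrated in one protein yields a larger error than one spread evenly across the proteome.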

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski