Amino acid dipeptide frequency for Spironucleus salmonicida

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
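As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that counts are pooled over the whole proteome, and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The function name dipeptide_permille and the toy sequences are hypothetical. The page does not describe the exact counting procedure used by Proteome-pI.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted, and a pair never
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        s = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(s[i:i + 2] for i in range(len(s) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_permille(["MKKLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting overlapping pairs means a protein of length n contributes n - 1 dipeptides, so very short sequences contribute little or nothing.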

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
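The arithmetic behind the expected frequency can be checked with a short Python sketch. It assumes that "expected" means the uniform value of 1/400, expressed in per mille; the page does not state the exact threshold used for the red and blue marking, and the helper classify is hypothetical. The three example values are copied from the table.

```python
# Uniform expectation, in per mille, for 20 and for 21 (with Xaa) residue types.
expected_400 = 1000.0 / 20 ** 2
expected_441 = 1000.0 / 21 ** 2


def classify(value_permille, expected=expected_400):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values taken from the table (in per mille).
examples = {"LeuLeu": 9.805, "TrpTrp": 0.075, "AlaAsp": 2.521}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_400, round(expected_441, 3), labels)
```

A uniform baseline ignores the fact that single amino acids are themselves unequally frequent, so rare residues such as Trp will look underrepresented in almost every pair.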

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
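For readers who prefer fractions or percentages, the unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the example value 9.805 is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


leu_leu = 9.805  # LeuLeu frequency from the table, in per mille
print(permille_to_fraction(leu_leu), permille_to_percent(leu_leu))
```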

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.885 ± 0.079
AlaCys: 1.436 ± 0.035
AlaAsp: 2.521 ± 0.037
AlaGlu: 3.345 ± 0.054
AlaPhe: 2.551 ± 0.036
AlaGly: 2.74 ± 0.048
AlaHis: 0.88 ± 0.02
AlaIle: 3.845 ± 0.041
AlaLys: 3.482 ± 0.038
AlaLeu: 5.148 ± 0.065
AlaMet: 0.943 ± 0.019
AlaAsn: 2.525 ± 0.032
AlaPro: 1.971 ± 0.033
AlaGln: 3.717 ± 0.035
AlaArg: 2.231 ± 0.037
AlaSer: 3.665 ± 0.039
AlaThr: 2.568 ± 0.039
AlaVal: 3.408 ± 0.048
AlaTrp: 0.293 ± 0.009
AlaTyr: 1.573 ± 0.03
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.633 ± 0.042
CysCys: 0.44 ± 0.013
CysAsp: 1.376 ± 0.033
CysGlu: 1.42 ± 0.031
CysPhe: 1.024 ± 0.023
CysGly: 1.434 ± 0.04
CysHis: 0.331 ± 0.011
CysIle: 1.674 ± 0.036
CysLys: 1.719 ± 0.043
CysLeu: 1.974 ± 0.038
CysMet: 0.366 ± 0.011
CysAsn: 1.234 ± 0.034
CysPro: 0.833 ± 0.022
CysGln: 1.787 ± 0.032
CysArg: 0.842 ± 0.02
CysSer: 1.933 ± 0.043
CysThr: 1.556 ± 0.052
CysVal: 1.441 ± 0.035
CysTrp: 0.178 ± 0.009
CysTyr: 0.817 ± 0.021
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.53 ± 0.037
AspCys: 1.138 ± 0.027
AspAsp: 2.712 ± 0.034
AspGlu: 3.149 ± 0.039
AspPhe: 3.287 ± 0.036
AspGly: 2.269 ± 0.037
AspHis: 0.736 ± 0.016
AspIle: 4.539 ± 0.054
AspLys: 3.122 ± 0.038
AspLeu: 5.138 ± 0.046
AspMet: 1.083 ± 0.02
AspAsn: 2.756 ± 0.036
AspPro: 1.494 ± 0.025
AspGln: 3.594 ± 0.041
AspArg: 1.61 ± 0.029
AspSer: 3.532 ± 0.036
AspThr: 2.326 ± 0.033
AspVal: 2.774 ± 0.034
AspTrp: 0.328 ± 0.01
AspTyr: 1.866 ± 0.026
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.892 ± 0.045
GluCys: 1.155 ± 0.032
GluAsp: 2.782 ± 0.039
GluGlu: 3.651 ± 0.055
GluPhe: 3.107 ± 0.038
GluGly: 2.038 ± 0.029
GluHis: 0.919 ± 0.018
GluIle: 5.542 ± 0.055
GluLys: 4.551 ± 0.053
GluLeu: 5.85 ± 0.058
GluMet: 1.641 ± 0.025
GluAsn: 4.217 ± 0.04
GluPro: 1.331 ± 0.022
GluGln: 4.394 ± 0.054
GluArg: 2.137 ± 0.043
GluSer: 3.469 ± 0.033
GluThr: 2.72 ± 0.032
GluVal: 3.192 ± 0.033
GluTrp: 0.347 ± 0.013
GluTyr: 1.904 ± 0.027
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.778 ± 0.037
PheCys: 1.305 ± 0.024
PheAsp: 3.189 ± 0.037
PheGlu: 3.028 ± 0.037
PhePhe: 2.172 ± 0.028
PheGly: 2.418 ± 0.037
PheHis: 0.84 ± 0.017
PheIle: 4.123 ± 0.052
PheLys: 3.287 ± 0.033
PheLeu: 4.869 ± 0.048
PheMet: 1.013 ± 0.017
PheAsn: 3.239 ± 0.041
PhePro: 1.578 ± 0.019
PheGln: 3.947 ± 0.045
PheArg: 1.69 ± 0.027
PheSer: 4.496 ± 0.047
PheThr: 2.981 ± 0.035
PheVal: 2.863 ± 0.033
PheTrp: 0.375 ± 0.011
PheTyr: 2.112 ± 0.028
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.526 ± 0.044
GlyCys: 1.112 ± 0.025
GlyAsp: 2.075 ± 0.03
GlyGlu: 2.228 ± 0.034
GlyPhe: 2.301 ± 0.033
GlyGly: 2.297 ± 0.04
GlyHis: 0.682 ± 0.019
GlyIle: 3.239 ± 0.039
GlyLys: 2.894 ± 0.036
GlyLeu: 3.483 ± 0.044
GlyMet: 0.928 ± 0.016
GlyAsn: 2.168 ± 0.034
GlyPro: 1.179 ± 0.029
GlyGln: 2.644 ± 0.038
GlyArg: 1.781 ± 0.033
GlySer: 2.954 ± 0.041
GlyThr: 2.224 ± 0.041
GlyVal: 2.881 ± 0.038
GlyTrp: 0.339 ± 0.011
GlyTyr: 1.704 ± 0.028
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.969 ± 0.023
HisCys: 0.395 ± 0.012
HisAsp: 0.745 ± 0.017
HisGlu: 0.906 ± 0.021
HisPhe: 0.99 ± 0.019
HisGly: 0.738 ± 0.017
HisHis: 0.373 ± 0.013
HisIle: 1.4 ± 0.025
HisLys: 1.003 ± 0.022
HisLeu: 1.7 ± 0.025
HisMet: 0.275 ± 0.011
HisAsn: 0.879 ± 0.018
HisPro: 0.701 ± 0.018
HisGln: 1.361 ± 0.021
HisArg: 0.655 ± 0.018
HisSer: 1.28 ± 0.02
HisThr: 0.822 ± 0.016
HisVal: 0.911 ± 0.019
HisTrp: 0.126 ± 0.006
HisTyr: 0.671 ± 0.016
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.818 ± 0.039
IleCys: 1.915 ± 0.03
IleAsp: 4.412 ± 0.045
IleGlu: 4.562 ± 0.041
IlePhe: 4.439 ± 0.046
IleGly: 2.979 ± 0.036
IleHis: 1.276 ± 0.02
IleIle: 6.681 ± 0.078
IleLys: 5.481 ± 0.057
IleLeu: 8.141 ± 0.077
IleMet: 1.434 ± 0.021
IleAsn: 5.173 ± 0.065
IlePro: 2.851 ± 0.033
IleGln: 7.49 ± 0.082
IleArg: 2.533 ± 0.03
IleSer: 7.038 ± 0.062
IleThr: 4.054 ± 0.047
IleVal: 4.228 ± 0.04
IleTrp: 0.467 ± 0.012
IleTyr: 3.025 ± 0.046
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.047 ± 0.042
LysCys: 1.84 ± 0.052
LysAsp: 3.22 ± 0.035
LysGlu: 3.742 ± 0.047
LysPhe: 3.652 ± 0.04
LysGly: 2.097 ± 0.028
LysHis: 1.158 ± 0.018
LysIle: 6.305 ± 0.053
LysLys: 4.609 ± 0.056
LysLeu: 7.242 ± 0.052
LysMet: 1.83 ± 0.024
LysAsn: 4.424 ± 0.039
LysPro: 2.024 ± 0.03
LysGln: 6.292 ± 0.065
LysArg: 2.348 ± 0.029
LysSer: 4.872 ± 0.045
LysThr: 3.646 ± 0.033
LysVal: 3.372 ± 0.038
LysTrp: 0.396 ± 0.012
LysTyr: 2.709 ± 0.035
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.441 ± 0.061
LeuCys: 1.784 ± 0.028
LeuAsp: 4.989 ± 0.043
LeuGlu: 5.882 ± 0.054
LeuPhe: 4.578 ± 0.053
LeuGly: 3.731 ± 0.045
LeuHis: 1.784 ± 0.027
LeuIle: 7.292 ± 0.068
LeuLys: 7.276 ± 0.057
LeuLeu: 9.805 ± 0.08
LeuMet: 1.687 ± 0.024
LeuAsn: 6.081 ± 0.053
LeuPro: 3.788 ± 0.041
LeuGln: 8.662 ± 0.083
LeuArg: 3.848 ± 0.049
LeuSer: 7.687 ± 0.053
LeuThr: 5.385 ± 0.048
LeuVal: 5.423 ± 0.052
LeuTrp: 0.466 ± 0.013
LeuTyr: 3.109 ± 0.035
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.988 ± 0.021
MetCys: 0.348 ± 0.011
MetAsp: 0.98 ± 0.017
MetGlu: 1.132 ± 0.021
MetPhe: 0.878 ± 0.016
MetGly: 0.773 ± 0.019
MetHis: 0.41 ± 0.011
MetIle: 1.395 ± 0.024
MetLys: 1.553 ± 0.022
MetLeu: 2.013 ± 0.027
MetMet: 0.479 ± 0.014
MetAsn: 1.205 ± 0.02
MetPro: 0.785 ± 0.018
MetGln: 1.56 ± 0.023
MetArg: 0.9 ± 0.019
MetSer: 1.542 ± 0.024
MetThr: 1.062 ± 0.017
MetVal: 0.972 ± 0.019
MetTrp: 0.095 ± 0.006
MetTyr: 0.603 ± 0.015
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.716 ± 0.036
AsnCys: 1.847 ± 0.051
AsnAsp: 2.911 ± 0.032
AsnGlu: 3.238 ± 0.035
AsnPhe: 3.551 ± 0.041
AsnGly: 2.3 ± 0.04
AsnHis: 0.911 ± 0.018
AsnIle: 5.644 ± 0.075
AsnLys: 3.863 ± 0.04
AsnLeu: 6.119 ± 0.053
AsnMet: 1.108 ± 0.018
AsnAsn: 3.765 ± 0.051
AsnPro: 1.714 ± 0.027
AsnGln: 5.301 ± 0.067
AsnArg: 1.707 ± 0.027
AsnSer: 4.669 ± 0.048
AsnThr: 3.032 ± 0.032
AsnVal: 2.826 ± 0.028
AsnTrp: 0.379 ± 0.011
AsnTyr: 2.501 ± 0.033
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.956 ± 0.034
ProCys: 0.667 ± 0.021
ProAsp: 1.676 ± 0.026
ProGlu: 2.216 ± 0.034
ProPhe: 1.591 ± 0.025
ProGly: 1.631 ± 0.037
ProHis: 0.643 ± 0.016
ProIle: 2.513 ± 0.031
ProLys: 2.196 ± 0.029
ProLeu: 3.04 ± 0.035
ProMet: 0.493 ± 0.014
ProAsn: 1.77 ± 0.027
ProPro: 1.519 ± 0.042
ProGln: 2.655 ± 0.035
ProArg: 1.241 ± 0.026
ProSer: 2.528 ± 0.031
ProThr: 1.837 ± 0.027
ProVal: 2.059 ± 0.032
ProTrp: 0.199 ± 0.009
ProTyr: 1.002 ± 0.016
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.45 ± 0.045
GlnCys: 1.507 ± 0.035
GlnAsp: 3.478 ± 0.044
GlnGlu: 4.534 ± 0.048
GlnPhe: 4.544 ± 0.045
GlnGly: 2.382 ± 0.032
GlnHis: 1.507 ± 0.029
GlnIle: 7.79 ± 0.081
GlnLys: 6.326 ± 0.064
GlnLeu: 9.092 ± 0.077
GlnMet: 1.849 ± 0.022
GlnAsn: 5.96 ± 0.066
GlnPro: 2.402 ± 0.031
GlnGln: 8.421 ± 0.109
GlnArg: 2.81 ± 0.032
GlnSer: 5.352 ± 0.06
GlnThr: 4.078 ± 0.04
GlnVal: 4.086 ± 0.042
GlnTrp: 0.361 ± 0.012
GlnTyr: 2.967 ± 0.044
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.159 ± 0.04
ArgCys: 0.824 ± 0.02
ArgAsp: 1.663 ± 0.026
ArgGlu: 2.101 ± 0.04
ArgPhe: 1.741 ± 0.023
ArgGly: 1.632 ± 0.036
ArgHis: 0.671 ± 0.019
ArgIle: 2.71 ± 0.03
ArgLys: 2.603 ± 0.033
ArgLeu: 3.468 ± 0.04
ArgMet: 0.699 ± 0.015
ArgAsn: 2.016 ± 0.027
ArgPro: 1.332 ± 0.026
ArgGln: 2.845 ± 0.038
ArgArg: 1.795 ± 0.04
ArgSer: 2.533 ± 0.033
ArgThr: 1.733 ± 0.023
ArgVal: 2.073 ± 0.033
ArgTrp: 0.266 ± 0.01
ArgTyr: 1.262 ± 0.021
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.04 ± 0.047
SerCys: 1.942 ± 0.042
SerAsp: 3.544 ± 0.037
SerGlu: 4.075 ± 0.046
SerPhe: 3.831 ± 0.042
SerGly: 3.626 ± 0.049
SerHis: 1.206 ± 0.02
SerIle: 6.109 ± 0.064
SerLys: 4.885 ± 0.047
SerLeu: 7.284 ± 0.058
SerMet: 1.262 ± 0.02
SerAsn: 4.2 ± 0.044
SerPro: 2.519 ± 0.031
SerGln: 6.282 ± 0.063
SerArg: 2.64 ± 0.033
SerSer: 6.125 ± 0.054
SerThr: 3.984 ± 0.038
SerVal: 4.186 ± 0.039
SerTrp: 0.449 ± 0.013
SerTyr: 2.598 ± 0.037
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.005 ± 0.049
ThrCys: 1.879 ± 0.074
ThrAsp: 2.554 ± 0.033
ThrGlu: 2.87 ± 0.032
ThrPhe: 2.566 ± 0.031
ThrGly: 2.384 ± 0.033
ThrHis: 0.882 ± 0.016
ThrIle: 3.998 ± 0.04
ThrLys: 3.303 ± 0.035
ThrLeu: 4.82 ± 0.045
ThrMet: 0.83 ± 0.016
ThrAsn: 2.83 ± 0.037
ThrPro: 2.167 ± 0.031
ThrGln: 3.914 ± 0.038
ThrArg: 1.746 ± 0.028
ThrSer: 3.916 ± 0.043
ThrThr: 2.837 ± 0.037
ThrVal: 2.863 ± 0.031
ThrTrp: 0.266 ± 0.01
ThrTyr: 1.618 ± 0.025
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.837 ± 0.043
ValCys: 1.421 ± 0.032
ValAsp: 2.852 ± 0.03
ValGlu: 3.376 ± 0.043
ValPhe: 3.013 ± 0.034
ValGly: 2.451 ± 0.04
ValHis: 0.951 ± 0.018
ValIle: 4.005 ± 0.046
ValLys: 3.773 ± 0.038
ValLeu: 5.439 ± 0.059
ValMet: 1.012 ± 0.019
ValAsn: 3.046 ± 0.033
ValPro: 2.007 ± 0.032
ValGln: 4.549 ± 0.044
ValArg: 2.058 ± 0.036
ValSer: 4.141 ± 0.037
ValThr: 2.416 ± 0.034
ValVal: 3.372 ± 0.046
ValTrp: 0.356 ± 0.012
ValTyr: 1.924 ± 0.027
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.36 ± 0.013
TrpCys: 0.14 ± 0.007
TrpAsp: 0.305 ± 0.01
TrpGlu: 0.302 ± 0.01
TrpPhe: 0.295 ± 0.01
TrpGly: 0.258 ± 0.011
TrpHis: 0.099 ± 0.006
TrpIle: 0.44 ± 0.013
TrpLys: 0.485 ± 0.014
TrpLeu: 0.534 ± 0.012
TrpMet: 0.166 ± 0.008
TrpAsn: 0.334 ± 0.011
TrpPro: 0.196 ± 0.009
TrpGln: 0.409 ± 0.011
TrpArg: 0.308 ± 0.01
TrpSer: 0.447 ± 0.013
TrpThr: 0.282 ± 0.01
TrpVal: 0.345 ± 0.01
TrpTrp: 0.075 ± 0.006
TrpTyr: 0.217 ± 0.008
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.697 ± 0.023
TyrCys: 0.918 ± 0.022
TyrAsp: 1.998 ± 0.029
TyrGlu: 2.083 ± 0.028
TyrPhe: 2.193 ± 0.033
TyrGly: 1.462 ± 0.027
TyrHis: 0.64 ± 0.017
TyrIle: 2.754 ± 0.038
TyrLys: 2.413 ± 0.03
TyrLeu: 3.539 ± 0.044
TyrMet: 0.574 ± 0.014
TyrAsn: 2.279 ± 0.035
TyrPro: 1.041 ± 0.02
TyrGln: 3.074 ± 0.042
TyrArg: 1.247 ± 0.021
TyrSer: 2.604 ± 0.034
TyrThr: 1.707 ± 0.028
TyrVal: 1.723 ± 0.025
TyrTrp: 0.235 ± 0.009
TyrTyr: 1.625 ± 0.028
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.006 ± 0.003

Statistics based on 8098 proteins (3053270 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
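A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The page does not say which spread statistic Proteome-pI reports, and the names pair_counts and bootstrap_error, as well as the toy sequences, are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so the resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MKKLA", "MKLLA", "AAKKL", "LLLLL"]
mean_kk, err_kk = bootstrap_error(toy, "KK")
print(round(mean_kk, 3), round(err_kk, 3))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, which matters when a few long or repetitive proteins dominate the count for a given dipeptide.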

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski