Amino acid dipepetide frequency for Sunxiuqinia elliptica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.985AlaAla: 4.985 ± 0.071
0.765AlaCys: 0.765 ± 0.023
3.945AlaAsp: 3.945 ± 0.058
4.582AlaGlu: 4.582 ± 0.058
3.547AlaPhe: 3.547 ± 0.049
5.097AlaGly: 5.097 ± 0.073
1.214AlaHis: 1.214 ± 0.027
5.133AlaIle: 5.133 ± 0.065
4.031AlaLys: 4.031 ± 0.055
6.466AlaLeu: 6.466 ± 0.081
1.695AlaMet: 1.695 ± 0.035
3.406AlaAsn: 3.406 ± 0.051
2.285AlaPro: 2.285 ± 0.046
2.792AlaGln: 2.792 ± 0.048
2.674AlaArg: 2.674 ± 0.044
4.532AlaSer: 4.532 ± 0.058
3.5AlaThr: 3.5 ± 0.071
4.485AlaVal: 4.485 ± 0.057
0.895AlaTrp: 0.895 ± 0.024
2.865AlaTyr: 2.865 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
0.494CysAla: 0.494 ± 0.018
0.124CysCys: 0.124 ± 0.01
0.455CysAsp: 0.455 ± 0.021
0.528CysGlu: 0.528 ± 0.026
0.406CysPhe: 0.406 ± 0.016
0.653CysGly: 0.653 ± 0.023
0.23CysHis: 0.23 ± 0.017
0.514CysIle: 0.514 ± 0.017
0.447CysLys: 0.447 ± 0.017
0.78CysLeu: 0.78 ± 0.025
0.179CysMet: 0.179 ± 0.011
0.366CysAsn: 0.366 ± 0.016
0.406CysPro: 0.406 ± 0.021
0.361CysGln: 0.361 ± 0.016
0.357CysArg: 0.357 ± 0.016
0.616CysSer: 0.616 ± 0.024
0.408CysThr: 0.408 ± 0.016
0.446CysVal: 0.446 ± 0.017
0.14CysTrp: 0.14 ± 0.023
0.324CysTyr: 0.324 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.554AspAla: 3.554 ± 0.059
0.454AspCys: 0.454 ± 0.02
2.944AspAsp: 2.944 ± 0.054
4.17AspGlu: 4.17 ± 0.059
3.389AspPhe: 3.389 ± 0.049
4.143AspGly: 4.143 ± 0.068
1.091AspHis: 1.091 ± 0.027
3.577AspIle: 3.577 ± 0.053
3.162AspLys: 3.162 ± 0.049
5.571AspLeu: 5.571 ± 0.061
1.202AspMet: 1.202 ± 0.03
2.538AspAsn: 2.538 ± 0.043
2.25AspPro: 2.25 ± 0.047
2.478AspGln: 2.478 ± 0.041
2.351AspArg: 2.351 ± 0.044
2.91AspSer: 2.91 ± 0.051
2.222AspThr: 2.222 ± 0.044
3.473AspVal: 3.473 ± 0.056
0.962AspTrp: 0.962 ± 0.027
2.815AspTyr: 2.815 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
5.051GluAla: 5.051 ± 0.053
0.39GluCys: 0.39 ± 0.017
3.399GluAsp: 3.399 ± 0.054
5.239GluGlu: 5.239 ± 0.067
2.782GluPhe: 2.782 ± 0.045
4.217GluGly: 4.217 ± 0.056
1.296GluHis: 1.296 ± 0.033
4.828GluIle: 4.828 ± 0.063
5.582GluLys: 5.582 ± 0.072
7.303GluLeu: 7.303 ± 0.077
1.826GluMet: 1.826 ± 0.036
3.913GluAsn: 3.913 ± 0.05
1.836GluPro: 1.836 ± 0.043
3.21GluGln: 3.21 ± 0.056
2.92GluArg: 2.92 ± 0.047
3.553GluSer: 3.553 ± 0.05
3.37GluThr: 3.37 ± 0.049
4.66GluVal: 4.66 ± 0.057
0.844GluTrp: 0.844 ± 0.024
2.3GluTyr: 2.3 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
3.109PheAla: 3.109 ± 0.052
0.427PheCys: 0.427 ± 0.02
3.241PheAsp: 3.241 ± 0.048
3.32PheGlu: 3.32 ± 0.048
2.691PhePhe: 2.691 ± 0.056
3.393PheGly: 3.393 ± 0.056
0.917PheHis: 0.917 ± 0.029
3.329PheIle: 3.329 ± 0.052
2.896PheLys: 2.896 ± 0.044
4.574PheLeu: 4.574 ± 0.072
1.168PheMet: 1.168 ± 0.029
2.729PheAsn: 2.729 ± 0.047
1.877PhePro: 1.877 ± 0.037
1.819PheGln: 1.819 ± 0.037
2.083PheArg: 2.083 ± 0.037
3.953PheSer: 3.953 ± 0.056
2.506PheThr: 2.506 ± 0.047
3.175PheVal: 3.175 ± 0.051
0.71PheTrp: 0.71 ± 0.024
2.1PheTyr: 2.1 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
4.553GlyAla: 4.553 ± 0.069
0.659GlyCys: 0.659 ± 0.025
3.45GlyAsp: 3.45 ± 0.059
4.293GlyGlu: 4.293 ± 0.063
3.575GlyPhe: 3.575 ± 0.05
4.782GlyGly: 4.782 ± 0.085
1.299GlyHis: 1.299 ± 0.031
5.355GlyIle: 5.355 ± 0.068
4.893GlyLys: 4.893 ± 0.07
6.24GlyLeu: 6.24 ± 0.063
1.867GlyMet: 1.867 ± 0.038
3.465GlyAsn: 3.465 ± 0.053
1.577GlyPro: 1.577 ± 0.037
2.508GlyGln: 2.508 ± 0.048
2.571GlyArg: 2.571 ± 0.05
4.197GlySer: 4.197 ± 0.071
4.031GlyThr: 4.031 ± 0.08
4.785GlyVal: 4.785 ± 0.064
1.081GlyTrp: 1.081 ± 0.032
3.01GlyTyr: 3.01 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.215HisAla: 1.215 ± 0.029
0.21HisCys: 0.21 ± 0.011
0.97HisAsp: 0.97 ± 0.027
1.162HisGlu: 1.162 ± 0.031
1.17HisPhe: 1.17 ± 0.031
1.28HisGly: 1.28 ± 0.027
0.542HisHis: 0.542 ± 0.02
1.23HisIle: 1.23 ± 0.03
0.985HisLys: 0.985 ± 0.028
2.002HisLeu: 2.002 ± 0.041
0.377HisMet: 0.377 ± 0.014
0.808HisAsn: 0.808 ± 0.024
1.127HisPro: 1.127 ± 0.031
0.964HisGln: 0.964 ± 0.026
0.856HisArg: 0.856 ± 0.024
1.158HisSer: 1.158 ± 0.027
0.888HisThr: 0.888 ± 0.024
1.084HisVal: 1.084 ± 0.024
0.328HisTrp: 0.328 ± 0.017
0.903HisTyr: 0.903 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.147IleAla: 5.147 ± 0.071
0.632IleCys: 0.632 ± 0.022
4.364IleAsp: 4.364 ± 0.048
4.823IleGlu: 4.823 ± 0.063
3.015IlePhe: 3.015 ± 0.054
4.756IleGly: 4.756 ± 0.062
1.39IleHis: 1.39 ± 0.028
4.47IleIle: 4.47 ± 0.07
4.097IleLys: 4.097 ± 0.059
6.111IleLeu: 6.111 ± 0.076
1.277IleMet: 1.277 ± 0.029
3.606IleAsn: 3.606 ± 0.049
3.097IlePro: 3.097 ± 0.054
2.698IleGln: 2.698 ± 0.043
3.236IleArg: 3.236 ± 0.045
4.849IleSer: 4.849 ± 0.065
3.791IleThr: 3.791 ± 0.067
4.092IleVal: 4.092 ± 0.06
0.762IleTrp: 0.762 ± 0.024
2.49IleTyr: 2.49 ± 0.039
0.0IleXaa: 0.0 ± 0.0
Lys
4.893LysAla: 4.893 ± 0.067
0.347LysCys: 0.347 ± 0.017
3.589LysAsp: 3.589 ± 0.058
5.091LysGlu: 5.091 ± 0.078
2.244LysPhe: 2.244 ± 0.046
4.041LysGly: 4.041 ± 0.058
1.389LysHis: 1.389 ± 0.034
4.436LysIle: 4.436 ± 0.057
5.0LysLys: 5.0 ± 0.077
6.275LysLeu: 6.275 ± 0.067
1.744LysMet: 1.744 ± 0.035
3.688LysAsn: 3.688 ± 0.054
2.298LysPro: 2.298 ± 0.036
3.065LysGln: 3.065 ± 0.056
3.091LysArg: 3.091 ± 0.044
3.769LysSer: 3.769 ± 0.052
3.66LysThr: 3.66 ± 0.05
4.176LysVal: 4.176 ± 0.065
0.788LysTrp: 0.788 ± 0.023
2.61LysTyr: 2.61 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
7.074LeuAla: 7.074 ± 0.074
0.729LeuCys: 0.729 ± 0.022
5.056LeuAsp: 5.056 ± 0.056
6.169LeuGlu: 6.169 ± 0.071
5.124LeuPhe: 5.124 ± 0.061
6.126LeuGly: 6.126 ± 0.067
1.641LeuHis: 1.641 ± 0.034
6.586LeuIle: 6.586 ± 0.088
7.26LeuLys: 7.26 ± 0.084
9.659LeuLeu: 9.659 ± 0.123
2.228LeuMet: 2.228 ± 0.046
5.339LeuAsn: 5.339 ± 0.062
4.025LeuPro: 4.025 ± 0.054
3.552LeuGln: 3.552 ± 0.057
3.682LeuArg: 3.682 ± 0.057
6.842LeuSer: 6.842 ± 0.087
5.202LeuThr: 5.202 ± 0.062
6.316LeuVal: 6.316 ± 0.073
0.984LeuTrp: 0.984 ± 0.026
3.148LeuTyr: 3.148 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
1.799MetAla: 1.799 ± 0.039
0.132MetCys: 0.132 ± 0.009
1.266MetAsp: 1.266 ± 0.032
1.566MetGlu: 1.566 ± 0.033
0.793MetPhe: 0.793 ± 0.025
1.682MetGly: 1.682 ± 0.037
0.419MetHis: 0.419 ± 0.015
1.447MetIle: 1.447 ± 0.036
2.191MetLys: 2.191 ± 0.04
2.174MetLeu: 2.174 ± 0.041
0.585MetMet: 0.585 ± 0.022
1.385MetAsn: 1.385 ± 0.028
0.972MetPro: 0.972 ± 0.024
0.951MetGln: 0.951 ± 0.028
0.998MetArg: 0.998 ± 0.03
1.327MetSer: 1.327 ± 0.029
1.242MetThr: 1.242 ± 0.028
1.524MetVal: 1.524 ± 0.035
0.19MetTrp: 0.19 ± 0.012
0.641MetTyr: 0.641 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
3.071AsnAla: 3.071 ± 0.041
0.4AsnCys: 0.4 ± 0.019
2.792AsnAsp: 2.792 ± 0.044
3.35AsnGlu: 3.35 ± 0.048
2.472AsnPhe: 2.472 ± 0.043
3.662AsnGly: 3.662 ± 0.052
1.037AsnHis: 1.037 ± 0.03
3.452AsnIle: 3.452 ± 0.054
3.261AsnLys: 3.261 ± 0.049
4.911AsnLeu: 4.911 ± 0.061
1.192AsnMet: 1.192 ± 0.029
2.676AsnAsn: 2.676 ± 0.051
2.606AsnPro: 2.606 ± 0.045
2.475AsnGln: 2.475 ± 0.046
2.369AsnArg: 2.369 ± 0.047
3.311AsnSer: 3.311 ± 0.063
2.73AsnThr: 2.73 ± 0.054
2.786AsnVal: 2.786 ± 0.05
0.834AsnTrp: 0.834 ± 0.027
2.686AsnTyr: 2.686 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.684ProAla: 2.684 ± 0.049
0.242ProCys: 0.242 ± 0.012
2.551ProAsp: 2.551 ± 0.043
3.603ProGlu: 3.603 ± 0.048
1.887ProPhe: 1.887 ± 0.038
2.666ProGly: 2.666 ± 0.048
0.718ProHis: 0.718 ± 0.023
2.396ProIle: 2.396 ± 0.038
2.141ProLys: 2.141 ± 0.041
3.173ProLeu: 3.173 ± 0.05
0.818ProMet: 0.818 ± 0.02
1.945ProAsn: 1.945 ± 0.035
0.926ProPro: 0.926 ± 0.03
1.372ProGln: 1.372 ± 0.032
1.228ProArg: 1.228 ± 0.03
2.274ProSer: 2.274 ± 0.037
1.992ProThr: 1.992 ± 0.055
3.053ProVal: 3.053 ± 0.059
0.475ProTrp: 0.475 ± 0.019
1.507ProTyr: 1.507 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.134GlnAla: 3.134 ± 0.057
0.219GlnCys: 0.219 ± 0.011
1.777GlnAsp: 1.777 ± 0.039
2.942GlnGlu: 2.942 ± 0.047
1.91GlnPhe: 1.91 ± 0.04
2.269GlnGly: 2.269 ± 0.035
0.802GlnHis: 0.802 ± 0.023
2.713GlnIle: 2.713 ± 0.043
3.005GlnLys: 3.005 ± 0.053
4.689GlnLeu: 4.689 ± 0.073
0.921GlnMet: 0.921 ± 0.028
2.054GlnAsn: 2.054 ± 0.038
1.391GlnPro: 1.391 ± 0.032
2.255GlnGln: 2.255 ± 0.055
1.631GlnArg: 1.631 ± 0.034
2.276GlnSer: 2.276 ± 0.042
2.253GlnThr: 2.253 ± 0.039
2.843GlnVal: 2.843 ± 0.052
0.508GlnTrp: 0.508 ± 0.019
1.363GlnTyr: 1.363 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.431ArgAla: 2.431 ± 0.047
0.286ArgCys: 0.286 ± 0.015
2.03ArgAsp: 2.03 ± 0.037
2.948ArgGlu: 2.948 ± 0.049
2.415ArgPhe: 2.415 ± 0.046
2.312ArgGly: 2.312 ± 0.05
0.724ArgHis: 0.724 ± 0.021
3.195ArgIle: 3.195 ± 0.048
3.222ArgLys: 3.222 ± 0.05
4.196ArgLeu: 4.196 ± 0.065
1.176ArgMet: 1.176 ± 0.03
2.246ArgAsn: 2.246 ± 0.04
1.367ArgPro: 1.367 ± 0.033
1.634ArgGln: 1.634 ± 0.033
1.755ArgArg: 1.755 ± 0.037
2.338ArgSer: 2.338 ± 0.043
2.096ArgThr: 2.096 ± 0.035
2.559ArgVal: 2.559 ± 0.043
0.635ArgTrp: 0.635 ± 0.023
1.955ArgTyr: 1.955 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
3.95SerAla: 3.95 ± 0.053
0.672SerCys: 0.672 ± 0.022
3.283SerAsp: 3.283 ± 0.044
3.89SerGlu: 3.89 ± 0.057
3.806SerPhe: 3.806 ± 0.054
4.868SerGly: 4.868 ± 0.073
1.168SerHis: 1.168 ± 0.031
4.495SerIle: 4.495 ± 0.058
3.845SerLys: 3.845 ± 0.054
6.256SerLeu: 6.256 ± 0.071
1.461SerMet: 1.461 ± 0.033
3.04SerAsn: 3.04 ± 0.056
2.404SerPro: 2.404 ± 0.041
2.38SerGln: 2.38 ± 0.045
2.579SerArg: 2.579 ± 0.048
4.524SerSer: 4.524 ± 0.083
3.096SerThr: 3.096 ± 0.051
4.097SerVal: 4.097 ± 0.061
1.02SerTrp: 1.02 ± 0.031
2.897SerTyr: 2.897 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
3.678ThrAla: 3.678 ± 0.063
0.397ThrCys: 0.397 ± 0.017
3.189ThrAsp: 3.189 ± 0.062
3.103ThrGlu: 3.103 ± 0.043
2.486ThrPhe: 2.486 ± 0.04
4.111ThrGly: 4.111 ± 0.071
0.974ThrHis: 0.974 ± 0.023
4.105ThrIle: 4.105 ± 0.066
2.903ThrLys: 2.903 ± 0.043
4.621ThrLeu: 4.621 ± 0.067
1.035ThrMet: 1.035 ± 0.025
2.695ThrAsn: 2.695 ± 0.052
2.516ThrPro: 2.516 ± 0.042
1.927ThrGln: 1.927 ± 0.034
1.92ThrArg: 1.92 ± 0.033
3.374ThrSer: 3.374 ± 0.064
3.131ThrThr: 3.131 ± 0.063
3.481ThrVal: 3.481 ± 0.069
0.596ThrTrp: 0.596 ± 0.022
2.105ThrTyr: 2.105 ± 0.046
0.0ThrXaa: 0.0 ± 0.0
Val
4.624ValAla: 4.624 ± 0.06
0.625ValCys: 0.625 ± 0.026
3.906ValAsp: 3.906 ± 0.057
4.391ValGlu: 4.391 ± 0.062
3.358ValPhe: 3.358 ± 0.049
4.216ValGly: 4.216 ± 0.059
1.163ValHis: 1.163 ± 0.029
4.434ValIle: 4.434 ± 0.06
4.097ValLys: 4.097 ± 0.055
6.3ValLeu: 6.3 ± 0.061
1.362ValMet: 1.362 ± 0.036
3.344ValAsn: 3.344 ± 0.052
2.587ValPro: 2.587 ± 0.04
1.979ValGln: 1.979 ± 0.039
2.638ValArg: 2.638 ± 0.039
4.389ValSer: 4.389 ± 0.061
3.346ValThr: 3.346 ± 0.06
4.748ValVal: 4.748 ± 0.072
0.792ValTrp: 0.792 ± 0.023
2.301ValTyr: 2.301 ± 0.042
0.0ValXaa: 0.0 ± 0.0
Trp
0.802TrpAla: 0.802 ± 0.024
0.12TrpCys: 0.12 ± 0.011
0.785TrpAsp: 0.785 ± 0.025
0.941TrpGlu: 0.941 ± 0.028
0.618TrpPhe: 0.618 ± 0.021
1.061TrpGly: 1.061 ± 0.029
0.306TrpHis: 0.306 ± 0.014
0.832TrpIle: 0.832 ± 0.023
0.908TrpLys: 0.908 ± 0.027
1.313TrpLeu: 1.313 ± 0.036
0.361TrpMet: 0.361 ± 0.015
0.743TrpAsn: 0.743 ± 0.024
0.369TrpPro: 0.369 ± 0.015
0.596TrpGln: 0.596 ± 0.025
0.604TrpArg: 0.604 ± 0.023
0.826TrpSer: 0.826 ± 0.029
0.692TrpThr: 0.692 ± 0.025
0.815TrpVal: 0.815 ± 0.023
0.223TrpTrp: 0.223 ± 0.012
0.498TrpTyr: 0.498 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.645TyrAla: 2.645 ± 0.05
0.404TyrCys: 0.404 ± 0.015
2.263TyrAsp: 2.263 ± 0.046
2.407TyrGlu: 2.407 ± 0.035
2.329TyrPhe: 2.329 ± 0.042
2.858TyrGly: 2.858 ± 0.048
0.937TyrHis: 0.937 ± 0.027
2.176TyrIle: 2.176 ± 0.044
2.269TyrLys: 2.269 ± 0.04
4.125TyrLeu: 4.125 ± 0.059
0.81TyrMet: 0.81 ± 0.024
2.087TyrAsn: 2.087 ± 0.048
1.736TyrPro: 1.736 ± 0.041
1.938TyrGln: 1.938 ± 0.043
1.978TyrArg: 1.978 ± 0.037
2.713TyrSer: 2.713 ± 0.044
2.192TyrThr: 2.192 ± 0.045
2.002TyrVal: 2.002 ± 0.043
0.649TyrTrp: 0.649 ± 0.031
1.894TyrTyr: 1.894 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4090 proteins (1510668 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski