Amino acid dipepetide frequency for Terricaulis silvestris

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.882AlaAla: 19.882 ± 0.217
1.261AlaCys: 1.261 ± 0.034
7.332AlaAsp: 7.332 ± 0.073
8.178AlaGlu: 8.178 ± 0.095
5.232AlaPhe: 5.232 ± 0.082
10.516AlaGly: 10.516 ± 0.103
2.594AlaHis: 2.594 ± 0.054
6.661AlaIle: 6.661 ± 0.09
4.217AlaLys: 4.217 ± 0.083
14.928AlaLeu: 14.928 ± 0.162
3.631AlaMet: 3.631 ± 0.055
3.473AlaAsn: 3.473 ± 0.051
7.755AlaPro: 7.755 ± 0.11
5.007AlaGln: 5.007 ± 0.075
10.452AlaArg: 10.452 ± 0.13
7.025AlaSer: 7.025 ± 0.09
6.326AlaThr: 6.326 ± 0.083
8.66AlaVal: 8.66 ± 0.095
1.82AlaTrp: 1.82 ± 0.048
2.9AlaTyr: 2.9 ± 0.049
0.0AlaXaa: 0.0 ± 0.0
Cys
1.284CysAla: 1.284 ± 0.038
0.088CysCys: 0.088 ± 0.01
0.519CysAsp: 0.519 ± 0.022
0.43CysGlu: 0.43 ± 0.018
0.28CysPhe: 0.28 ± 0.016
0.868CysGly: 0.868 ± 0.03
0.16CysHis: 0.16 ± 0.011
0.355CysIle: 0.355 ± 0.018
0.186CysLys: 0.186 ± 0.013
0.656CysLeu: 0.656 ± 0.026
0.154CysMet: 0.154 ± 0.011
0.211CysAsn: 0.211 ± 0.015
0.387CysPro: 0.387 ± 0.018
0.208CysGln: 0.208 ± 0.014
0.493CysArg: 0.493 ± 0.021
0.433CysSer: 0.433 ± 0.022
0.43CysThr: 0.43 ± 0.019
0.675CysVal: 0.675 ± 0.026
0.108CysTrp: 0.108 ± 0.009
0.171CysTyr: 0.171 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
8.259AspAla: 8.259 ± 0.087
0.443AspCys: 0.443 ± 0.022
3.376AspAsp: 3.376 ± 0.061
3.625AspGlu: 3.625 ± 0.055
2.217AspPhe: 2.217 ± 0.043
5.224AspGly: 5.224 ± 0.081
1.121AspHis: 1.121 ± 0.033
2.764AspIle: 2.764 ± 0.05
1.432AspLys: 1.432 ± 0.04
5.591AspLeu: 5.591 ± 0.078
1.214AspMet: 1.214 ± 0.033
1.303AspAsn: 1.303 ± 0.035
3.448AspPro: 3.448 ± 0.064
1.722AspGln: 1.722 ± 0.036
4.153AspArg: 4.153 ± 0.071
2.234AspSer: 2.234 ± 0.045
2.551AspThr: 2.551 ± 0.057
4.47AspVal: 4.47 ± 0.061
1.036AspTrp: 1.036 ± 0.034
1.468AspTyr: 1.468 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
8.455GluAla: 8.455 ± 0.09
0.373GluCys: 0.373 ± 0.018
3.008GluAsp: 3.008 ± 0.049
3.404GluGlu: 3.404 ± 0.066
1.868GluPhe: 1.868 ± 0.041
4.705GluGly: 4.705 ± 0.063
1.244GluHis: 1.244 ± 0.04
3.438GluIle: 3.438 ± 0.059
1.845GluLys: 1.845 ± 0.046
5.377GluLeu: 5.377 ± 0.071
1.467GluMet: 1.467 ± 0.036
1.572GluAsn: 1.572 ± 0.04
2.868GluPro: 2.868 ± 0.061
2.231GluGln: 2.231 ± 0.051
5.692GluArg: 5.692 ± 0.078
2.637GluSer: 2.637 ± 0.048
3.663GluThr: 3.663 ± 0.057
3.567GluVal: 3.567 ± 0.059
0.802GluTrp: 0.802 ± 0.026
1.031GluTyr: 1.031 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
5.419PheAla: 5.419 ± 0.082
0.394PheCys: 0.394 ± 0.02
2.739PheAsp: 2.739 ± 0.052
2.412PheGlu: 2.412 ± 0.05
1.436PhePhe: 1.436 ± 0.044
3.691PheGly: 3.691 ± 0.06
0.696PheHis: 0.696 ± 0.023
1.671PheIle: 1.671 ± 0.039
0.997PheLys: 0.997 ± 0.029
3.089PheLeu: 3.089 ± 0.059
0.775PheMet: 0.775 ± 0.027
1.202PheAsn: 1.202 ± 0.032
1.498PhePro: 1.498 ± 0.037
0.962PheGln: 0.962 ± 0.029
2.21PheArg: 2.21 ± 0.049
2.142PheSer: 2.142 ± 0.043
2.117PheThr: 2.117 ± 0.041
2.906PheVal: 2.906 ± 0.058
0.576PheTrp: 0.576 ± 0.025
0.939PheTyr: 0.939 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
12.043GlyAla: 12.043 ± 0.142
0.718GlyCys: 0.718 ± 0.026
4.907GlyAsp: 4.907 ± 0.074
5.194GlyGlu: 5.194 ± 0.068
3.566GlyPhe: 3.566 ± 0.063
8.052GlyGly: 8.052 ± 0.124
1.526GlyHis: 1.526 ± 0.038
3.474GlyIle: 3.474 ± 0.065
2.646GlyLys: 2.646 ± 0.059
7.843GlyLeu: 7.843 ± 0.097
2.007GlyMet: 2.007 ± 0.04
1.938GlyAsn: 1.938 ± 0.05
3.648GlyPro: 3.648 ± 0.056
2.732GlyGln: 2.732 ± 0.051
6.175GlyArg: 6.175 ± 0.076
4.042GlySer: 4.042 ± 0.063
3.173GlyThr: 3.173 ± 0.066
7.358GlyVal: 7.358 ± 0.079
1.403GlyTrp: 1.403 ± 0.04
2.179GlyTyr: 2.179 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.446HisAla: 2.446 ± 0.048
0.199HisCys: 0.199 ± 0.012
1.135HisAsp: 1.135 ± 0.029
1.096HisGlu: 1.096 ± 0.031
0.766HisPhe: 0.766 ± 0.027
1.82HisGly: 1.82 ± 0.047
0.482HisHis: 0.482 ± 0.023
0.804HisIle: 0.804 ± 0.033
0.484HisLys: 0.484 ± 0.025
1.744HisLeu: 1.744 ± 0.039
0.434HisMet: 0.434 ± 0.016
0.471HisAsn: 0.471 ± 0.018
1.138HisPro: 1.138 ± 0.034
0.548HisGln: 0.548 ± 0.021
1.28HisArg: 1.28 ± 0.036
0.836HisSer: 0.836 ± 0.03
0.822HisThr: 0.822 ± 0.028
1.454HisVal: 1.454 ± 0.036
0.326HisTrp: 0.326 ± 0.015
0.536HisTyr: 0.536 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
7.843IleAla: 7.843 ± 0.095
0.472IleCys: 0.472 ± 0.02
3.547IleAsp: 3.547 ± 0.053
3.685IleGlu: 3.685 ± 0.068
1.557IlePhe: 1.557 ± 0.043
4.773IleGly: 4.773 ± 0.063
0.777IleHis: 0.777 ± 0.029
2.133IleIle: 2.133 ± 0.048
1.243IleLys: 1.243 ± 0.04
3.635IleLeu: 3.635 ± 0.061
0.887IleMet: 0.887 ± 0.028
1.363IleAsn: 1.363 ± 0.036
2.018IlePro: 2.018 ± 0.044
1.208IleGln: 1.208 ± 0.029
2.809IleArg: 2.809 ± 0.05
2.703IleSer: 2.703 ± 0.047
2.62IleThr: 2.62 ± 0.043
4.293IleVal: 4.293 ± 0.066
0.66IleTrp: 0.66 ± 0.024
1.083IleTyr: 1.083 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
3.653LysAla: 3.653 ± 0.072
0.155LysCys: 0.155 ± 0.013
1.472LysAsp: 1.472 ± 0.046
1.39LysGlu: 1.39 ± 0.04
0.903LysPhe: 0.903 ± 0.031
2.112LysGly: 2.112 ± 0.048
0.658LysHis: 0.658 ± 0.027
1.395LysIle: 1.395 ± 0.041
1.331LysLys: 1.331 ± 0.045
3.192LysLeu: 3.192 ± 0.062
0.585LysMet: 0.585 ± 0.022
0.758LysAsn: 0.758 ± 0.024
1.943LysPro: 1.943 ± 0.049
1.023LysGln: 1.023 ± 0.029
2.652LysArg: 2.652 ± 0.056
1.585LysSer: 1.585 ± 0.038
1.608LysThr: 1.608 ± 0.041
1.824LysVal: 1.824 ± 0.048
0.338LysTrp: 0.338 ± 0.017
0.559LysTyr: 0.559 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
13.506LeuAla: 13.506 ± 0.138
0.806LeuCys: 0.806 ± 0.027
5.661LeuAsp: 5.661 ± 0.063
5.383LeuGlu: 5.383 ± 0.072
3.448LeuPhe: 3.448 ± 0.064
7.738LeuGly: 7.738 ± 0.089
1.672LeuHis: 1.672 ± 0.041
5.004LeuIle: 5.004 ± 0.067
3.198LeuLys: 3.198 ± 0.057
8.281LeuLeu: 8.281 ± 0.118
2.206LeuMet: 2.206 ± 0.044
2.779LeuAsn: 2.779 ± 0.047
4.649LeuPro: 4.649 ± 0.066
2.776LeuGln: 2.776 ± 0.043
7.015LeuArg: 7.015 ± 0.087
5.792LeuSer: 5.792 ± 0.075
5.472LeuThr: 5.472 ± 0.07
6.631LeuVal: 6.631 ± 0.084
1.175LeuTrp: 1.175 ± 0.033
2.013LeuTyr: 2.013 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
2.984MetAla: 2.984 ± 0.055
0.148MetCys: 0.148 ± 0.011
1.073MetAsp: 1.073 ± 0.03
1.008MetGlu: 1.008 ± 0.031
0.799MetPhe: 0.799 ± 0.029
1.745MetGly: 1.745 ± 0.039
0.445MetHis: 0.445 ± 0.02
1.238MetIle: 1.238 ± 0.035
0.872MetLys: 0.872 ± 0.029
2.295MetLeu: 2.295 ± 0.046
0.587MetMet: 0.587 ± 0.024
0.733MetAsn: 0.733 ± 0.024
1.263MetPro: 1.263 ± 0.037
0.791MetGln: 0.791 ± 0.027
2.098MetArg: 2.098 ± 0.039
1.71MetSer: 1.71 ± 0.038
1.66MetThr: 1.66 ± 0.04
1.32MetVal: 1.32 ± 0.027
0.244MetTrp: 0.244 ± 0.014
0.276MetTyr: 0.276 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
4.021AsnAla: 4.021 ± 0.059
0.218AsnCys: 0.218 ± 0.014
1.528AsnAsp: 1.528 ± 0.042
1.448AsnGlu: 1.448 ± 0.034
0.965AsnPhe: 0.965 ± 0.031
2.53AsnGly: 2.53 ± 0.062
0.448AsnHis: 0.448 ± 0.018
1.306AsnIle: 1.306 ± 0.037
0.59AsnLys: 0.59 ± 0.021
2.43AsnLeu: 2.43 ± 0.047
0.533AsnMet: 0.533 ± 0.023
0.772AsnAsn: 0.772 ± 0.035
1.703AsnPro: 1.703 ± 0.036
0.792AsnGln: 0.792 ± 0.027
1.775AsnArg: 1.775 ± 0.044
1.153AsnSer: 1.153 ± 0.035
1.39AsnThr: 1.39 ± 0.037
2.177AsnVal: 2.177 ± 0.047
0.444AsnTrp: 0.444 ± 0.019
0.673AsnTyr: 0.673 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
6.519ProAla: 6.519 ± 0.085
0.299ProCys: 0.299 ± 0.017
3.355ProAsp: 3.355 ± 0.057
3.585ProGlu: 3.585 ± 0.056
1.969ProPhe: 1.969 ± 0.04
4.223ProGly: 4.223 ± 0.063
1.07ProHis: 1.07 ± 0.032
2.535ProIle: 2.535 ± 0.048
1.643ProLys: 1.643 ± 0.044
4.535ProLeu: 4.535 ± 0.063
1.166ProMet: 1.166 ± 0.03
1.626ProAsn: 1.626 ± 0.039
3.254ProPro: 3.254 ± 0.084
1.744ProGln: 1.744 ± 0.044
3.259ProArg: 3.259 ± 0.051
2.901ProSer: 2.901 ± 0.043
2.785ProThr: 2.785 ± 0.054
3.628ProVal: 3.628 ± 0.057
0.738ProTrp: 0.738 ± 0.027
1.067ProTyr: 1.067 ± 0.032
0.0ProXaa: 0.0 ± 0.0
Gln
4.265GlnAla: 4.265 ± 0.067
0.252GlnCys: 0.252 ± 0.018
1.448GlnAsp: 1.448 ± 0.032
1.496GlnGlu: 1.496 ± 0.038
1.136GlnPhe: 1.136 ± 0.03
2.547GlnGly: 2.547 ± 0.052
0.659GlnHis: 0.659 ± 0.025
1.726GlnIle: 1.726 ± 0.037
0.799GlnLys: 0.799 ± 0.028
3.014GlnLeu: 3.014 ± 0.05
0.83GlnMet: 0.83 ± 0.028
0.908GlnAsn: 0.908 ± 0.031
1.662GlnPro: 1.662 ± 0.044
1.179GlnGln: 1.179 ± 0.034
2.87GlnArg: 2.87 ± 0.056
1.871GlnSer: 1.871 ± 0.045
1.935GlnThr: 1.935 ± 0.045
2.165GlnVal: 2.165 ± 0.043
0.447GlnTrp: 0.447 ± 0.02
0.614GlnTyr: 0.614 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
10.24ArgAla: 10.24 ± 0.125
0.503ArgCys: 0.503 ± 0.023
4.54ArgAsp: 4.54 ± 0.07
4.656ArgGlu: 4.656 ± 0.064
3.091ArgPhe: 3.091 ± 0.05
5.654ArgGly: 5.654 ± 0.083
1.41ArgHis: 1.41 ± 0.04
3.972ArgIle: 3.972 ± 0.063
1.959ArgLys: 1.959 ± 0.046
7.391ArgLeu: 7.391 ± 0.093
1.981ArgMet: 1.981 ± 0.042
1.922ArgAsn: 1.922 ± 0.037
3.521ArgPro: 3.521 ± 0.06
2.293ArgGln: 2.293 ± 0.054
6.076ArgArg: 6.076 ± 0.089
3.523ArgSer: 3.523 ± 0.066
3.753ArgThr: 3.753 ± 0.054
5.569ArgVal: 5.569 ± 0.074
1.187ArgTrp: 1.187 ± 0.036
1.936ArgTyr: 1.936 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
6.899SerAla: 6.899 ± 0.071
0.374SerCys: 0.374 ± 0.017
3.061SerAsp: 3.061 ± 0.055
3.051SerGlu: 3.051 ± 0.052
2.204SerPhe: 2.204 ± 0.047
5.121SerGly: 5.121 ± 0.065
0.916SerHis: 0.916 ± 0.028
2.666SerIle: 2.666 ± 0.052
1.432SerLys: 1.432 ± 0.041
4.798SerLeu: 4.798 ± 0.068
1.292SerMet: 1.292 ± 0.033
1.47SerAsn: 1.47 ± 0.038
2.784SerPro: 2.784 ± 0.052
1.519SerGln: 1.519 ± 0.035
3.585SerArg: 3.585 ± 0.06
2.71SerSer: 2.71 ± 0.06
2.526SerThr: 2.526 ± 0.051
3.83SerVal: 3.83 ± 0.059
0.777SerTrp: 0.777 ± 0.024
1.303SerTyr: 1.303 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
6.331ThrAla: 6.331 ± 0.078
0.386ThrCys: 0.386 ± 0.021
2.739ThrAsp: 2.739 ± 0.053
2.606ThrGlu: 2.606 ± 0.051
2.054ThrPhe: 2.054 ± 0.044
4.547ThrGly: 4.547 ± 0.063
1.001ThrHis: 1.001 ± 0.029
2.667ThrIle: 2.667 ± 0.051
1.334ThrLys: 1.334 ± 0.033
5.578ThrLeu: 5.578 ± 0.08
1.098ThrMet: 1.098 ± 0.029
1.328ThrAsn: 1.328 ± 0.037
3.782ThrPro: 3.782 ± 0.063
1.758ThrGln: 1.758 ± 0.035
3.661ThrArg: 3.661 ± 0.062
2.664ThrSer: 2.664 ± 0.048
2.745ThrThr: 2.745 ± 0.055
3.51ThrVal: 3.51 ± 0.058
0.728ThrTrp: 0.728 ± 0.023
1.269ThrTyr: 1.269 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
9.371ValAla: 9.371 ± 0.113
0.644ValCys: 0.644 ± 0.023
3.925ValAsp: 3.925 ± 0.059
4.574ValGlu: 4.574 ± 0.064
2.884ValPhe: 2.884 ± 0.054
5.624ValGly: 5.624 ± 0.075
1.269ValHis: 1.269 ± 0.033
3.85ValIle: 3.85 ± 0.056
2.006ValLys: 2.006 ± 0.053
7.237ValLeu: 7.237 ± 0.088
1.687ValMet: 1.687 ± 0.043
2.028ValAsn: 2.028 ± 0.046
2.85ValPro: 2.85 ± 0.046
2.116ValGln: 2.116 ± 0.039
5.575ValArg: 5.575 ± 0.064
4.249ValSer: 4.249 ± 0.061
4.288ValThr: 4.288 ± 0.061
5.628ValVal: 5.628 ± 0.075
1.011ValTrp: 1.011 ± 0.027
1.413ValTyr: 1.413 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
1.563TrpAla: 1.563 ± 0.044
0.14TrpCys: 0.14 ± 0.009
0.778TrpAsp: 0.778 ± 0.029
0.633TrpGlu: 0.633 ± 0.024
0.575TrpPhe: 0.575 ± 0.022
1.019TrpGly: 1.019 ± 0.028
0.249TrpHis: 0.249 ± 0.016
0.788TrpIle: 0.788 ± 0.027
0.399TrpLys: 0.399 ± 0.019
1.567TrpLeu: 1.567 ± 0.043
0.364TrpMet: 0.364 ± 0.017
0.421TrpAsn: 0.421 ± 0.02
0.736TrpPro: 0.736 ± 0.026
0.438TrpGln: 0.438 ± 0.022
1.612TrpArg: 1.612 ± 0.041
0.962TrpSer: 0.962 ± 0.031
0.884TrpThr: 0.884 ± 0.03
0.836TrpVal: 0.836 ± 0.027
0.271TrpTrp: 0.271 ± 0.017
0.256TrpTyr: 0.256 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.868TyrAla: 2.868 ± 0.049
0.224TyrCys: 0.224 ± 0.015
1.506TyrAsp: 1.506 ± 0.044
1.394TyrGlu: 1.394 ± 0.039
0.923TyrPhe: 0.923 ± 0.03
2.115TyrGly: 2.115 ± 0.039
0.422TyrHis: 0.422 ± 0.018
0.859TyrIle: 0.859 ± 0.027
0.525TyrLys: 0.525 ± 0.025
2.025TyrLeu: 2.025 ± 0.043
0.394TyrMet: 0.394 ± 0.019
0.619TyrAsn: 0.619 ± 0.024
0.992TyrPro: 0.992 ± 0.03
0.777TyrGln: 0.777 ± 0.027
1.736TyrArg: 1.736 ± 0.044
1.146TyrSer: 1.146 ± 0.036
1.036TyrThr: 1.036 ± 0.031
1.728TyrVal: 1.728 ± 0.04
0.399TyrTrp: 0.399 ± 0.02
0.598TyrTyr: 0.598 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3942 proteins (1171740 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski