Amino acid dipepetide frequency for Ordospora colligata OC4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.189AlaAla: 2.189 ± 0.077
1.151AlaCys: 1.151 ± 0.037
2.622AlaAsp: 2.622 ± 0.062
3.536AlaGlu: 3.536 ± 0.08
2.349AlaPhe: 2.349 ± 0.065
2.514AlaGly: 2.514 ± 0.072
0.962AlaHis: 0.962 ± 0.036
4.064AlaIle: 4.064 ± 0.086
3.527AlaLys: 3.527 ± 0.077
4.841AlaLeu: 4.841 ± 0.099
1.82AlaMet: 1.82 ± 0.057
2.194AlaAsn: 2.194 ± 0.064
1.165AlaPro: 1.165 ± 0.048
1.379AlaGln: 1.379 ± 0.047
2.517AlaArg: 2.517 ± 0.056
3.817AlaSer: 3.817 ± 0.094
1.898AlaThr: 1.898 ± 0.056
3.474AlaVal: 3.474 ± 0.071
0.33AlaTrp: 0.33 ± 0.025
1.867AlaTyr: 1.867 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
1.108CysAla: 1.108 ± 0.042
0.461CysCys: 0.461 ± 0.026
1.218CysAsp: 1.218 ± 0.046
1.711CysGlu: 1.711 ± 0.053
1.131CysPhe: 1.131 ± 0.041
1.496CysGly: 1.496 ± 0.06
0.355CysHis: 0.355 ± 0.023
2.327CysIle: 2.327 ± 0.065
1.932CysLys: 1.932 ± 0.061
2.087CysLeu: 2.087 ± 0.062
1.049CysMet: 1.049 ± 0.041
1.185CysAsn: 1.185 ± 0.04
0.518CysPro: 0.518 ± 0.024
0.383CysGln: 0.383 ± 0.027
1.221CysArg: 1.221 ± 0.041
1.812CysSer: 1.812 ± 0.052
1.142CysThr: 1.142 ± 0.041
2.057CysVal: 2.057 ± 0.062
0.147CysTrp: 0.147 ± 0.014
0.737CysTyr: 0.737 ± 0.034
0.0CysXaa: 0.0 ± 0.0
Asp
3.607AspAla: 3.607 ± 0.07
1.1AspCys: 1.1 ± 0.038
3.645AspAsp: 3.645 ± 0.086
5.845AspGlu: 5.845 ± 0.111
2.666AspPhe: 2.666 ± 0.08
3.955AspGly: 3.955 ± 0.091
0.808AspHis: 0.808 ± 0.036
4.517AspIle: 4.517 ± 0.083
3.696AspLys: 3.696 ± 0.099
4.711AspLeu: 4.711 ± 0.092
2.109AspMet: 2.109 ± 0.059
2.171AspAsn: 2.171 ± 0.059
1.517AspPro: 1.517 ± 0.052
1.263AspGln: 1.263 ± 0.049
2.523AspArg: 2.523 ± 0.066
3.465AspSer: 3.465 ± 0.083
2.385AspThr: 2.385 ± 0.066
5.524AspVal: 5.524 ± 0.121
0.397AspTrp: 0.397 ± 0.025
1.848AspTyr: 1.848 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
3.821GluAla: 3.821 ± 0.083
2.453GluCys: 2.453 ± 0.078
4.633GluAsp: 4.633 ± 0.091
7.094GluGlu: 7.094 ± 0.139
3.572GluPhe: 3.572 ± 0.078
4.025GluGly: 4.025 ± 0.094
1.539GluHis: 1.539 ± 0.051
6.672GluIle: 6.672 ± 0.12
6.324GluLys: 6.324 ± 0.147
6.029GluLeu: 6.029 ± 0.098
3.71GluMet: 3.71 ± 0.087
3.986GluAsn: 3.986 ± 0.093
1.454GluPro: 1.454 ± 0.052
2.194GluGln: 2.194 ± 0.059
4.177GluArg: 4.177 ± 0.1
5.596GluSer: 5.596 ± 0.095
2.65GluThr: 2.65 ± 0.066
5.277GluVal: 5.277 ± 0.104
0.684GluTrp: 0.684 ± 0.036
3.801GluTyr: 3.801 ± 0.091
0.0GluXaa: 0.0 ± 0.0
Phe
1.977PheAla: 1.977 ± 0.065
1.226PheCys: 1.226 ± 0.042
3.086PheAsp: 3.086 ± 0.076
3.936PheGlu: 3.936 ± 0.076
2.467PhePhe: 2.467 ± 0.072
2.918PheGly: 2.918 ± 0.069
0.818PheHis: 0.818 ± 0.035
3.24PheIle: 3.24 ± 0.072
3.064PheLys: 3.064 ± 0.069
4.323PheLeu: 4.323 ± 0.098
1.527PheMet: 1.527 ± 0.047
2.07PheAsn: 2.07 ± 0.063
1.244PhePro: 1.244 ± 0.045
0.9PheGln: 0.9 ± 0.036
2.191PheArg: 2.191 ± 0.067
3.547PheSer: 3.547 ± 0.092
1.666PheThr: 1.666 ± 0.052
4.158PheVal: 4.158 ± 0.085
0.344PheTrp: 0.344 ± 0.022
1.958PheTyr: 1.958 ± 0.061
0.0PheXaa: 0.0 ± 0.0
Gly
2.281GlyAla: 2.281 ± 0.063
1.524GlyCys: 1.524 ± 0.057
2.968GlyAsp: 2.968 ± 0.07
3.603GlyGlu: 3.603 ± 0.079
2.793GlyPhe: 2.793 ± 0.073
2.785GlyGly: 2.785 ± 0.08
1.08GlyHis: 1.08 ± 0.05
5.025GlyIle: 5.025 ± 0.086
4.451GlyLys: 4.451 ± 0.088
4.69GlyLeu: 4.69 ± 0.097
2.596GlyMet: 2.596 ± 0.072
2.636GlyAsn: 2.636 ± 0.076
0.959GlyPro: 0.959 ± 0.039
1.258GlyGln: 1.258 ± 0.046
2.945GlyArg: 2.945 ± 0.081
4.172GlySer: 4.172 ± 0.1
2.501GlyThr: 2.501 ± 0.079
4.285GlyVal: 4.285 ± 0.083
0.444GlyTrp: 0.444 ± 0.028
2.347GlyTyr: 2.347 ± 0.068
0.0GlyXaa: 0.0 ± 0.0
His
1.06HisAla: 1.06 ± 0.041
0.388HisCys: 0.388 ± 0.028
0.991HisAsp: 0.991 ± 0.042
1.652HisGlu: 1.652 ± 0.052
0.869HisPhe: 0.869 ± 0.035
1.351HisGly: 1.351 ± 0.049
0.4HisHis: 0.4 ± 0.025
1.347HisIle: 1.347 ± 0.05
1.457HisLys: 1.457 ± 0.058
1.761HisLeu: 1.761 ± 0.052
0.628HisMet: 0.628 ± 0.037
0.965HisAsn: 0.965 ± 0.039
0.735HisPro: 0.735 ± 0.034
0.554HisGln: 0.554 ± 0.026
1.081HisArg: 1.081 ± 0.045
1.378HisSer: 1.378 ± 0.049
0.945HisThr: 0.945 ± 0.043
1.566HisVal: 1.566 ± 0.056
0.102HisTrp: 0.102 ± 0.014
0.631HisTyr: 0.631 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
3.88IleAla: 3.88 ± 0.075
2.133IleCys: 2.133 ± 0.059
5.64IleAsp: 5.64 ± 0.094
6.642IleGlu: 6.642 ± 0.123
3.237IlePhe: 3.237 ± 0.077
4.468IleGly: 4.468 ± 0.091
1.697IleHis: 1.697 ± 0.053
4.448IleIle: 4.448 ± 0.095
5.708IleLys: 5.708 ± 0.094
6.849IleLeu: 6.849 ± 0.127
2.129IleMet: 2.129 ± 0.07
3.995IleAsn: 3.995 ± 0.093
2.622IlePro: 2.622 ± 0.071
2.327IleGln: 2.327 ± 0.052
4.279IleArg: 4.279 ± 0.077
5.966IleSer: 5.966 ± 0.108
2.932IleThr: 2.932 ± 0.082
5.221IleVal: 5.221 ± 0.092
0.509IleTrp: 0.509 ± 0.03
2.571IleTyr: 2.571 ± 0.061
0.0IleXaa: 0.0 ± 0.0
Lys
4.189LysAla: 4.189 ± 0.08
1.738LysCys: 1.738 ± 0.054
4.313LysAsp: 4.313 ± 0.081
6.811LysGlu: 6.811 ± 0.166
2.375LysPhe: 2.375 ± 0.059
3.682LysGly: 3.682 ± 0.079
1.888LysHis: 1.888 ± 0.059
6.434LysIle: 6.434 ± 0.108
7.004LysLys: 7.004 ± 0.14
5.527LysLeu: 5.527 ± 0.099
3.317LysMet: 3.317 ± 0.069
4.202LysAsn: 4.202 ± 0.091
1.93LysPro: 1.93 ± 0.058
2.621LysGln: 2.621 ± 0.066
4.549LysArg: 4.549 ± 0.094
5.314LysSer: 5.314 ± 0.105
3.603LysThr: 3.603 ± 0.076
4.974LysVal: 4.974 ± 0.099
0.528LysTrp: 0.528 ± 0.032
3.517LysTyr: 3.517 ± 0.077
0.0LysXaa: 0.0 ± 0.0
Leu
3.874LeuAla: 3.874 ± 0.087
2.337LeuCys: 2.337 ± 0.061
4.866LeuAsp: 4.866 ± 0.098
6.427LeuGlu: 6.427 ± 0.102
4.192LeuPhe: 4.192 ± 0.102
4.512LeuGly: 4.512 ± 0.084
1.691LeuHis: 1.691 ± 0.054
5.717LeuIle: 5.717 ± 0.113
7.331LeuLys: 7.331 ± 0.124
7.88LeuLeu: 7.88 ± 0.122
2.906LeuMet: 2.906 ± 0.075
4.74LeuAsn: 4.74 ± 0.085
2.205LeuPro: 2.205 ± 0.062
2.366LeuGln: 2.366 ± 0.059
4.965LeuArg: 4.965 ± 0.095
6.728LeuSer: 6.728 ± 0.114
2.847LeuThr: 2.847 ± 0.093
6.037LeuVal: 6.037 ± 0.104
0.591LeuTrp: 0.591 ± 0.028
3.28LeuTyr: 3.28 ± 0.077
0.0LeuXaa: 0.0 ± 0.0
Met
1.828MetAla: 1.828 ± 0.062
0.979MetCys: 0.979 ± 0.038
2.171MetAsp: 2.171 ± 0.049
2.361MetGlu: 2.361 ± 0.065
2.022MetPhe: 2.022 ± 0.059
1.494MetGly: 1.494 ± 0.05
1.137MetHis: 1.137 ± 0.043
2.847MetIle: 2.847 ± 0.076
3.514MetLys: 3.514 ± 0.088
3.573MetLeu: 3.573 ± 0.074
1.412MetMet: 1.412 ± 0.046
2.54MetAsn: 2.54 ± 0.07
1.119MetPro: 1.119 ± 0.043
1.229MetGln: 1.229 ± 0.045
1.863MetArg: 1.863 ± 0.058
2.725MetSer: 2.725 ± 0.064
1.175MetThr: 1.175 ± 0.035
2.095MetVal: 2.095 ± 0.056
0.244MetTrp: 0.244 ± 0.02
1.566MetTyr: 1.566 ± 0.05
0.0MetXaa: 0.0 ± 0.0
Asn
3.233AsnAla: 3.233 ± 0.071
0.729AsnCys: 0.729 ± 0.034
3.043AsnAsp: 3.043 ± 0.083
5.055AsnGlu: 5.055 ± 0.101
1.674AsnPhe: 1.674 ± 0.054
3.51AsnGly: 3.51 ± 0.079
0.881AsnHis: 0.881 ± 0.039
3.733AsnIle: 3.733 ± 0.083
3.987AsnLys: 3.987 ± 0.089
3.586AsnLeu: 3.586 ± 0.075
1.714AsnMet: 1.714 ± 0.053
2.633AsnAsn: 2.633 ± 0.076
1.533AsnPro: 1.533 ± 0.055
1.401AsnGln: 1.401 ± 0.052
2.469AsnArg: 2.469 ± 0.057
3.206AsnSer: 3.206 ± 0.085
2.68AsnThr: 2.68 ± 0.076
3.87AsnVal: 3.87 ± 0.073
0.27AsnTrp: 0.27 ± 0.021
1.421AsnTyr: 1.421 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
1.12ProAla: 1.12 ± 0.047
0.554ProCys: 0.554 ± 0.033
1.399ProAsp: 1.399 ± 0.051
2.161ProGlu: 2.161 ± 0.058
1.384ProPhe: 1.384 ± 0.047
1.471ProGly: 1.471 ± 0.057
0.49ProHis: 0.49 ± 0.03
1.713ProIle: 1.713 ± 0.061
1.867ProLys: 1.867 ± 0.052
2.149ProLeu: 2.149 ± 0.062
0.787ProMet: 0.787 ± 0.038
1.275ProAsn: 1.275 ± 0.051
0.698ProPro: 0.698 ± 0.035
0.742ProGln: 0.742 ± 0.038
1.382ProArg: 1.382 ± 0.046
2.213ProSer: 2.213 ± 0.078
1.167ProThr: 1.167 ± 0.049
2.036ProVal: 2.036 ± 0.069
0.22ProTrp: 0.22 ± 0.017
1.08ProTyr: 1.08 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
1.384GlnAla: 1.384 ± 0.049
0.531GlnCys: 0.531 ± 0.027
1.393GlnAsp: 1.393 ± 0.053
2.123GlnGlu: 2.123 ± 0.067
0.974GlnPhe: 0.974 ± 0.042
1.42GlnGly: 1.42 ± 0.047
0.545GlnHis: 0.545 ± 0.026
2.116GlnIle: 2.116 ± 0.063
2.419GlnLys: 2.419 ± 0.081
1.894GlnLeu: 1.894 ± 0.049
1.181GlnMet: 1.181 ± 0.038
1.437GlnAsn: 1.437 ± 0.056
0.714GlnPro: 0.714 ± 0.049
0.957GlnGln: 0.957 ± 0.045
1.626GlnArg: 1.626 ± 0.05
1.991GlnSer: 1.991 ± 0.066
1.195GlnThr: 1.195 ± 0.051
1.663GlnVal: 1.663 ± 0.054
0.172GlnTrp: 0.172 ± 0.018
0.991GlnTyr: 0.991 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
2.203ArgAla: 2.203 ± 0.057
1.322ArgCys: 1.322 ± 0.048
2.565ArgAsp: 2.565 ± 0.057
3.426ArgGlu: 3.426 ± 0.081
2.576ArgPhe: 2.576 ± 0.067
2.363ArgGly: 2.363 ± 0.056
1.139ArgHis: 1.139 ± 0.04
4.976ArgIle: 4.976 ± 0.083
4.414ArgLys: 4.414 ± 0.097
4.552ArgLeu: 4.552 ± 0.09
2.695ArgMet: 2.695 ± 0.074
2.9ArgAsn: 2.9 ± 0.072
1.044ArgPro: 1.044 ± 0.044
1.28ArgGln: 1.28 ± 0.043
3.452ArgArg: 3.452 ± 0.087
3.846ArgSer: 3.846 ± 0.077
2.056ArgThr: 2.056 ± 0.06
3.164ArgVal: 3.164 ± 0.075
0.411ArgTrp: 0.411 ± 0.025
2.334ArgTyr: 2.334 ± 0.066
0.0ArgXaa: 0.0 ± 0.0
Ser
3.106SerAla: 3.106 ± 0.07
1.362SerCys: 1.362 ± 0.05
3.93SerAsp: 3.93 ± 0.079
5.364SerGlu: 5.364 ± 0.096
3.286SerPhe: 3.286 ± 0.079
4.585SerGly: 4.585 ± 0.095
1.275SerHis: 1.275 ± 0.047
6.36SerIle: 6.36 ± 0.119
6.481SerLys: 6.481 ± 0.117
6.216SerLeu: 6.216 ± 0.116
2.864SerMet: 2.864 ± 0.072
3.797SerAsn: 3.797 ± 0.083
1.71SerPro: 1.71 ± 0.07
1.899SerGln: 1.899 ± 0.067
3.944SerArg: 3.944 ± 0.071
5.88SerSer: 5.88 ± 0.133
3.468SerThr: 3.468 ± 0.084
5.705SerVal: 5.705 ± 0.092
0.417SerTrp: 0.417 ± 0.026
2.296SerTyr: 2.296 ± 0.061
0.0SerXaa: 0.0 ± 0.0
Thr
1.846ThrAla: 1.846 ± 0.058
0.852ThrCys: 0.852 ± 0.037
2.282ThrAsp: 2.282 ± 0.067
2.936ThrGlu: 2.936 ± 0.073
1.862ThrPhe: 1.862 ± 0.059
2.554ThrGly: 2.554 ± 0.07
0.942ThrHis: 0.942 ± 0.04
3.089ThrIle: 3.089 ± 0.071
3.016ThrLys: 3.016 ± 0.077
3.536ThrLeu: 3.536 ± 0.081
1.202ThrMet: 1.202 ± 0.047
2.203ThrAsn: 2.203 ± 0.061
1.398ThrPro: 1.398 ± 0.064
1.193ThrGln: 1.193 ± 0.052
2.05ThrArg: 2.05 ± 0.058
3.241ThrSer: 3.241 ± 0.087
1.898ThrThr: 1.898 ± 0.071
2.78ThrVal: 2.78 ± 0.074
0.276ThrTrp: 0.276 ± 0.022
1.451ThrTyr: 1.451 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
3.116ValAla: 3.116 ± 0.081
2.226ValCys: 2.226 ± 0.071
4.557ValAsp: 4.557 ± 0.096
5.475ValGlu: 5.475 ± 0.1
4.816ValPhe: 4.816 ± 0.103
3.562ValGly: 3.562 ± 0.079
1.469ValHis: 1.469 ± 0.051
5.252ValIle: 5.252 ± 0.086
4.757ValLys: 4.757 ± 0.096
7.042ValLeu: 7.042 ± 0.103
2.531ValMet: 2.531 ± 0.082
3.404ValAsn: 3.404 ± 0.067
2.105ValPro: 2.105 ± 0.063
1.803ValGln: 1.803 ± 0.052
3.181ValArg: 3.181 ± 0.077
5.652ValSer: 5.652 ± 0.098
2.214ValThr: 2.214 ± 0.064
5.716ValVal: 5.716 ± 0.127
0.59ValTrp: 0.59 ± 0.03
3.426ValTyr: 3.426 ± 0.07
0.0ValXaa: 0.0 ± 0.0
Trp
0.31TrpAla: 0.31 ± 0.023
0.161TrpCys: 0.161 ± 0.016
0.407TrpAsp: 0.407 ± 0.027
0.399TrpGlu: 0.399 ± 0.027
0.258TrpPhe: 0.258 ± 0.019
0.313TrpGly: 0.313 ± 0.026
0.124TrpHis: 0.124 ± 0.013
0.723TrpIle: 0.723 ± 0.033
0.704TrpLys: 0.704 ± 0.034
0.501TrpLeu: 0.501 ± 0.026
0.4TrpMet: 0.4 ± 0.031
0.479TrpAsn: 0.479 ± 0.03
0.169TrpPro: 0.169 ± 0.017
0.16TrpGln: 0.16 ± 0.013
0.391TrpArg: 0.391 ± 0.027
0.492TrpSer: 0.492 ± 0.027
0.307TrpThr: 0.307 ± 0.021
0.337TrpVal: 0.337 ± 0.025
0.051TrpTrp: 0.051 ± 0.008
0.292TrpTyr: 0.292 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.081TyrAla: 2.081 ± 0.051
0.852TyrCys: 0.852 ± 0.035
2.02TyrAsp: 2.02 ± 0.055
3.134TyrGlu: 3.134 ± 0.076
2.158TyrPhe: 2.158 ± 0.06
2.366TyrGly: 2.366 ± 0.06
0.621TyrHis: 0.621 ± 0.029
2.655TyrIle: 2.655 ± 0.066
2.863TyrLys: 2.863 ± 0.077
3.745TyrLeu: 3.745 ± 0.074
1.314TyrMet: 1.314 ± 0.045
1.815TyrAsn: 1.815 ± 0.059
1.067TyrPro: 1.067 ± 0.037
0.816TyrGln: 0.816 ± 0.036
1.913TyrArg: 1.913 ± 0.053
2.906TyrSer: 2.906 ± 0.074
1.88TyrThr: 1.88 ± 0.058
2.957TyrVal: 2.957 ± 0.065
0.25TyrTrp: 0.25 ± 0.018
1.429TyrTyr: 1.429 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1810 proteins (644518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski