Amino acid dipepetide frequency for Virgibacillus necropolis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.087AlaAla: 5.087 ± 0.092
0.614AlaCys: 0.614 ± 0.023
3.366AlaAsp: 3.366 ± 0.063
4.254AlaGlu: 4.254 ± 0.062
3.258AlaPhe: 3.258 ± 0.059
5.076AlaGly: 5.076 ± 0.077
1.253AlaHis: 1.253 ± 0.034
6.319AlaIle: 6.319 ± 0.082
4.648AlaLys: 4.648 ± 0.064
6.857AlaLeu: 6.857 ± 0.092
1.951AlaMet: 1.951 ± 0.043
2.962AlaAsn: 2.962 ± 0.054
2.025AlaPro: 2.025 ± 0.043
2.007AlaGln: 2.007 ± 0.045
2.427AlaArg: 2.427 ± 0.044
4.089AlaSer: 4.089 ± 0.063
3.842AlaThr: 3.842 ± 0.064
5.149AlaVal: 5.149 ± 0.076
0.633AlaTrp: 0.633 ± 0.023
2.325AlaTyr: 2.325 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.402CysAla: 0.402 ± 0.017
0.079CysCys: 0.079 ± 0.01
0.42CysAsp: 0.42 ± 0.023
0.42CysGlu: 0.42 ± 0.02
0.307CysPhe: 0.307 ± 0.016
0.662CysGly: 0.662 ± 0.024
0.185CysHis: 0.185 ± 0.011
0.511CysIle: 0.511 ± 0.023
0.383CysLys: 0.383 ± 0.018
0.615CysLeu: 0.615 ± 0.021
0.196CysMet: 0.196 ± 0.013
0.322CysAsn: 0.322 ± 0.015
0.323CysPro: 0.323 ± 0.017
0.24CysGln: 0.24 ± 0.014
0.22CysArg: 0.22 ± 0.014
0.499CysSer: 0.499 ± 0.019
0.393CysThr: 0.393 ± 0.018
0.394CysVal: 0.394 ± 0.019
0.065CysTrp: 0.065 ± 0.008
0.235CysTyr: 0.235 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.42AspAla: 3.42 ± 0.06
0.392AspCys: 0.392 ± 0.021
2.794AspAsp: 2.794 ± 0.06
4.327AspGlu: 4.327 ± 0.074
2.41AspPhe: 2.41 ± 0.046
3.513AspGly: 3.513 ± 0.057
1.207AspHis: 1.207 ± 0.03
4.241AspIle: 4.241 ± 0.06
3.684AspLys: 3.684 ± 0.059
5.104AspLeu: 5.104 ± 0.061
1.382AspMet: 1.382 ± 0.038
2.249AspAsn: 2.249 ± 0.042
2.141AspPro: 2.141 ± 0.046
2.062AspGln: 2.062 ± 0.044
2.162AspArg: 2.162 ± 0.041
2.889AspSer: 2.889 ± 0.052
2.654AspThr: 2.654 ± 0.049
3.922AspVal: 3.922 ± 0.062
0.623AspTrp: 0.623 ± 0.022
2.282AspTyr: 2.282 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
4.942GluAla: 4.942 ± 0.079
0.358GluCys: 0.358 ± 0.02
3.786GluAsp: 3.786 ± 0.074
6.236GluGlu: 6.236 ± 0.093
2.563GluPhe: 2.563 ± 0.045
4.127GluGly: 4.127 ± 0.06
1.395GluHis: 1.395 ± 0.037
5.562GluIle: 5.562 ± 0.073
6.656GluLys: 6.656 ± 0.082
6.747GluLeu: 6.747 ± 0.08
2.172GluMet: 2.172 ± 0.046
3.899GluAsn: 3.899 ± 0.059
1.892GluPro: 1.892 ± 0.046
3.152GluGln: 3.152 ± 0.061
3.124GluArg: 3.124 ± 0.052
3.885GluSer: 3.885 ± 0.063
3.923GluThr: 3.923 ± 0.063
4.951GluVal: 4.951 ± 0.083
0.784GluTrp: 0.784 ± 0.026
2.123GluTyr: 2.123 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.058PheAla: 3.058 ± 0.054
0.322PheCys: 0.322 ± 0.016
2.458PheAsp: 2.458 ± 0.049
2.7PheGlu: 2.7 ± 0.052
2.303PhePhe: 2.303 ± 0.06
3.463PheGly: 3.463 ± 0.065
1.022PheHis: 1.022 ± 0.029
4.091PheIle: 4.091 ± 0.075
2.363PheLys: 2.363 ± 0.049
4.43PheLeu: 4.43 ± 0.082
1.187PheMet: 1.187 ± 0.033
1.955PheAsn: 1.955 ± 0.04
1.679PhePro: 1.679 ± 0.042
1.529PheGln: 1.529 ± 0.038
1.461PheArg: 1.461 ± 0.035
3.224PheSer: 3.224 ± 0.059
2.733PheThr: 2.733 ± 0.05
3.231PheVal: 3.231 ± 0.063
0.49PheTrp: 0.49 ± 0.025
1.716PheTyr: 1.716 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
4.844GlyAla: 4.844 ± 0.095
0.589GlyCys: 0.589 ± 0.025
3.213GlyAsp: 3.213 ± 0.058
4.337GlyGlu: 4.337 ± 0.068
3.529GlyPhe: 3.529 ± 0.068
4.998GlyGly: 4.998 ± 0.087
1.367GlyHis: 1.367 ± 0.038
6.398GlyIle: 6.398 ± 0.081
4.972GlyLys: 4.972 ± 0.067
6.611GlyLeu: 6.611 ± 0.099
2.183GlyMet: 2.183 ± 0.046
2.926GlyAsn: 2.926 ± 0.051
1.829GlyPro: 1.829 ± 0.04
2.009GlyGln: 2.009 ± 0.042
2.342GlyArg: 2.342 ± 0.045
4.374GlySer: 4.374 ± 0.059
4.085GlyThr: 4.085 ± 0.06
5.273GlyVal: 5.273 ± 0.074
0.848GlyTrp: 0.848 ± 0.028
2.798GlyTyr: 2.798 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
1.441HisAla: 1.441 ± 0.038
0.191HisCys: 0.191 ± 0.012
1.133HisAsp: 1.133 ± 0.034
1.343HisGlu: 1.343 ± 0.033
1.028HisPhe: 1.028 ± 0.027
1.476HisGly: 1.476 ± 0.04
0.637HisHis: 0.637 ± 0.025
1.506HisIle: 1.506 ± 0.036
1.131HisLys: 1.131 ± 0.032
1.992HisLeu: 1.992 ± 0.052
0.536HisMet: 0.536 ± 0.021
0.845HisAsn: 0.845 ± 0.028
1.075HisPro: 1.075 ± 0.028
0.792HisGln: 0.792 ± 0.025
0.783HisArg: 0.783 ± 0.026
1.128HisSer: 1.128 ± 0.031
1.136HisThr: 1.136 ± 0.031
1.545HisVal: 1.545 ± 0.04
0.205HisTrp: 0.205 ± 0.013
0.827HisTyr: 0.827 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.346IleAla: 6.346 ± 0.091
0.672IleCys: 0.672 ± 0.026
4.67IleAsp: 4.67 ± 0.065
5.668IleGlu: 5.668 ± 0.082
3.49IlePhe: 3.49 ± 0.069
6.451IleGly: 6.451 ± 0.103
1.739IleHis: 1.739 ± 0.044
6.646IleIle: 6.646 ± 0.11
4.953IleLys: 4.953 ± 0.077
7.149IleLeu: 7.149 ± 0.103
1.956IleMet: 1.956 ± 0.044
3.784IleAsn: 3.784 ± 0.057
3.594IlePro: 3.594 ± 0.048
2.839IleGln: 2.839 ± 0.053
2.949IleArg: 2.949 ± 0.05
5.145IleSer: 5.145 ± 0.073
4.716IleThr: 4.716 ± 0.07
5.97IleVal: 5.97 ± 0.078
0.692IleTrp: 0.692 ± 0.027
2.469IleTyr: 2.469 ± 0.056
0.0IleXaa: 0.0 ± 0.0
Lys
4.433LysAla: 4.433 ± 0.059
0.305LysCys: 0.305 ± 0.018
4.054LysAsp: 4.054 ± 0.06
6.764LysGlu: 6.764 ± 0.088
1.973LysPhe: 1.973 ± 0.045
4.248LysGly: 4.248 ± 0.06
1.355LysHis: 1.355 ± 0.033
4.713LysIle: 4.713 ± 0.069
6.0LysLys: 6.0 ± 0.083
5.934LysLeu: 5.934 ± 0.067
2.176LysMet: 2.176 ± 0.04
3.615LysAsn: 3.615 ± 0.067
2.276LysPro: 2.276 ± 0.047
3.199LysGln: 3.199 ± 0.063
3.14LysArg: 3.14 ± 0.059
3.961LysSer: 3.961 ± 0.071
3.596LysThr: 3.596 ± 0.064
4.64LysVal: 4.64 ± 0.07
0.815LysTrp: 0.815 ± 0.028
2.185LysTyr: 2.185 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
7.085LeuAla: 7.085 ± 0.096
0.595LeuCys: 0.595 ± 0.022
4.881LeuAsp: 4.881 ± 0.068
6.266LeuGlu: 6.266 ± 0.084
4.846LeuPhe: 4.846 ± 0.082
6.53LeuGly: 6.53 ± 0.095
1.899LeuHis: 1.899 ± 0.037
7.65LeuIle: 7.65 ± 0.113
6.118LeuLys: 6.118 ± 0.088
9.607LeuLeu: 9.607 ± 0.119
2.498LeuMet: 2.498 ± 0.048
4.379LeuAsn: 4.379 ± 0.063
3.81LeuPro: 3.81 ± 0.062
3.296LeuGln: 3.296 ± 0.051
3.239LeuArg: 3.239 ± 0.051
6.483LeuSer: 6.483 ± 0.081
5.652LeuThr: 5.652 ± 0.072
6.425LeuVal: 6.425 ± 0.084
0.8LeuTrp: 0.8 ± 0.027
3.092LeuTyr: 3.092 ± 0.058
0.0LeuXaa: 0.0 ± 0.0
Met
1.949MetAla: 1.949 ± 0.042
0.141MetCys: 0.141 ± 0.01
1.592MetAsp: 1.592 ± 0.039
2.087MetGlu: 2.087 ± 0.045
1.08MetPhe: 1.08 ± 0.032
1.834MetGly: 1.834 ± 0.048
0.471MetHis: 0.471 ± 0.019
2.205MetIle: 2.205 ± 0.048
2.41MetLys: 2.41 ± 0.049
2.578MetLeu: 2.578 ± 0.05
0.816MetMet: 0.816 ± 0.027
1.557MetAsn: 1.557 ± 0.042
0.955MetPro: 0.955 ± 0.028
0.946MetGln: 0.946 ± 0.03
1.034MetArg: 1.034 ± 0.033
1.668MetSer: 1.668 ± 0.04
1.619MetThr: 1.619 ± 0.038
1.842MetVal: 1.842 ± 0.042
0.199MetTrp: 0.199 ± 0.013
0.784MetTyr: 0.784 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.751AsnAla: 2.751 ± 0.042
0.332AsnCys: 0.332 ± 0.017
2.624AsnAsp: 2.624 ± 0.042
3.791AsnGlu: 3.791 ± 0.069
1.856AsnPhe: 1.856 ± 0.044
3.393AsnGly: 3.393 ± 0.059
1.123AsnHis: 1.123 ± 0.031
3.391AsnIle: 3.391 ± 0.057
3.397AsnLys: 3.397 ± 0.064
4.097AsnLeu: 4.097 ± 0.06
1.282AsnMet: 1.282 ± 0.034
2.363AsnAsn: 2.363 ± 0.047
2.207AsnPro: 2.207 ± 0.042
2.142AsnGln: 2.142 ± 0.052
2.041AsnArg: 2.041 ± 0.043
2.365AsnSer: 2.365 ± 0.049
2.374AsnThr: 2.374 ± 0.052
3.274AsnVal: 3.274 ± 0.055
0.552AsnTrp: 0.552 ± 0.023
1.784AsnTyr: 1.784 ± 0.04
0.0AsnXaa: 0.0 ± 0.0
Pro
2.181ProAla: 2.181 ± 0.048
0.217ProCys: 0.217 ± 0.015
2.139ProAsp: 2.139 ± 0.045
2.992ProGlu: 2.992 ± 0.054
1.976ProPhe: 1.976 ± 0.044
2.365ProGly: 2.365 ± 0.053
0.803ProHis: 0.803 ± 0.028
3.006ProIle: 3.006 ± 0.055
2.227ProLys: 2.227 ± 0.046
3.493ProLeu: 3.493 ± 0.052
0.84ProMet: 0.84 ± 0.026
1.716ProAsn: 1.716 ± 0.04
0.98ProPro: 0.98 ± 0.031
0.951ProGln: 0.951 ± 0.03
1.025ProArg: 1.025 ± 0.03
2.329ProSer: 2.329 ± 0.052
2.122ProThr: 2.122 ± 0.039
2.828ProVal: 2.828 ± 0.052
0.367ProTrp: 0.367 ± 0.019
1.36ProTyr: 1.36 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
2.67GlnAla: 2.67 ± 0.051
0.164GlnCys: 0.164 ± 0.012
1.781GlnAsp: 1.781 ± 0.041
2.744GlnGlu: 2.744 ± 0.056
1.506GlnPhe: 1.506 ± 0.037
2.09GlnGly: 2.09 ± 0.045
0.714GlnHis: 0.714 ± 0.028
2.504GlnIle: 2.504 ± 0.049
2.618GlnLys: 2.618 ± 0.056
3.803GlnLeu: 3.803 ± 0.058
1.019GlnMet: 1.019 ± 0.033
1.597GlnAsn: 1.597 ± 0.036
1.14GlnPro: 1.14 ± 0.03
1.602GlnGln: 1.602 ± 0.053
1.367GlnArg: 1.367 ± 0.034
2.148GlnSer: 2.148 ± 0.046
2.064GlnThr: 2.064 ± 0.042
2.442GlnVal: 2.442 ± 0.041
0.415GlnTrp: 0.415 ± 0.019
1.132GlnTyr: 1.132 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
2.24ArgAla: 2.24 ± 0.04
0.249ArgCys: 0.249 ± 0.014
1.981ArgAsp: 1.981 ± 0.038
2.794ArgGlu: 2.794 ± 0.045
1.837ArgPhe: 1.837 ± 0.036
2.208ArgGly: 2.208 ± 0.043
0.764ArgHis: 0.764 ± 0.027
2.944ArgIle: 2.944 ± 0.052
2.975ArgLys: 2.975 ± 0.05
3.636ArgLeu: 3.636 ± 0.058
1.181ArgMet: 1.181 ± 0.031
1.898ArgAsn: 1.898 ± 0.038
1.17ArgPro: 1.17 ± 0.032
1.33ArgGln: 1.33 ± 0.029
1.592ArgArg: 1.592 ± 0.04
2.136ArgSer: 2.136 ± 0.044
2.042ArgThr: 2.042 ± 0.042
2.458ArgVal: 2.458 ± 0.049
0.361ArgTrp: 0.361 ± 0.018
1.404ArgTyr: 1.404 ± 0.035
0.0ArgXaa: 0.0 ± 0.0
Ser
3.655SerAla: 3.655 ± 0.059
0.362SerCys: 0.362 ± 0.017
3.123SerAsp: 3.123 ± 0.048
4.058SerGlu: 4.058 ± 0.062
3.226SerPhe: 3.226 ± 0.063
4.623SerGly: 4.623 ± 0.063
1.244SerHis: 1.244 ± 0.031
5.599SerIle: 5.599 ± 0.065
3.919SerLys: 3.919 ± 0.063
6.028SerLeu: 6.028 ± 0.08
1.772SerMet: 1.772 ± 0.042
2.826SerAsn: 2.826 ± 0.052
2.047SerPro: 2.047 ± 0.042
1.86SerGln: 1.86 ± 0.04
2.176SerArg: 2.176 ± 0.046
4.046SerSer: 4.046 ± 0.066
3.332SerThr: 3.332 ± 0.061
4.196SerVal: 4.196 ± 0.064
0.681SerTrp: 0.681 ± 0.03
2.387SerTyr: 2.387 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
3.808ThrAla: 3.808 ± 0.057
0.393ThrCys: 0.393 ± 0.018
2.885ThrAsp: 2.885 ± 0.052
3.552ThrGlu: 3.552 ± 0.061
2.8ThrPhe: 2.8 ± 0.054
4.22ThrGly: 4.22 ± 0.066
1.11ThrHis: 1.11 ± 0.029
5.138ThrIle: 5.138 ± 0.065
3.613ThrLys: 3.613 ± 0.052
5.239ThrLeu: 5.239 ± 0.066
1.413ThrMet: 1.413 ± 0.038
2.783ThrAsn: 2.783 ± 0.049
2.302ThrPro: 2.302 ± 0.05
1.416ThrGln: 1.416 ± 0.032
1.839ThrArg: 1.839 ± 0.038
3.546ThrSer: 3.546 ± 0.051
3.192ThrThr: 3.192 ± 0.058
4.241ThrVal: 4.241 ± 0.067
0.56ThrTrp: 0.56 ± 0.023
2.089ThrTyr: 2.089 ± 0.043
0.0ThrXaa: 0.0 ± 0.0
Val
5.057ValAla: 5.057 ± 0.077
0.532ValCys: 0.532 ± 0.025
3.975ValAsp: 3.975 ± 0.057
4.907ValGlu: 4.907 ± 0.074
3.109ValPhe: 3.109 ± 0.054
5.011ValGly: 5.011 ± 0.082
1.398ValHis: 1.398 ± 0.038
5.986ValIle: 5.986 ± 0.07
4.514ValLys: 4.514 ± 0.064
6.72ValLeu: 6.72 ± 0.097
1.933ValMet: 1.933 ± 0.049
3.337ValAsn: 3.337 ± 0.061
2.739ValPro: 2.739 ± 0.051
2.247ValGln: 2.247 ± 0.045
2.421ValArg: 2.421 ± 0.046
4.611ValSer: 4.611 ± 0.068
4.228ValThr: 4.228 ± 0.062
5.258ValVal: 5.258 ± 0.077
0.646ValTrp: 0.646 ± 0.029
2.299ValTyr: 2.299 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.627TrpAla: 0.627 ± 0.026
0.061TrpCys: 0.061 ± 0.006
0.581TrpAsp: 0.581 ± 0.025
0.655TrpGlu: 0.655 ± 0.025
0.524TrpPhe: 0.524 ± 0.023
0.715TrpGly: 0.715 ± 0.026
0.2TrpHis: 0.2 ± 0.012
0.922TrpIle: 0.922 ± 0.03
0.71TrpLys: 0.71 ± 0.027
1.087TrpLeu: 1.087 ± 0.033
0.346TrpMet: 0.346 ± 0.02
0.547TrpAsn: 0.547 ± 0.021
0.28TrpPro: 0.28 ± 0.015
0.357TrpGln: 0.357 ± 0.016
0.365TrpArg: 0.365 ± 0.017
0.569TrpSer: 0.569 ± 0.028
0.53TrpThr: 0.53 ± 0.018
0.689TrpVal: 0.689 ± 0.026
0.132TrpTrp: 0.132 ± 0.012
0.365TrpTyr: 0.365 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.146TyrAla: 2.146 ± 0.04
0.302TyrCys: 0.302 ± 0.016
2.004TyrAsp: 2.004 ± 0.044
2.384TyrGlu: 2.384 ± 0.047
1.796TyrPhe: 1.796 ± 0.039
2.522TyrGly: 2.522 ± 0.044
0.863TyrHis: 0.863 ± 0.026
2.567TyrIle: 2.567 ± 0.051
2.09TyrLys: 2.09 ± 0.046
3.422TyrLeu: 3.422 ± 0.062
0.926TyrMet: 0.926 ± 0.028
1.58TyrAsn: 1.58 ± 0.037
1.455TyrPro: 1.455 ± 0.035
1.543TyrGln: 1.543 ± 0.035
1.492TyrArg: 1.492 ± 0.033
2.062TyrSer: 2.062 ± 0.044
1.906TyrThr: 1.906 ± 0.038
2.204TyrVal: 2.204 ± 0.037
0.393TyrTrp: 0.393 ± 0.017
1.455TyrTyr: 1.455 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3964 proteins (1189348 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski