Amino acid dipepetide frequency for Granulicella tundricola (strain ATCC BAA-1859 / DSM 23138 / MP5ACTX9)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.247AlaAla: 14.247 ± 0.138
0.941AlaCys: 0.941 ± 0.026
5.179AlaAsp: 5.179 ± 0.054
6.27AlaGlu: 6.27 ± 0.102
3.833AlaPhe: 3.833 ± 0.043
9.407AlaGly: 9.407 ± 0.103
2.273AlaHis: 2.273 ± 0.037
5.474AlaIle: 5.474 ± 0.062
4.304AlaLys: 4.304 ± 0.063
11.064AlaLeu: 11.064 ± 0.106
2.773AlaMet: 2.773 ± 0.041
3.545AlaAsn: 3.545 ± 0.055
5.522AlaPro: 5.522 ± 0.078
4.478AlaGln: 4.478 ± 0.069
5.749AlaArg: 5.749 ± 0.079
6.692AlaSer: 6.692 ± 0.075
7.018AlaThr: 7.018 ± 0.099
7.789AlaVal: 7.789 ± 0.073
1.309AlaTrp: 1.309 ± 0.039
2.678AlaTyr: 2.678 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
0.873CysAla: 0.873 ± 0.026
0.127CysCys: 0.127 ± 0.011
0.405CysAsp: 0.405 ± 0.018
0.393CysGlu: 0.393 ± 0.017
0.35CysPhe: 0.35 ± 0.015
0.924CysGly: 0.924 ± 0.028
0.226CysHis: 0.226 ± 0.015
0.41CysIle: 0.41 ± 0.021
0.251CysLys: 0.251 ± 0.012
0.814CysLeu: 0.814 ± 0.022
0.169CysMet: 0.169 ± 0.009
0.277CysAsn: 0.277 ± 0.018
0.419CysPro: 0.419 ± 0.019
0.225CysGln: 0.225 ± 0.009
0.469CysArg: 0.469 ± 0.021
0.573CysSer: 0.573 ± 0.02
0.6CysThr: 0.6 ± 0.023
0.625CysVal: 0.625 ± 0.022
0.118CysTrp: 0.118 ± 0.008
0.223CysTyr: 0.223 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
5.464AspAla: 5.464 ± 0.06
0.415AspCys: 0.415 ± 0.018
2.353AspAsp: 2.353 ± 0.038
3.007AspGlu: 3.007 ± 0.061
2.123AspPhe: 2.123 ± 0.035
4.548AspGly: 4.548 ± 0.086
1.257AspHis: 1.257 ± 0.031
2.206AspIle: 2.206 ± 0.04
1.796AspLys: 1.796 ± 0.034
5.382AspLeu: 5.382 ± 0.07
0.951AspMet: 0.951 ± 0.027
1.357AspAsn: 1.357 ± 0.036
3.314AspPro: 3.314 ± 0.053
1.871AspGln: 1.871 ± 0.039
2.989AspArg: 2.989 ± 0.057
2.764AspSer: 2.764 ± 0.043
2.721AspThr: 2.721 ± 0.045
3.504AspVal: 3.504 ± 0.044
0.772AspTrp: 0.772 ± 0.02
1.528AspTyr: 1.528 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
6.12GluAla: 6.12 ± 0.088
0.352GluCys: 0.352 ± 0.016
2.557GluAsp: 2.557 ± 0.05
3.011GluGlu: 3.011 ± 0.068
1.943GluPhe: 1.943 ± 0.039
3.624GluGly: 3.624 ± 0.056
1.261GluHis: 1.261 ± 0.028
3.045GluIle: 3.045 ± 0.055
2.164GluLys: 2.164 ± 0.056
5.397GluLeu: 5.397 ± 0.072
1.466GluMet: 1.466 ± 0.031
1.582GluAsn: 1.582 ± 0.029
2.348GluPro: 2.348 ± 0.049
2.321GluGln: 2.321 ± 0.042
3.835GluArg: 3.835 ± 0.069
2.991GluSer: 2.991 ± 0.054
3.168GluThr: 3.168 ± 0.047
3.695GluVal: 3.695 ± 0.058
0.641GluTrp: 0.641 ± 0.021
1.345GluTyr: 1.345 ± 0.03
0.0GluXaa: 0.0 ± 0.0
Phe
4.247PheAla: 4.247 ± 0.055
0.379PheCys: 0.379 ± 0.014
2.363PheAsp: 2.363 ± 0.037
1.982PheGlu: 1.982 ± 0.037
1.692PhePhe: 1.692 ± 0.035
3.556PheGly: 3.556 ± 0.06
0.976PheHis: 0.976 ± 0.029
1.591PheIle: 1.591 ± 0.024
1.186PheLys: 1.186 ± 0.027
3.695PheLeu: 3.695 ± 0.056
0.709PheMet: 0.709 ± 0.023
1.456PheAsn: 1.456 ± 0.035
1.783PhePro: 1.783 ± 0.03
1.306PheGln: 1.306 ± 0.03
2.094PheArg: 2.094 ± 0.036
2.695PheSer: 2.695 ± 0.044
2.696PheThr: 2.696 ± 0.041
2.671PheVal: 2.671 ± 0.042
0.521PheTrp: 0.521 ± 0.019
1.135PheTyr: 1.135 ± 0.031
0.0PheXaa: 0.0 ± 0.0
Gly
7.704GlyAla: 7.704 ± 0.083
0.84GlyCys: 0.84 ± 0.024
3.861GlyAsp: 3.861 ± 0.051
3.903GlyGlu: 3.903 ± 0.055
3.393GlyPhe: 3.393 ± 0.049
6.909GlyGly: 6.909 ± 0.107
1.868GlyHis: 1.868 ± 0.036
4.307GlyIle: 4.307 ± 0.062
3.677GlyLys: 3.677 ± 0.061
7.877GlyLeu: 7.877 ± 0.073
2.027GlyMet: 2.027 ± 0.037
2.772GlyAsn: 2.772 ± 0.069
3.329GlyPro: 3.329 ± 0.046
2.989GlyGln: 2.989 ± 0.048
4.485GlyArg: 4.485 ± 0.049
5.516GlySer: 5.516 ± 0.083
6.007GlyThr: 6.007 ± 0.121
5.999GlyVal: 5.999 ± 0.084
1.179GlyTrp: 1.179 ± 0.034
2.472GlyTyr: 2.472 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.256HisAla: 2.256 ± 0.041
0.235HisCys: 0.235 ± 0.013
1.221HisAsp: 1.221 ± 0.025
1.186HisGlu: 1.186 ± 0.033
1.025HisPhe: 1.025 ± 0.028
1.911HisGly: 1.911 ± 0.038
0.675HisHis: 0.675 ± 0.021
1.17HisIle: 1.17 ± 0.026
0.682HisLys: 0.682 ± 0.017
2.436HisLeu: 2.436 ± 0.052
0.481HisMet: 0.481 ± 0.018
0.686HisAsn: 0.686 ± 0.023
1.54HisPro: 1.54 ± 0.036
0.847HisGln: 0.847 ± 0.025
1.333HisArg: 1.333 ± 0.029
1.27HisSer: 1.27 ± 0.036
1.335HisThr: 1.335 ± 0.03
1.467HisVal: 1.467 ± 0.029
0.343HisTrp: 0.343 ± 0.016
0.693HisTyr: 0.693 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.149IleAla: 6.149 ± 0.05
0.427IleCys: 0.427 ± 0.019
2.82IleAsp: 2.82 ± 0.047
2.898IleGlu: 2.898 ± 0.044
1.82IlePhe: 1.82 ± 0.036
4.078IleGly: 4.078 ± 0.055
1.17IleHis: 1.17 ± 0.027
1.89IleIle: 1.89 ± 0.028
1.575IleLys: 1.575 ± 0.033
4.704IleLeu: 4.704 ± 0.081
0.749IleMet: 0.749 ± 0.026
1.677IleAsn: 1.677 ± 0.037
2.933IlePro: 2.933 ± 0.046
1.796IleGln: 1.796 ± 0.034
2.868IleArg: 2.868 ± 0.047
3.279IleSer: 3.279 ± 0.058
3.441IleThr: 3.441 ± 0.055
3.666IleVal: 3.666 ± 0.055
0.488IleTrp: 0.488 ± 0.02
1.275IleTyr: 1.275 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
4.123LysAla: 4.123 ± 0.069
0.183LysCys: 0.183 ± 0.014
2.07LysAsp: 2.07 ± 0.041
1.89LysGlu: 1.89 ± 0.04
1.146LysPhe: 1.146 ± 0.034
2.671LysGly: 2.671 ± 0.049
0.827LysHis: 0.827 ± 0.026
1.92LysIle: 1.92 ± 0.04
1.813LysLys: 1.813 ± 0.057
3.902LysLeu: 3.902 ± 0.044
1.006LysMet: 1.006 ± 0.024
1.21LysAsn: 1.21 ± 0.025
2.367LysPro: 2.367 ± 0.049
1.546LysGln: 1.546 ± 0.036
2.199LysArg: 2.199 ± 0.043
2.278LysSer: 2.278 ± 0.042
2.487LysThr: 2.487 ± 0.043
2.774LysVal: 2.774 ± 0.05
0.387LysTrp: 0.387 ± 0.018
0.974LysTyr: 0.974 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
11.599LeuAla: 11.599 ± 0.112
0.943LeuCys: 0.943 ± 0.025
5.211LeuAsp: 5.211 ± 0.049
5.271LeuGlu: 5.271 ± 0.083
3.552LeuPhe: 3.552 ± 0.045
7.543LeuGly: 7.543 ± 0.075
2.333LeuHis: 2.333 ± 0.043
4.866LeuIle: 4.866 ± 0.071
4.069LeuLys: 4.069 ± 0.046
10.188LeuLeu: 10.188 ± 0.112
2.07LeuMet: 2.07 ± 0.038
3.459LeuAsn: 3.459 ± 0.053
5.784LeuPro: 5.784 ± 0.073
3.576LeuGln: 3.576 ± 0.056
6.311LeuArg: 6.311 ± 0.081
6.581LeuSer: 6.581 ± 0.079
7.029LeuThr: 7.029 ± 0.069
6.409LeuVal: 6.409 ± 0.075
1.228LeuTrp: 1.228 ± 0.034
2.458LeuTyr: 2.458 ± 0.044
0.0LeuXaa: 0.0 ± 0.0
Met
2.413MetAla: 2.413 ± 0.04
0.14MetCys: 0.14 ± 0.009
1.047MetAsp: 1.047 ± 0.027
1.144MetGlu: 1.144 ± 0.028
0.638MetPhe: 0.638 ± 0.019
1.595MetGly: 1.595 ± 0.034
0.534MetHis: 0.534 ± 0.02
1.036MetIle: 1.036 ± 0.021
1.097MetLys: 1.097 ± 0.026
2.274MetLeu: 2.274 ± 0.04
0.56MetMet: 0.56 ± 0.02
0.794MetAsn: 0.794 ± 0.023
1.394MetPro: 1.394 ± 0.033
0.95MetGln: 0.95 ± 0.019
1.513MetArg: 1.513 ± 0.036
1.457MetSer: 1.457 ± 0.027
1.537MetThr: 1.537 ± 0.027
1.499MetVal: 1.499 ± 0.029
0.194MetTrp: 0.194 ± 0.012
0.392MetTyr: 0.392 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.465AsnAla: 3.465 ± 0.042
0.308AsnCys: 0.308 ± 0.018
1.597AsnAsp: 1.597 ± 0.032
1.519AsnGlu: 1.519 ± 0.034
1.419AsnPhe: 1.419 ± 0.04
3.214AsnGly: 3.214 ± 0.061
0.738AsnHis: 0.738 ± 0.024
1.595AsnIle: 1.595 ± 0.047
1.044AsnLys: 1.044 ± 0.028
3.48AsnLeu: 3.48 ± 0.045
0.588AsnMet: 0.588 ± 0.02
1.226AsnAsn: 1.226 ± 0.039
2.432AsnPro: 2.432 ± 0.05
1.349AsnGln: 1.349 ± 0.032
1.732AsnArg: 1.732 ± 0.037
2.077AsnSer: 2.077 ± 0.052
2.047AsnThr: 2.047 ± 0.046
2.428AsnVal: 2.428 ± 0.059
0.425AsnTrp: 0.425 ± 0.019
1.104AsnTyr: 1.104 ± 0.033
0.0AsnXaa: 0.0 ± 0.0
Pro
6.439ProAla: 6.439 ± 0.079
0.336ProCys: 0.336 ± 0.013
3.106ProAsp: 3.106 ± 0.048
3.522ProGlu: 3.522 ± 0.063
1.971ProPhe: 1.971 ± 0.043
4.691ProGly: 4.691 ± 0.048
1.186ProHis: 1.186 ± 0.029
2.722ProIle: 2.722 ± 0.046
2.005ProLys: 2.005 ± 0.04
4.708ProLeu: 4.708 ± 0.055
1.135ProMet: 1.135 ± 0.024
1.997ProAsn: 1.997 ± 0.041
2.657ProPro: 2.657 ± 0.064
2.069ProGln: 2.069 ± 0.042
2.449ProArg: 2.449 ± 0.046
3.403ProSer: 3.403 ± 0.048
3.828ProThr: 3.828 ± 0.055
4.099ProVal: 4.099 ± 0.062
0.647ProTrp: 0.647 ± 0.024
1.451ProTyr: 1.451 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
4.246GlnAla: 4.246 ± 0.064
0.213GlnCys: 0.213 ± 0.014
1.69GlnAsp: 1.69 ± 0.034
1.666GlnGlu: 1.666 ± 0.034
1.434GlnPhe: 1.434 ± 0.033
2.623GlnGly: 2.623 ± 0.05
0.929GlnHis: 0.929 ± 0.026
2.134GlnIle: 2.134 ± 0.035
1.424GlnLys: 1.424 ± 0.031
3.616GlnLeu: 3.616 ± 0.062
1.01GlnMet: 1.01 ± 0.03
1.293GlnAsn: 1.293 ± 0.033
2.224GlnPro: 2.224 ± 0.036
2.172GlnGln: 2.172 ± 0.066
2.426GlnArg: 2.426 ± 0.049
2.456GlnSer: 2.456 ± 0.047
2.664GlnThr: 2.664 ± 0.045
2.66GlnVal: 2.66 ± 0.041
0.49GlnTrp: 0.49 ± 0.024
1.009GlnTyr: 1.009 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
5.43ArgAla: 5.43 ± 0.068
0.445ArgCys: 0.445 ± 0.017
2.806ArgAsp: 2.806 ± 0.039
3.4ArgGlu: 3.4 ± 0.056
2.574ArgPhe: 2.574 ± 0.051
3.672ArgGly: 3.672 ± 0.048
1.314ArgHis: 1.314 ± 0.033
3.335ArgIle: 3.335 ± 0.053
2.252ArgLys: 2.252 ± 0.047
5.936ArgLeu: 5.936 ± 0.072
1.592ArgMet: 1.592 ± 0.032
1.963ArgAsn: 1.963 ± 0.042
2.794ArgPro: 2.794 ± 0.048
2.206ArgGln: 2.206 ± 0.044
3.863ArgArg: 3.863 ± 0.057
3.447ArgSer: 3.447 ± 0.054
3.567ArgThr: 3.567 ± 0.048
4.197ArgVal: 4.197 ± 0.074
0.906ArgTrp: 0.906 ± 0.022
1.723ArgTyr: 1.723 ± 0.033
0.0ArgXaa: 0.0 ± 0.0
Ser
6.801SerAla: 6.801 ± 0.091
0.53SerCys: 0.53 ± 0.018
2.981SerAsp: 2.981 ± 0.033
2.947SerGlu: 2.947 ± 0.049
2.763SerPhe: 2.763 ± 0.046
5.938SerGly: 5.938 ± 0.089
1.362SerHis: 1.362 ± 0.029
3.31SerIle: 3.31 ± 0.055
2.185SerLys: 2.185 ± 0.03
6.521SerLeu: 6.521 ± 0.081
1.312SerMet: 1.312 ± 0.029
2.286SerAsn: 2.286 ± 0.053
3.477SerPro: 3.477 ± 0.049
2.189SerGln: 2.189 ± 0.039
3.299SerArg: 3.299 ± 0.041
4.564SerSer: 4.564 ± 0.081
4.266SerThr: 4.266 ± 0.066
4.464SerVal: 4.464 ± 0.069
0.735SerTrp: 0.735 ± 0.026
1.75SerTyr: 1.75 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
7.512ThrAla: 7.512 ± 0.11
0.592ThrCys: 0.592 ± 0.022
3.094ThrAsp: 3.094 ± 0.058
2.857ThrGlu: 2.857 ± 0.045
2.684ThrPhe: 2.684 ± 0.05
5.854ThrGly: 5.854 ± 0.087
1.396ThrHis: 1.396 ± 0.034
3.475ThrIle: 3.475 ± 0.062
2.088ThrLys: 2.088 ± 0.035
7.133ThrLeu: 7.133 ± 0.079
1.178ThrMet: 1.178 ± 0.025
2.217ThrAsn: 2.217 ± 0.046
4.524ThrPro: 4.524 ± 0.071
2.41ThrGln: 2.41 ± 0.042
3.185ThrArg: 3.185 ± 0.043
4.26ThrSer: 4.26 ± 0.063
4.44ThrThr: 4.44 ± 0.095
5.126ThrVal: 5.126 ± 0.094
0.783ThrTrp: 0.783 ± 0.029
1.785ThrTyr: 1.785 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
7.756ValAla: 7.756 ± 0.069
0.695ValCys: 0.695 ± 0.022
3.809ValAsp: 3.809 ± 0.06
4.052ValGlu: 4.052 ± 0.073
2.687ValPhe: 2.687 ± 0.042
5.153ValGly: 5.153 ± 0.077
1.532ValHis: 1.532 ± 0.03
3.447ValIle: 3.447 ± 0.057
2.614ValLys: 2.614 ± 0.051
7.136ValLeu: 7.136 ± 0.074
1.584ValMet: 1.584 ± 0.032
2.454ValAsn: 2.454 ± 0.059
3.678ValPro: 3.678 ± 0.057
2.469ValGln: 2.469 ± 0.042
4.097ValArg: 4.097 ± 0.059
4.773ValSer: 4.773 ± 0.058
5.153ValThr: 5.153 ± 0.099
5.499ValVal: 5.499 ± 0.071
0.873ValTrp: 0.873 ± 0.027
1.768ValTyr: 1.768 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
0.997TrpAla: 0.997 ± 0.027
0.121TrpCys: 0.121 ± 0.009
0.574TrpAsp: 0.574 ± 0.021
0.547TrpGlu: 0.547 ± 0.022
0.533TrpPhe: 0.533 ± 0.022
0.858TrpGly: 0.858 ± 0.026
0.366TrpHis: 0.366 ± 0.018
0.668TrpIle: 0.668 ± 0.024
0.609TrpLys: 0.609 ± 0.022
1.446TrpLeu: 1.446 ± 0.036
0.389TrpMet: 0.389 ± 0.015
0.548TrpAsn: 0.548 ± 0.026
0.533TrpPro: 0.533 ± 0.021
0.588TrpGln: 0.588 ± 0.021
0.831TrpArg: 0.831 ± 0.029
0.855TrpSer: 0.855 ± 0.026
0.794TrpThr: 0.794 ± 0.023
0.826TrpVal: 0.826 ± 0.024
0.21TrpTrp: 0.21 ± 0.014
0.329TrpTyr: 0.329 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.705TyrAla: 2.705 ± 0.045
0.247TyrCys: 0.247 ± 0.012
1.583TyrAsp: 1.583 ± 0.037
1.401TyrGlu: 1.401 ± 0.028
1.15TyrPhe: 1.15 ± 0.029
2.35TyrGly: 2.35 ± 0.046
0.565TyrHis: 0.565 ± 0.021
1.102TyrIle: 1.102 ± 0.029
0.939TyrLys: 0.939 ± 0.03
2.771TyrLeu: 2.771 ± 0.038
0.455TyrMet: 0.455 ± 0.018
0.989TyrAsn: 0.989 ± 0.036
1.372TyrPro: 1.372 ± 0.028
1.023TyrGln: 1.023 ± 0.027
1.693TyrArg: 1.693 ± 0.034
1.745TyrSer: 1.745 ± 0.04
1.794TyrThr: 1.794 ± 0.041
1.831TyrVal: 1.831 ± 0.032
0.377TyrTrp: 0.377 ± 0.017
0.761TyrTyr: 0.761 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4514 proteins (1572207 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski