Amino acid dipeptide frequency for Marmota monax (Woodchuck)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
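The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used by the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping adjacent pairs within each protein are all assumptions made for the example.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs: a protein of length n contributes n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation: 1 / 400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / len(AMINO_ACIDS) ** 2

toy_proteome = ["MLLSA", "LLA"]  # hypothetical sequences, not real data
freqs = dipeptide_per_mille(toy_proteome)
for dp, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED else "under"
    print(dp, round(value, 1), label)
```

With only two short toy sequences every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only on a full proteome.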

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
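As a small worked example of this unit conversion, the snippet below turns one table value into a plain fraction and compares it with the uniform expectation of 0.25% (2.5‰). The helper name `per_mille_to_fraction` is hypothetical; the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_leu = 11.069          # LeuLeu, in per mille, from the table
uniform_expectation = 2.5  # 1/400 expressed in per mille

fraction = per_mille_to_fraction(leu_leu)
fold = leu_leu / uniform_expectation  # how many times above the uniform value

print(f"fraction = {fraction:.6f}, fold over uniform = {fold:.2f}")
```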

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.106 ± 0.037
AlaCys: 1.421 ± 0.011
AlaAsp: 2.83 ± 0.014
AlaGlu: 4.736 ± 0.024
AlaPhe: 2.561 ± 0.015
AlaGly: 4.943 ± 0.026
AlaHis: 1.623 ± 0.01
AlaIle: 2.702 ± 0.013
AlaLys: 3.275 ± 0.019
AlaLeu: 7.31 ± 0.033
AlaMet: 1.467 ± 0.01
AlaAsn: 1.929 ± 0.013
AlaPro: 4.557 ± 0.032
AlaGln: 3.353 ± 0.017
AlaArg: 3.85 ± 0.02
AlaSer: 5.915 ± 0.027
AlaThr: 3.675 ± 0.028
AlaVal: 4.64 ± 0.02
AlaTrp: 0.839 ± 0.009
AlaTyr: 1.462 ± 0.01
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.31 ± 0.009
CysCys: 0.66 ± 0.012
CysAsp: 0.989 ± 0.01
CysGlu: 1.262 ± 0.013
CysPhe: 0.823 ± 0.008
CysGly: 1.794 ± 0.017
CysHis: 0.686 ± 0.007
CysIle: 0.892 ± 0.01
CysLys: 1.112 ± 0.011
CysLeu: 2.178 ± 0.016
CysMet: 0.418 ± 0.006
CysAsn: 0.746 ± 0.009
CysPro: 1.497 ± 0.013
CysGln: 1.073 ± 0.012
CysArg: 1.335 ± 0.01
CysSer: 2.069 ± 0.015
CysThr: 1.1 ± 0.009
CysVal: 1.28 ± 0.01
CysTrp: 0.298 ± 0.005
CysTyr: 0.555 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.77 ± 0.015
AspCys: 1.029 ± 0.009
AspAsp: 2.533 ± 0.018
AspGlu: 3.248 ± 0.018
AspPhe: 2.016 ± 0.013
AspGly: 3.22 ± 0.021
AspHis: 1.127 ± 0.007
AspIle: 2.493 ± 0.015
AspLys: 2.407 ± 0.015
AspLeu: 4.939 ± 0.023
AspMet: 1.102 ± 0.008
AspAsn: 1.593 ± 0.012
AspPro: 2.989 ± 0.015
AspGln: 1.841 ± 0.012
AspArg: 2.447 ± 0.015
AspSer: 4.18 ± 0.021
AspThr: 2.428 ± 0.014
AspVal: 2.955 ± 0.022
AspTrp: 0.624 ± 0.008
AspTyr: 1.37 ± 0.011
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.29 ± 0.028
GluCys: 1.381 ± 0.02
GluAsp: 4.345 ± 0.02
GluGlu: 8.129 ± 0.053
GluPhe: 1.962 ± 0.011
GluGly: 4.285 ± 0.021
GluHis: 1.484 ± 0.01
GluIle: 2.975 ± 0.02
GluLys: 5.302 ± 0.036
GluLeu: 6.383 ± 0.03
GluMet: 1.658 ± 0.012
GluAsn: 2.987 ± 0.018
GluPro: 3.397 ± 0.023
GluGln: 3.179 ± 0.023
GluArg: 3.959 ± 0.022
GluSer: 4.357 ± 0.019
GluThr: 3.306 ± 0.017
GluVal: 4.203 ± 0.023
GluTrp: 0.686 ± 0.007
GluTyr: 1.484 ± 0.014
GluXaa: 0.001 ± 0.0
Phe
PheAla: 1.886 ± 0.013
PheCys: 0.966 ± 0.008
PheAsp: 1.565 ± 0.009
PheGlu: 1.863 ± 0.01
PhePhe: 1.574 ± 0.012
PheGly: 2.143 ± 0.015
PheHis: 1.021 ± 0.009
PheIle: 1.74 ± 0.012
PheLys: 1.679 ± 0.013
PheLeu: 4.109 ± 0.023
PheMet: 0.751 ± 0.007
PheAsn: 1.265 ± 0.01
PhePro: 2.07 ± 0.012
PheGln: 1.748 ± 0.011
PheArg: 1.917 ± 0.014
PheSer: 3.411 ± 0.017
PheThr: 1.949 ± 0.012
PheVal: 2.045 ± 0.013
PheTrp: 0.48 ± 0.005
PheTyr: 1.146 ± 0.01
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.725 ± 0.025
GlyCys: 1.362 ± 0.01
GlyAsp: 3.229 ± 0.02
GlyGlu: 4.202 ± 0.025
GlyPhe: 2.367 ± 0.018
GlyGly: 5.464 ± 0.036
GlyHis: 1.77 ± 0.013
GlyIle: 2.627 ± 0.013
GlyLys: 3.691 ± 0.022
GlyLeu: 6.07 ± 0.032
GlyMet: 1.289 ± 0.01
GlyAsn: 2.294 ± 0.014
GlyPro: 4.712 ± 0.042
GlyGln: 2.901 ± 0.015
GlyArg: 4.03 ± 0.021
GlySer: 6.068 ± 0.03
GlyThr: 3.717 ± 0.018
GlyVal: 3.529 ± 0.02
GlyTrp: 0.837 ± 0.009
GlyTyr: 1.647 ± 0.014
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.315 ± 0.009
HisCys: 0.722 ± 0.008
HisAsp: 0.837 ± 0.007
HisGlu: 1.268 ± 0.009
HisPhe: 1.121 ± 0.008
HisGly: 1.559 ± 0.012
HisHis: 0.926 ± 0.012
HisIle: 1.218 ± 0.009
HisLys: 1.228 ± 0.009
HisLeu: 3.017 ± 0.016
HisMet: 0.59 ± 0.006
HisAsn: 0.839 ± 0.008
HisPro: 1.776 ± 0.014
HisGln: 1.407 ± 0.013
HisArg: 1.664 ± 0.011
HisSer: 2.368 ± 0.015
HisThr: 1.6 ± 0.014
HisVal: 1.48 ± 0.009
HisTrp: 0.36 ± 0.005
HisTyr: 0.783 ± 0.007
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.409 ± 0.013
IleCys: 1.056 ± 0.009
IleAsp: 1.885 ± 0.013
IleGlu: 2.45 ± 0.016
IlePhe: 1.789 ± 0.013
IleGly: 2.065 ± 0.013
IleHis: 1.322 ± 0.011
IleIle: 2.319 ± 0.021
IleLys: 2.46 ± 0.019
IleLeu: 4.47 ± 0.019
IleMet: 0.967 ± 0.008
IleAsn: 1.703 ± 0.011
IlePro: 2.597 ± 0.014
IleGln: 2.228 ± 0.016
IleArg: 2.254 ± 0.013
IleSer: 3.639 ± 0.017
IleThr: 2.591 ± 0.03
IleVal: 2.353 ± 0.019
IleTrp: 0.48 ± 0.004
IleTyr: 1.322 ± 0.01
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.939 ± 0.021
LysCys: 1.087 ± 0.012
LysAsp: 3.042 ± 0.023
LysGlu: 5.003 ± 0.032
LysPhe: 1.623 ± 0.012
LysGly: 3.171 ± 0.029
LysHis: 1.336 ± 0.011
LysIle: 2.592 ± 0.017
LysLys: 4.576 ± 0.036
LysLeu: 4.937 ± 0.025
LysMet: 1.451 ± 0.012
LysAsn: 2.279 ± 0.015
LysPro: 3.133 ± 0.026
LysGln: 2.491 ± 0.016
LysArg: 3.158 ± 0.017
LysSer: 3.833 ± 0.019
LysThr: 2.978 ± 0.019
LysVal: 3.384 ± 0.025
LysTrp: 0.576 ± 0.007
LysTyr: 1.451 ± 0.013
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.904 ± 0.028
LeuCys: 2.25 ± 0.014
LeuAsp: 4.654 ± 0.021
LeuGlu: 7.114 ± 0.031
LeuPhe: 3.327 ± 0.018
LeuGly: 6.174 ± 0.031
LeuHis: 2.803 ± 0.016
LeuIle: 3.761 ± 0.021
LeuLys: 5.591 ± 0.028
LeuLeu: 11.069 ± 0.053
LeuMet: 2.038 ± 0.012
LeuAsn: 3.353 ± 0.015
LeuPro: 6.373 ± 0.028
LeuGln: 5.895 ± 0.03
LeuArg: 6.049 ± 0.025
LeuSer: 8.209 ± 0.029
LeuThr: 5.08 ± 0.021
LeuVal: 5.577 ± 0.025
LeuTrp: 1.193 ± 0.01
LeuTyr: 2.433 ± 0.015
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.951 ± 0.012
MetCys: 0.389 ± 0.005
MetAsp: 1.232 ± 0.01
MetGlu: 1.866 ± 0.012
MetPhe: 0.705 ± 0.007
MetGly: 1.315 ± 0.012
MetHis: 0.456 ± 0.005
MetIle: 0.807 ± 0.007
MetLys: 1.385 ± 0.01
MetLeu: 1.963 ± 0.011
MetMet: 0.575 ± 0.008
MetAsn: 0.859 ± 0.009
MetPro: 1.095 ± 0.014
MetGln: 0.932 ± 0.009
MetArg: 1.025 ± 0.008
MetSer: 1.58 ± 0.01
MetThr: 1.101 ± 0.009
MetVal: 1.439 ± 0.013
MetTrp: 0.241 ± 0.004
MetTyr: 0.611 ± 0.006
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.941 ± 0.013
AsnCys: 0.792 ± 0.008
AsnAsp: 1.422 ± 0.01
AsnGlu: 2.104 ± 0.013
AsnPhe: 1.412 ± 0.01
AsnGly: 2.288 ± 0.014
AsnHis: 0.915 ± 0.009
AsnIle: 1.976 ± 0.016
AsnLys: 2.125 ± 0.014
AsnLeu: 3.634 ± 0.015
AsnMet: 0.865 ± 0.008
AsnAsn: 1.428 ± 0.012
AsnPro: 2.161 ± 0.015
AsnGln: 1.638 ± 0.014
AsnArg: 1.76 ± 0.01
AsnSer: 3.046 ± 0.018
AsnThr: 1.894 ± 0.013
AsnVal: 2.09 ± 0.015
AsnTrp: 0.425 ± 0.005
AsnTyr: 1.031 ± 0.009
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.231 ± 0.033
ProCys: 1.237 ± 0.012
ProAsp: 2.791 ± 0.014
ProGlu: 4.638 ± 0.022
ProPhe: 1.951 ± 0.012
ProGly: 5.819 ± 0.062
ProHis: 1.554 ± 0.012
ProIle: 1.855 ± 0.015
ProLys: 2.826 ± 0.027
ProLeu: 5.621 ± 0.026
ProMet: 1.128 ± 0.011
ProAsn: 1.806 ± 0.013
ProPro: 6.775 ± 0.063
ProGln: 3.103 ± 0.025
ProArg: 3.732 ± 0.022
ProSer: 6.075 ± 0.032
ProThr: 3.28 ± 0.022
ProVal: 3.963 ± 0.021
ProTrp: 0.755 ± 0.008
ProTyr: 1.46 ± 0.013
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 3.593 ± 0.02
GlnCys: 0.929 ± 0.011
GlnAsp: 2.382 ± 0.013
GlnGlu: 4.022 ± 0.026
GlnPhe: 1.336 ± 0.01
GlnGly: 3.054 ± 0.019
GlnHis: 1.329 ± 0.011
GlnIle: 1.924 ± 0.014
GlnLys: 2.907 ± 0.02
GlnLeu: 4.838 ± 0.026
GlnMet: 1.118 ± 0.009
GlnAsn: 1.851 ± 0.016
GlnPro: 2.976 ± 0.021
GlnGln: 3.196 ± 0.038
GlnArg: 3.047 ± 0.017
GlnSer: 3.194 ± 0.018
GlnThr: 2.268 ± 0.013
GlnVal: 2.943 ± 0.013
GlnTrp: 0.541 ± 0.006
GlnTyr: 1.093 ± 0.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.137 ± 0.022
ArgCys: 1.223 ± 0.011
ArgAsp: 2.703 ± 0.015
ArgGlu: 3.966 ± 0.02
ArgPhe: 1.803 ± 0.011
ArgGly: 3.922 ± 0.027
ArgHis: 1.564 ± 0.012
ArgIle: 2.317 ± 0.013
ArgLys: 3.588 ± 0.019
ArgLeu: 5.457 ± 0.025
ArgMet: 1.178 ± 0.009
ArgAsn: 2.058 ± 0.013
ArgPro: 3.565 ± 0.021
ArgGln: 2.639 ± 0.017
ArgArg: 4.582 ± 0.028
ArgSer: 4.348 ± 0.029
ArgThr: 2.837 ± 0.016
ArgVal: 3.24 ± 0.02
ArgTrp: 0.728 ± 0.007
ArgTyr: 1.393 ± 0.009
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.456 ± 0.027
SerCys: 1.92 ± 0.016
SerAsp: 3.803 ± 0.023
SerGlu: 5.175 ± 0.022
SerPhe: 3.011 ± 0.015
SerGly: 5.874 ± 0.031
SerHis: 2.218 ± 0.013
SerIle: 3.131 ± 0.015
SerLys: 3.969 ± 0.02
SerLeu: 8.407 ± 0.032
SerMet: 1.608 ± 0.01
SerAsn: 2.631 ± 0.015
SerPro: 6.43 ± 0.039
SerGln: 4.029 ± 0.021
SerArg: 4.696 ± 0.025
SerSer: 9.673 ± 0.061
SerThr: 4.549 ± 0.024
SerVal: 4.906 ± 0.021
SerTrp: 1.12 ± 0.01
SerTyr: 2.051 ± 0.012
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.724 ± 0.018
ThrCys: 1.322 ± 0.013
ThrAsp: 2.347 ± 0.012
ThrGlu: 3.485 ± 0.017
ThrPhe: 2.025 ± 0.013
ThrGly: 3.592 ± 0.022
ThrHis: 1.337 ± 0.011
ThrIle: 2.457 ± 0.026
ThrLys: 2.556 ± 0.019
ThrLeu: 5.353 ± 0.02
ThrMet: 1.107 ± 0.009
ThrAsn: 1.683 ± 0.012
ThrPro: 3.74 ± 0.024
ThrGln: 2.371 ± 0.015
ThrArg: 2.502 ± 0.013
ThrSer: 4.726 ± 0.03
ThrThr: 3.15 ± 0.046
ThrVal: 3.814 ± 0.024
ThrTrp: 0.707 ± 0.009
ThrTyr: 1.349 ± 0.011
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.322 ± 0.02
ValCys: 1.442 ± 0.011
ValAsp: 2.873 ± 0.017
ValGlu: 3.796 ± 0.023
ValPhe: 2.322 ± 0.015
ValGly: 3.446 ± 0.017
ValHis: 1.583 ± 0.01
ValIle: 2.79 ± 0.018
ValLys: 3.2 ± 0.025
ValLeu: 6.28 ± 0.024
ValMet: 1.336 ± 0.01
ValAsn: 2.133 ± 0.013
ValPro: 3.89 ± 0.027
ValGln: 2.781 ± 0.014
ValArg: 2.989 ± 0.015
ValSer: 4.946 ± 0.022
ValThr: 3.753 ± 0.031
ValVal: 3.995 ± 0.024
ValTrp: 0.692 ± 0.007
ValTyr: 1.526 ± 0.009
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.859 ± 0.009
TrpCys: 0.237 ± 0.004
TrpAsp: 0.634 ± 0.007
TrpGlu: 0.816 ± 0.007
TrpPhe: 0.426 ± 0.006
TrpGly: 0.799 ± 0.008
TrpHis: 0.302 ± 0.004
TrpIle: 0.513 ± 0.006
TrpLys: 0.787 ± 0.007
TrpLeu: 1.22 ± 0.012
TrpMet: 0.309 ± 0.005
TrpAsn: 0.508 ± 0.005
TrpPro: 0.593 ± 0.006
TrpGln: 0.537 ± 0.007
TrpArg: 0.759 ± 0.007
TrpSer: 0.878 ± 0.007
TrpThr: 0.677 ± 0.008
TrpVal: 0.73 ± 0.007
TrpTrp: 0.197 ± 0.003
TrpTyr: 0.33 ± 0.006
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.323 ± 0.01
TyrCys: 0.653 ± 0.008
TyrAsp: 1.218 ± 0.01
TyrGlu: 1.609 ± 0.013
TyrPhe: 1.174 ± 0.01
TyrGly: 1.602 ± 0.013
TyrHis: 0.723 ± 0.007
TyrIle: 1.27 ± 0.011
TyrLys: 1.375 ± 0.015
TyrLeu: 2.594 ± 0.015
TyrMet: 0.573 ± 0.006
TyrAsn: 1.001 ± 0.008
TyrPro: 1.274 ± 0.008
TyrGln: 1.202 ± 0.009
TyrArg: 1.493 ± 0.011
TyrSer: 2.138 ± 0.014
TyrThr: 1.417 ± 0.012
TyrVal: 1.528 ± 0.01
TyrTrp: 0.33 ± 0.005
TyrTyr: 0.89 ± 0.008
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.001 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.007 ± 0.004
Statistics based on 41281 proteins (18551965 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
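A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the sample standard deviation. The source does not say which spread measure is used, and the function names and toy sequences here are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteome = ["MLLSA", "LLA", "AAGLL", "MKV"]  # hypothetical sequences
mean, err = bootstrap_error(toy_proteome, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} (per mille)")
```

Resampling whole proteins keeps the dependence between dipeptides of the same sequence intact, which resampling single dipeptides would not.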

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski