Amino acid dipepetide frequency for Anaerosporomusa subterranea

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.044AlaAla: 11.044 ± 0.133
1.11AlaCys: 1.11 ± 0.033
4.918AlaAsp: 4.918 ± 0.065
5.876AlaGlu: 5.876 ± 0.073
3.488AlaPhe: 3.488 ± 0.063
7.91AlaGly: 7.91 ± 0.105
1.462AlaHis: 1.462 ± 0.036
7.01AlaIle: 7.01 ± 0.089
5.553AlaLys: 5.553 ± 0.076
9.417AlaLeu: 9.417 ± 0.124
2.844AlaMet: 2.844 ± 0.047
3.283AlaAsn: 3.283 ± 0.058
2.866AlaPro: 2.866 ± 0.059
3.21AlaGln: 3.21 ± 0.053
4.148AlaArg: 4.148 ± 0.066
5.147AlaSer: 5.147 ± 0.08
4.697AlaThr: 4.697 ± 0.078
8.137AlaVal: 8.137 ± 0.107
0.795AlaTrp: 0.795 ± 0.03
2.565AlaTyr: 2.565 ± 0.05
0.0AlaXaa: 0.0 ± 0.0
Cys
0.879CysAla: 0.879 ± 0.025
0.208CysCys: 0.208 ± 0.015
0.604CysAsp: 0.604 ± 0.022
0.596CysGlu: 0.596 ± 0.026
0.464CysPhe: 0.464 ± 0.02
1.307CysGly: 1.307 ± 0.038
0.271CysHis: 0.271 ± 0.018
0.792CysIle: 0.792 ± 0.028
0.458CysLys: 0.458 ± 0.022
1.15CysLeu: 1.15 ± 0.029
0.278CysMet: 0.278 ± 0.016
0.378CysAsn: 0.378 ± 0.019
0.63CysPro: 0.63 ± 0.028
0.454CysGln: 0.454 ± 0.02
0.682CysArg: 0.682 ± 0.024
0.707CysSer: 0.707 ± 0.023
0.532CysThr: 0.532 ± 0.024
0.778CysVal: 0.778 ± 0.027
0.127CysTrp: 0.127 ± 0.01
0.329CysTyr: 0.329 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
4.196AspAla: 4.196 ± 0.064
0.643AspCys: 0.643 ± 0.026
2.211AspAsp: 2.211 ± 0.051
3.212AspGlu: 3.212 ± 0.054
2.115AspPhe: 2.115 ± 0.041
3.495AspGly: 3.495 ± 0.06
0.884AspHis: 0.884 ± 0.027
4.259AspIle: 4.259 ± 0.071
2.857AspLys: 2.857 ± 0.052
4.724AspLeu: 4.724 ± 0.064
1.391AspMet: 1.391 ± 0.04
1.843AspAsn: 1.843 ± 0.039
2.066AspPro: 2.066 ± 0.042
1.703AspGln: 1.703 ± 0.04
2.506AspArg: 2.506 ± 0.043
2.811AspSer: 2.811 ± 0.054
2.449AspThr: 2.449 ± 0.059
3.751AspVal: 3.751 ± 0.063
0.582AspTrp: 0.582 ± 0.023
1.79AspTyr: 1.79 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
5.568GluAla: 5.568 ± 0.063
0.585GluCys: 0.585 ± 0.025
2.528GluAsp: 2.528 ± 0.048
3.837GluGlu: 3.837 ± 0.061
2.178GluPhe: 2.178 ± 0.049
3.373GluGly: 3.373 ± 0.053
1.159GluHis: 1.159 ± 0.032
4.59GluIle: 4.59 ± 0.076
4.149GluLys: 4.149 ± 0.075
6.397GluLeu: 6.397 ± 0.071
1.869GluMet: 1.869 ± 0.046
2.448GluAsn: 2.448 ± 0.045
1.96GluPro: 1.96 ± 0.048
2.966GluGln: 2.966 ± 0.054
3.437GluArg: 3.437 ± 0.062
2.873GluSer: 2.873 ± 0.044
3.274GluThr: 3.274 ± 0.057
4.3GluVal: 4.3 ± 0.067
0.56GluTrp: 0.56 ± 0.024
1.866GluTyr: 1.866 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
3.827PheAla: 3.827 ± 0.058
0.569PheCys: 0.569 ± 0.023
2.143PheAsp: 2.143 ± 0.047
1.857PheGlu: 1.857 ± 0.039
1.748PhePhe: 1.748 ± 0.041
3.395PheGly: 3.395 ± 0.057
0.728PheHis: 0.728 ± 0.03
2.887PheIle: 2.887 ± 0.049
1.538PheLys: 1.538 ± 0.036
3.755PheLeu: 3.755 ± 0.065
0.956PheMet: 0.956 ± 0.031
1.417PheAsn: 1.417 ± 0.038
1.549PhePro: 1.549 ± 0.034
1.232PheGln: 1.232 ± 0.036
1.818PheArg: 1.818 ± 0.044
2.882PheSer: 2.882 ± 0.056
2.273PheThr: 2.273 ± 0.044
2.808PheVal: 2.808 ± 0.057
0.429PheTrp: 0.429 ± 0.023
1.141PheTyr: 1.141 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
6.553GlyAla: 6.553 ± 0.098
1.126GlyCys: 1.126 ± 0.035
3.419GlyAsp: 3.419 ± 0.067
4.228GlyGlu: 4.228 ± 0.063
3.263GlyPhe: 3.263 ± 0.051
5.902GlyGly: 5.902 ± 0.09
1.539GlyHis: 1.539 ± 0.04
6.177GlyIle: 6.177 ± 0.074
4.808GlyLys: 4.808 ± 0.084
7.851GlyLeu: 7.851 ± 0.088
2.446GlyMet: 2.446 ± 0.051
2.618GlyAsn: 2.618 ± 0.059
2.146GlyPro: 2.146 ± 0.051
2.771GlyGln: 2.771 ± 0.066
3.715GlyArg: 3.715 ± 0.06
4.314GlySer: 4.314 ± 0.065
4.081GlyThr: 4.081 ± 0.065
6.262GlyVal: 6.262 ± 0.085
0.778GlyTrp: 0.778 ± 0.029
2.661GlyTyr: 2.661 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
1.509HisAla: 1.509 ± 0.042
0.29HisCys: 0.29 ± 0.016
0.989HisAsp: 0.989 ± 0.029
1.049HisGlu: 1.049 ± 0.031
0.781HisPhe: 0.781 ± 0.027
1.528HisGly: 1.528 ± 0.04
0.493HisHis: 0.493 ± 0.024
1.378HisIle: 1.378 ± 0.032
0.834HisLys: 0.834 ± 0.029
1.738HisLeu: 1.738 ± 0.039
0.495HisMet: 0.495 ± 0.022
0.736HisAsn: 0.736 ± 0.027
1.047HisPro: 1.047 ± 0.034
0.741HisGln: 0.741 ± 0.026
0.91HisArg: 0.91 ± 0.027
1.114HisSer: 1.114 ± 0.037
1.039HisThr: 1.039 ± 0.031
1.239HisVal: 1.239 ± 0.031
0.214HisTrp: 0.214 ± 0.014
0.631HisTyr: 0.631 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
7.621IleAla: 7.621 ± 0.086
0.858IleCys: 0.858 ± 0.026
3.909IleAsp: 3.909 ± 0.064
4.32IleGlu: 4.32 ± 0.055
2.444IlePhe: 2.444 ± 0.055
5.965IleGly: 5.965 ± 0.075
1.373IleHis: 1.373 ± 0.038
5.165IleIle: 5.165 ± 0.082
3.549IleLys: 3.549 ± 0.065
6.744IleLeu: 6.744 ± 0.085
1.667IleMet: 1.667 ± 0.043
2.83IleAsn: 2.83 ± 0.06
3.318IlePro: 3.318 ± 0.06
2.248IleGln: 2.248 ± 0.046
3.504IleArg: 3.504 ± 0.059
4.519IleSer: 4.519 ± 0.071
4.266IleThr: 4.266 ± 0.071
5.661IleVal: 5.661 ± 0.069
0.608IleTrp: 0.608 ± 0.025
1.917IleTyr: 1.917 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
4.989LysAla: 4.989 ± 0.078
0.437LysCys: 0.437 ± 0.021
2.6LysAsp: 2.6 ± 0.052
3.629LysGlu: 3.629 ± 0.077
1.536LysPhe: 1.536 ± 0.039
3.325LysGly: 3.325 ± 0.058
0.906LysHis: 0.906 ± 0.029
3.717LysIle: 3.717 ± 0.058
3.174LysLys: 3.174 ± 0.064
5.245LysLeu: 5.245 ± 0.074
1.614LysMet: 1.614 ± 0.032
2.144LysAsn: 2.144 ± 0.05
2.299LysPro: 2.299 ± 0.044
2.626LysGln: 2.626 ± 0.041
2.626LysArg: 2.626 ± 0.043
2.836LysSer: 2.836 ± 0.055
3.141LysThr: 3.141 ± 0.056
4.072LysVal: 4.072 ± 0.065
0.46LysTrp: 0.46 ± 0.023
1.606LysTyr: 1.606 ± 0.044
0.0LysXaa: 0.0 ± 0.0
Leu
11.08LeuAla: 11.08 ± 0.115
1.15LeuCys: 1.15 ± 0.033
4.883LeuAsp: 4.883 ± 0.072
5.578LeuGlu: 5.578 ± 0.081
4.158LeuPhe: 4.158 ± 0.068
7.794LeuGly: 7.794 ± 0.094
1.75LeuHis: 1.75 ± 0.042
6.471LeuIle: 6.471 ± 0.079
4.928LeuLys: 4.928 ± 0.068
10.832LeuLeu: 10.832 ± 0.136
2.466LeuMet: 2.466 ± 0.047
3.564LeuAsn: 3.564 ± 0.06
4.682LeuPro: 4.682 ± 0.066
3.555LeuGln: 3.555 ± 0.06
4.995LeuArg: 4.995 ± 0.081
6.802LeuSer: 6.802 ± 0.077
6.163LeuThr: 6.163 ± 0.085
7.182LeuVal: 7.182 ± 0.086
0.889LeuTrp: 0.889 ± 0.028
2.7LeuTyr: 2.7 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.81MetAla: 2.81 ± 0.048
0.235MetCys: 0.235 ± 0.016
1.255MetAsp: 1.255 ± 0.036
1.543MetGlu: 1.543 ± 0.039
0.909MetPhe: 0.909 ± 0.026
2.072MetGly: 2.072 ± 0.046
0.443MetHis: 0.443 ± 0.02
1.864MetIle: 1.864 ± 0.042
1.663MetLys: 1.663 ± 0.041
2.904MetLeu: 2.904 ± 0.052
0.746MetMet: 0.746 ± 0.028
1.138MetAsn: 1.138 ± 0.03
1.254MetPro: 1.254 ± 0.037
1.125MetGln: 1.125 ± 0.033
1.358MetArg: 1.358 ± 0.036
1.699MetSer: 1.699 ± 0.046
1.625MetThr: 1.625 ± 0.039
1.947MetVal: 1.947 ± 0.042
0.178MetTrp: 0.178 ± 0.012
0.624MetTyr: 0.624 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.083AsnAla: 3.083 ± 0.052
0.468AsnCys: 0.468 ± 0.02
1.702AsnAsp: 1.702 ± 0.049
1.926AsnGlu: 1.926 ± 0.05
1.326AsnPhe: 1.326 ± 0.038
2.778AsnGly: 2.778 ± 0.061
0.743AsnHis: 0.743 ± 0.026
2.771AsnIle: 2.771 ± 0.057
1.911AsnLys: 1.911 ± 0.051
3.81AsnLeu: 3.81 ± 0.061
1.016AsnMet: 1.016 ± 0.03
1.509AsnAsn: 1.509 ± 0.044
2.137AsnPro: 2.137 ± 0.053
1.515AsnGln: 1.515 ± 0.045
1.928AsnArg: 1.928 ± 0.042
2.243AsnSer: 2.243 ± 0.057
1.95AsnThr: 1.95 ± 0.046
2.548AsnVal: 2.548 ± 0.054
0.441AsnTrp: 0.441 ± 0.023
1.215AsnTyr: 1.215 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
3.596ProAla: 3.596 ± 0.067
0.421ProCys: 0.421 ± 0.019
2.502ProAsp: 2.502 ± 0.051
3.129ProGlu: 3.129 ± 0.057
1.763ProPhe: 1.763 ± 0.044
3.405ProGly: 3.405 ± 0.061
0.824ProHis: 0.824 ± 0.028
2.663ProIle: 2.663 ± 0.052
1.864ProLys: 1.864 ± 0.042
3.987ProLeu: 3.987 ± 0.069
1.002ProMet: 1.002 ± 0.033
1.396ProAsn: 1.396 ± 0.034
1.439ProPro: 1.439 ± 0.036
1.557ProGln: 1.557 ± 0.04
1.607ProArg: 1.607 ± 0.034
2.117ProSer: 2.117 ± 0.046
2.133ProThr: 2.133 ± 0.041
3.608ProVal: 3.608 ± 0.056
0.415ProTrp: 0.415 ± 0.021
1.271ProTyr: 1.271 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
4.056GlnAla: 4.056 ± 0.067
0.352GlnCys: 0.352 ± 0.019
1.724GlnAsp: 1.724 ± 0.044
2.61GlnGlu: 2.61 ± 0.051
1.375GlnPhe: 1.375 ± 0.034
2.597GlnGly: 2.597 ± 0.053
0.743GlnHis: 0.743 ± 0.026
2.657GlnIle: 2.657 ± 0.053
2.036GlnLys: 2.036 ± 0.041
3.664GlnLeu: 3.664 ± 0.057
1.148GlnMet: 1.148 ± 0.03
1.404GlnAsn: 1.404 ± 0.043
1.676GlnPro: 1.676 ± 0.04
1.97GlnGln: 1.97 ± 0.053
1.904GlnArg: 1.904 ± 0.041
2.098GlnSer: 2.098 ± 0.044
2.2GlnThr: 2.2 ± 0.046
2.81GlnVal: 2.81 ± 0.046
0.305GlnTrp: 0.305 ± 0.017
1.176GlnTyr: 1.176 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
3.529ArgAla: 3.529 ± 0.057
0.527ArgCys: 0.527 ± 0.024
2.476ArgAsp: 2.476 ± 0.051
3.635ArgGlu: 3.635 ± 0.062
1.979ArgPhe: 1.979 ± 0.042
3.08ArgGly: 3.08 ± 0.055
1.06ArgHis: 1.06 ± 0.031
3.487ArgIle: 3.487 ± 0.065
2.622ArgLys: 2.622 ± 0.047
5.354ArgLeu: 5.354 ± 0.08
1.522ArgMet: 1.522 ± 0.038
1.92ArgAsn: 1.92 ± 0.037
1.975ArgPro: 1.975 ± 0.046
2.646ArgGln: 2.646 ± 0.05
2.883ArgArg: 2.883 ± 0.056
2.598ArgSer: 2.598 ± 0.047
2.426ArgThr: 2.426 ± 0.045
3.492ArgVal: 3.492 ± 0.059
0.494ArgTrp: 0.494 ± 0.022
1.664ArgTyr: 1.664 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.944SerAla: 4.944 ± 0.074
0.647SerCys: 0.647 ± 0.027
2.78SerAsp: 2.78 ± 0.043
3.254SerGlu: 3.254 ± 0.053
2.451SerPhe: 2.451 ± 0.052
5.095SerGly: 5.095 ± 0.071
1.165SerHis: 1.165 ± 0.033
4.237SerIle: 4.237 ± 0.07
2.676SerLys: 2.676 ± 0.05
6.434SerLeu: 6.434 ± 0.074
1.565SerMet: 1.565 ± 0.04
1.95SerAsn: 1.95 ± 0.043
2.371SerPro: 2.371 ± 0.043
2.323SerGln: 2.323 ± 0.053
3.06SerArg: 3.06 ± 0.055
3.345SerSer: 3.345 ± 0.065
2.942SerThr: 2.942 ± 0.056
4.647SerVal: 4.647 ± 0.065
0.552SerTrp: 0.552 ± 0.023
1.734SerTyr: 1.734 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
5.659ThrAla: 5.659 ± 0.091
0.57ThrCys: 0.57 ± 0.022
2.692ThrAsp: 2.692 ± 0.048
2.962ThrGlu: 2.962 ± 0.053
2.068ThrPhe: 2.068 ± 0.045
4.909ThrGly: 4.909 ± 0.075
0.994ThrHis: 0.994 ± 0.031
3.997ThrIle: 3.997 ± 0.067
2.492ThrLys: 2.492 ± 0.05
5.606ThrLeu: 5.606 ± 0.076
1.313ThrMet: 1.313 ± 0.033
1.931ThrAsn: 1.931 ± 0.04
2.713ThrPro: 2.713 ± 0.047
1.641ThrGln: 1.641 ± 0.04
2.322ThrArg: 2.322 ± 0.04
2.737ThrSer: 2.737 ± 0.046
3.013ThrThr: 3.013 ± 0.059
4.977ThrVal: 4.977 ± 0.065
0.445ThrTrp: 0.445 ± 0.02
1.504ThrTyr: 1.504 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
7.399ValAla: 7.399 ± 0.096
0.914ValCys: 0.914 ± 0.035
4.098ValAsp: 4.098 ± 0.068
4.58ValGlu: 4.58 ± 0.068
3.147ValPhe: 3.147 ± 0.054
5.605ValGly: 5.605 ± 0.084
1.277ValHis: 1.277 ± 0.035
5.826ValIle: 5.826 ± 0.075
4.146ValLys: 4.146 ± 0.072
7.658ValLeu: 7.658 ± 0.089
2.125ValMet: 2.125 ± 0.047
3.008ValAsn: 3.008 ± 0.054
2.867ValPro: 2.867 ± 0.055
2.345ValGln: 2.345 ± 0.048
3.629ValArg: 3.629 ± 0.062
4.937ValSer: 4.937 ± 0.069
4.333ValThr: 4.333 ± 0.056
6.255ValVal: 6.255 ± 0.098
0.677ValTrp: 0.677 ± 0.026
2.157ValTyr: 2.157 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.672TrpAla: 0.672 ± 0.025
0.108TrpCys: 0.108 ± 0.01
0.439TrpAsp: 0.439 ± 0.022
0.512TrpGlu: 0.512 ± 0.023
0.419TrpPhe: 0.419 ± 0.019
0.751TrpGly: 0.751 ± 0.03
0.23TrpHis: 0.23 ± 0.014
0.538TrpIle: 0.538 ± 0.024
0.408TrpLys: 0.408 ± 0.019
1.25TrpLeu: 1.25 ± 0.041
0.266TrpMet: 0.266 ± 0.016
0.352TrpAsn: 0.352 ± 0.02
0.366TrpPro: 0.366 ± 0.019
0.562TrpGln: 0.562 ± 0.023
0.608TrpArg: 0.608 ± 0.024
0.512TrpSer: 0.512 ± 0.023
0.439TrpThr: 0.439 ± 0.024
0.573TrpVal: 0.573 ± 0.024
0.132TrpTrp: 0.132 ± 0.012
0.282TrpTyr: 0.282 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.466TyrAla: 2.466 ± 0.048
0.405TyrCys: 0.405 ± 0.02
1.616TyrAsp: 1.616 ± 0.04
1.586TyrGlu: 1.586 ± 0.036
1.343TyrPhe: 1.343 ± 0.032
2.374TyrGly: 2.374 ± 0.051
0.709TyrHis: 0.709 ± 0.024
1.999TyrIle: 1.999 ± 0.047
1.317TyrLys: 1.317 ± 0.04
3.103TyrLeu: 3.103 ± 0.058
0.661TyrMet: 0.661 ± 0.026
1.148TyrAsn: 1.148 ± 0.041
1.339TyrPro: 1.339 ± 0.034
1.317TyrGln: 1.317 ± 0.039
1.731TyrArg: 1.731 ± 0.043
1.878TyrSer: 1.878 ± 0.041
1.568TyrThr: 1.568 ± 0.039
1.934TyrVal: 1.934 ± 0.042
0.339TyrTrp: 0.339 ± 0.018
1.106TyrTyr: 1.106 ± 0.029
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3585 proteins (1103724 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski