Amino acid dipeptide frequency for Anaerosacchriphilus polymeriproducens

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
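The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (dipeptide_per_mille, classify) and the toy sequences are made up here, pairs are counted as overlapping neighbours within each protein, and the red/blue marking is assumed to compare each value against the uniform expectation of 2.5‰. The database's actual procedure may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences, include_xaa=False):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    alphabet = STANDARD + ("X" if include_xaa else "")
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            if include_xaa or (a != "X" and b != "X"):
                counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, n_dipeptides=400):
    """Label a frequency relative to the uniform expectation (1000 / n per mille)."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MKKLLA", "MKLA"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_per_mille(toy)
print(len(freqs), freqs["KL"], classify(freqs["KL"]))
```

With include_xaa=True the result has 441 entries instead of 400, matching the two counts given in the text. Applied to values from the table, classify(4.593) (AlaAla) gives "over" and classify(0.816) (AlaCys) gives "under".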

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
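As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into percent. The helper names are hypothetical and chosen only for this illustration; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % equals 10 per mille)."""
    return value_per_mille / 10.0


ala_ala = 4.593  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.004593
percent = per_mille_to_percent(ala_ala)    # about 0.4593 %
print(fraction, percent)
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰, or a fraction of 0.0025.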

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.593AlaAla: 4.593 ± 0.084
0.816AlaCys: 0.816 ± 0.027
3.224AlaAsp: 3.224 ± 0.049
3.774AlaGlu: 3.774 ± 0.06
2.699AlaPhe: 2.699 ± 0.054
4.628AlaGly: 4.628 ± 0.067
0.819AlaHis: 0.819 ± 0.031
5.219AlaIle: 5.219 ± 0.087
4.742AlaLys: 4.742 ± 0.072
5.494AlaLeu: 5.494 ± 0.08
1.729AlaMet: 1.729 ± 0.042
2.603AlaAsn: 2.603 ± 0.049
1.475AlaPro: 1.475 ± 0.038
1.839AlaGln: 1.839 ± 0.042
2.005AlaArg: 2.005 ± 0.048
3.465AlaSer: 3.465 ± 0.063
2.812AlaThr: 2.812 ± 0.058
4.659AlaVal: 4.659 ± 0.069
0.438AlaTrp: 0.438 ± 0.021
2.353AlaTyr: 2.353 ± 0.041
0.0AlaXaa: 0.0 ± 0.0
Cys
0.728CysAla: 0.728 ± 0.028
0.239CysCys: 0.239 ± 0.017
0.747CysAsp: 0.747 ± 0.023
0.874CysGlu: 0.874 ± 0.03
0.661CysPhe: 0.661 ± 0.024
1.195CysGly: 1.195 ± 0.037
0.246CysHis: 0.246 ± 0.016
1.269CysIle: 1.269 ± 0.033
1.051CysLys: 1.051 ± 0.032
1.128CysLeu: 1.128 ± 0.031
0.392CysMet: 0.392 ± 0.019
0.707CysAsn: 0.707 ± 0.023
0.504CysPro: 0.504 ± 0.021
0.369CysGln: 0.369 ± 0.017
0.436CysArg: 0.436 ± 0.024
0.893CysSer: 0.893 ± 0.03
0.636CysThr: 0.636 ± 0.023
0.82CysVal: 0.82 ± 0.029
0.11CysTrp: 0.11 ± 0.01
0.543CysTyr: 0.543 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
2.986AspAla: 2.986 ± 0.057
0.716AspCys: 0.716 ± 0.025
2.372AspAsp: 2.372 ± 0.057
4.193AspGlu: 4.193 ± 0.065
2.722AspPhe: 2.722 ± 0.055
3.358AspGly: 3.358 ± 0.061
0.683AspHis: 0.683 ± 0.021
5.249AspIle: 5.249 ± 0.064
4.278AspLys: 4.278 ± 0.064
4.516AspLeu: 4.516 ± 0.067
1.579AspMet: 1.579 ± 0.036
2.628AspAsn: 2.628 ± 0.058
1.293AspPro: 1.293 ± 0.032
1.261AspGln: 1.261 ± 0.032
1.72AspArg: 1.72 ± 0.041
3.212AspSer: 3.212 ± 0.058
2.777AspThr: 2.777 ± 0.048
3.247AspVal: 3.247 ± 0.059
0.495AspTrp: 0.495 ± 0.022
2.874AspTyr: 2.874 ± 0.052
0.0AspXaa: 0.0 ± 0.0
Glu
4.324GluAla: 4.324 ± 0.079
0.781GluCys: 0.781 ± 0.027
3.808GluAsp: 3.808 ± 0.057
7.193GluGlu: 7.193 ± 0.112
2.956GluPhe: 2.956 ± 0.053
4.042GluGly: 4.042 ± 0.067
1.176GluHis: 1.176 ± 0.034
6.932GluIle: 6.932 ± 0.092
7.492GluLys: 7.492 ± 0.087
6.884GluLeu: 6.884 ± 0.08
2.278GluMet: 2.278 ± 0.046
4.784GluAsn: 4.784 ± 0.062
1.645GluPro: 1.645 ± 0.042
2.835GluGln: 2.835 ± 0.054
2.829GluArg: 2.829 ± 0.057
3.691GluSer: 3.691 ± 0.069
3.659GluThr: 3.659 ± 0.06
4.606GluVal: 4.606 ± 0.067
0.589GluTrp: 0.589 ± 0.021
3.274GluTyr: 3.274 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
2.597PheAla: 2.597 ± 0.056
0.745PheCys: 0.745 ± 0.029
2.435PheAsp: 2.435 ± 0.049
3.139PheGlu: 3.139 ± 0.055
2.051PhePhe: 2.051 ± 0.059
2.998PheGly: 2.998 ± 0.051
0.848PheHis: 0.848 ± 0.027
4.036PheIle: 4.036 ± 0.071
2.918PheLys: 2.918 ± 0.045
4.116PheLeu: 4.116 ± 0.078
1.274PheMet: 1.274 ± 0.037
2.065PheAsn: 2.065 ± 0.046
1.307PhePro: 1.307 ± 0.038
1.597PheGln: 1.597 ± 0.043
1.407PheArg: 1.407 ± 0.036
3.184PheSer: 3.184 ± 0.061
2.301PheThr: 2.301 ± 0.048
2.969PheVal: 2.969 ± 0.056
0.402PheTrp: 0.402 ± 0.017
2.035PheTyr: 2.035 ± 0.053
0.0PheXaa: 0.0 ± 0.0
Gly
3.982GlyAla: 3.982 ± 0.08
1.076GlyCys: 1.076 ± 0.036
2.936GlyAsp: 2.936 ± 0.051
4.052GlyGlu: 4.052 ± 0.064
3.109GlyPhe: 3.109 ± 0.055
4.049GlyGly: 4.049 ± 0.069
1.038GlyHis: 1.038 ± 0.033
6.94GlyIle: 6.94 ± 0.091
5.556GlyLys: 5.556 ± 0.076
5.527GlyLeu: 5.527 ± 0.079
2.046GlyMet: 2.046 ± 0.043
3.398GlyAsn: 3.398 ± 0.067
1.111GlyPro: 1.111 ± 0.03
1.757GlyGln: 1.757 ± 0.04
2.269GlyArg: 2.269 ± 0.044
3.695GlySer: 3.695 ± 0.058
3.816GlyThr: 3.816 ± 0.062
4.475GlyVal: 4.475 ± 0.073
0.575GlyTrp: 0.575 ± 0.025
3.077GlyTyr: 3.077 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
0.878HisAla: 0.878 ± 0.031
0.269HisCys: 0.269 ± 0.016
0.786HisAsp: 0.786 ± 0.032
0.936HisGlu: 0.936 ± 0.03
0.761HisPhe: 0.761 ± 0.027
1.054HisGly: 1.054 ± 0.029
0.34HisHis: 0.34 ± 0.018
1.445HisIle: 1.445 ± 0.034
1.091HisLys: 1.091 ± 0.03
1.42HisLeu: 1.42 ± 0.033
0.467HisMet: 0.467 ± 0.021
0.816HisAsn: 0.816 ± 0.028
0.685HisPro: 0.685 ± 0.024
0.513HisGln: 0.513 ± 0.024
0.58HisArg: 0.58 ± 0.022
0.904HisSer: 0.904 ± 0.03
0.852HisThr: 0.852 ± 0.03
0.922HisVal: 0.922 ± 0.027
0.16HisTrp: 0.16 ± 0.012
0.808HisTyr: 0.808 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.62IleAla: 5.62 ± 0.079
1.437IleCys: 1.437 ± 0.036
4.903IleAsp: 4.903 ± 0.059
6.62IleGlu: 6.62 ± 0.084
4.081IlePhe: 4.081 ± 0.075
5.845IleGly: 5.845 ± 0.091
1.458IleHis: 1.458 ± 0.037
8.12IleIle: 8.12 ± 0.118
6.954IleLys: 6.954 ± 0.074
8.66IleLeu: 8.66 ± 0.122
2.417IleMet: 2.417 ± 0.054
4.601IleAsn: 4.601 ± 0.073
3.346IlePro: 3.346 ± 0.058
3.02IleGln: 3.02 ± 0.061
3.36IleArg: 3.36 ± 0.053
6.407IleSer: 6.407 ± 0.1
4.791IleThr: 4.791 ± 0.069
5.912IleVal: 5.912 ± 0.067
0.683IleTrp: 0.683 ± 0.027
3.462IleTyr: 3.462 ± 0.051
0.0IleXaa: 0.0 ± 0.0
Lys
4.811LysAla: 4.811 ± 0.07
0.833LysCys: 0.833 ± 0.029
4.654LysAsp: 4.654 ± 0.067
8.714LysGlu: 8.714 ± 0.103
2.763LysPhe: 2.763 ± 0.057
4.73LysGly: 4.73 ± 0.065
1.114LysHis: 1.114 ± 0.026
7.166LysIle: 7.166 ± 0.086
8.181LysLys: 8.181 ± 0.11
6.494LysLeu: 6.494 ± 0.082
2.52LysMet: 2.52 ± 0.048
5.404LysAsn: 5.404 ± 0.079
1.881LysPro: 1.881 ± 0.039
2.852LysGln: 2.852 ± 0.057
3.138LysArg: 3.138 ± 0.051
4.635LysSer: 4.635 ± 0.074
4.089LysThr: 4.089 ± 0.059
5.47LysVal: 5.47 ± 0.073
0.671LysTrp: 0.671 ± 0.021
3.581LysTyr: 3.581 ± 0.064
0.0LysXaa: 0.0 ± 0.0
Leu
5.511LeuAla: 5.511 ± 0.074
1.296LeuCys: 1.296 ± 0.037
4.767LeuAsp: 4.767 ± 0.071
6.936LeuGlu: 6.936 ± 0.082
4.093LeuPhe: 4.093 ± 0.078
5.698LeuGly: 5.698 ± 0.076
1.361LeuHis: 1.361 ± 0.035
7.593LeuIle: 7.593 ± 0.101
7.676LeuLys: 7.676 ± 0.085
8.476LeuLeu: 8.476 ± 0.114
2.378LeuMet: 2.378 ± 0.047
5.124LeuAsn: 5.124 ± 0.07
2.887LeuPro: 2.887 ± 0.051
2.669LeuGln: 2.669 ± 0.049
3.01LeuArg: 3.01 ± 0.052
6.276LeuSer: 6.276 ± 0.085
4.683LeuThr: 4.683 ± 0.065
5.294LeuVal: 5.294 ± 0.076
0.691LeuTrp: 0.691 ± 0.022
3.441LeuTyr: 3.441 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
1.773MetAla: 1.773 ± 0.045
0.317MetCys: 0.317 ± 0.018
1.598MetAsp: 1.598 ± 0.035
2.346MetGlu: 2.346 ± 0.046
1.068MetPhe: 1.068 ± 0.029
1.753MetGly: 1.753 ± 0.038
0.419MetHis: 0.419 ± 0.019
2.47MetIle: 2.47 ± 0.045
2.956MetLys: 2.956 ± 0.056
2.5MetLeu: 2.5 ± 0.056
0.729MetMet: 0.729 ± 0.025
1.81MetAsn: 1.81 ± 0.039
0.968MetPro: 0.968 ± 0.03
1.004MetGln: 1.004 ± 0.03
0.989MetArg: 0.989 ± 0.03
1.714MetSer: 1.714 ± 0.041
1.378MetThr: 1.378 ± 0.036
1.677MetVal: 1.677 ± 0.041
0.177MetTrp: 0.177 ± 0.013
0.864MetTyr: 0.864 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.066AsnAla: 3.066 ± 0.053
0.743AsnCys: 0.743 ± 0.026
2.611AsnAsp: 2.611 ± 0.06
3.901AsnGlu: 3.901 ± 0.053
2.141AsnPhe: 2.141 ± 0.051
3.75AsnGly: 3.75 ± 0.065
1.058AsnHis: 1.058 ± 0.029
5.134AsnIle: 5.134 ± 0.069
4.481AsnLys: 4.481 ± 0.067
4.866AsnLeu: 4.866 ± 0.073
1.58AsnMet: 1.58 ± 0.039
3.098AsnAsn: 3.098 ± 0.067
1.961AsnPro: 1.961 ± 0.041
2.224AsnGln: 2.224 ± 0.047
2.13AsnArg: 2.13 ± 0.044
3.253AsnSer: 3.253 ± 0.07
2.834AsnThr: 2.834 ± 0.05
3.369AsnVal: 3.369 ± 0.061
0.459AsnTrp: 0.459 ± 0.021
2.463AsnTyr: 2.463 ± 0.057
0.0AsnXaa: 0.0 ± 0.0
Pro
1.597ProAla: 1.597 ± 0.038
0.346ProCys: 0.346 ± 0.021
1.705ProAsp: 1.705 ± 0.038
2.367ProGlu: 2.367 ± 0.049
1.403ProPhe: 1.403 ± 0.036
1.738ProGly: 1.738 ± 0.043
0.482ProHis: 0.482 ± 0.02
2.695ProIle: 2.695 ± 0.047
2.08ProLys: 2.08 ± 0.04
2.393ProLeu: 2.393 ± 0.046
0.769ProMet: 0.769 ± 0.026
1.529ProAsn: 1.529 ± 0.035
0.566ProPro: 0.566 ± 0.025
0.871ProGln: 0.871 ± 0.028
0.774ProArg: 0.774 ± 0.027
1.72ProSer: 1.72 ± 0.044
1.452ProThr: 1.452 ± 0.04
2.213ProVal: 2.213 ± 0.05
0.279ProTrp: 0.279 ± 0.015
1.349ProTyr: 1.349 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
1.852GlnAla: 1.852 ± 0.037
0.364GlnCys: 0.364 ± 0.018
1.507GlnAsp: 1.507 ± 0.033
2.67GlnGlu: 2.67 ± 0.056
1.263GlnPhe: 1.263 ± 0.029
1.871GlnGly: 1.871 ± 0.043
0.407GlnHis: 0.407 ± 0.016
3.185GlnIle: 3.185 ± 0.064
3.057GlnLys: 3.057 ± 0.057
3.062GlnLeu: 3.062 ± 0.053
1.05GlnMet: 1.05 ± 0.031
1.948GlnAsn: 1.948 ± 0.046
0.745GlnPro: 0.745 ± 0.025
1.141GlnGln: 1.141 ± 0.036
1.134GlnArg: 1.134 ± 0.036
1.78GlnSer: 1.78 ± 0.04
1.641GlnThr: 1.641 ± 0.041
2.079GlnVal: 2.079 ± 0.039
0.26GlnTrp: 0.26 ± 0.015
1.468GlnTyr: 1.468 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
1.783ArgAla: 1.783 ± 0.042
0.422ArgCys: 0.422 ± 0.022
1.69ArgAsp: 1.69 ± 0.037
2.736ArgGlu: 2.736 ± 0.045
1.638ArgPhe: 1.638 ± 0.034
1.943ArgGly: 1.943 ± 0.043
0.558ArgHis: 0.558 ± 0.021
3.371ArgIle: 3.371 ± 0.053
3.222ArgLys: 3.222 ± 0.059
3.127ArgLeu: 3.127 ± 0.057
1.139ArgMet: 1.139 ± 0.034
2.097ArgAsn: 2.097 ± 0.036
0.971ArgPro: 0.971 ± 0.029
1.173ArgGln: 1.173 ± 0.035
1.49ArgArg: 1.49 ± 0.044
1.764ArgSer: 1.764 ± 0.039
1.786ArgThr: 1.786 ± 0.041
2.196ArgVal: 2.196 ± 0.043
0.312ArgTrp: 0.312 ± 0.015
1.508ArgTyr: 1.508 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
3.452SerAla: 3.452 ± 0.054
0.783SerCys: 0.783 ± 0.024
3.085SerAsp: 3.085 ± 0.061
4.109SerGlu: 4.109 ± 0.062
3.07SerPhe: 3.07 ± 0.055
4.443SerGly: 4.443 ± 0.068
0.966SerHis: 0.966 ± 0.027
5.741SerIle: 5.741 ± 0.086
4.936SerLys: 4.936 ± 0.072
5.448SerLeu: 5.448 ± 0.071
1.744SerMet: 1.744 ± 0.043
3.489SerAsn: 3.489 ± 0.063
1.609SerPro: 1.609 ± 0.038
1.946SerGln: 1.946 ± 0.049
2.016SerArg: 2.016 ± 0.052
4.05SerSer: 4.05 ± 0.077
3.167SerThr: 3.167 ± 0.061
4.095SerVal: 4.095 ± 0.06
0.531SerTrp: 0.531 ± 0.02
2.76SerTyr: 2.76 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
3.308ThrAla: 3.308 ± 0.054
0.603ThrCys: 0.603 ± 0.024
2.821ThrAsp: 2.821 ± 0.058
3.233ThrGlu: 3.233 ± 0.053
2.313ThrPhe: 2.313 ± 0.047
3.917ThrGly: 3.917 ± 0.061
0.782ThrHis: 0.782 ± 0.025
4.786ThrIle: 4.786 ± 0.063
3.879ThrLys: 3.879 ± 0.059
4.696ThrLeu: 4.696 ± 0.075
1.324ThrMet: 1.324 ± 0.032
2.718ThrAsn: 2.718 ± 0.056
1.778ThrPro: 1.778 ± 0.039
1.606ThrGln: 1.606 ± 0.039
1.59ThrArg: 1.59 ± 0.034
3.138ThrSer: 3.138 ± 0.056
2.844ThrThr: 2.844 ± 0.06
3.819ThrVal: 3.819 ± 0.07
0.434ThrTrp: 0.434 ± 0.02
2.253ThrTyr: 2.253 ± 0.05
0.0ThrXaa: 0.0 ± 0.0
Val
3.861ValAla: 3.861 ± 0.073
0.972ValCys: 0.972 ± 0.03
3.569ValAsp: 3.569 ± 0.059
4.535ValGlu: 4.535 ± 0.077
3.074ValPhe: 3.074 ± 0.06
4.114ValGly: 4.114 ± 0.067
0.951ValHis: 0.951 ± 0.028
5.957ValIle: 5.957 ± 0.074
5.141ValLys: 5.141 ± 0.068
6.267ValLeu: 6.267 ± 0.075
1.777ValMet: 1.777 ± 0.041
3.29ValAsn: 3.29 ± 0.053
2.024ValPro: 2.024 ± 0.046
1.801ValGln: 1.801 ± 0.035
2.118ValArg: 2.118 ± 0.043
4.458ValSer: 4.458 ± 0.069
3.708ValThr: 3.708 ± 0.069
4.474ValVal: 4.474 ± 0.084
0.52ValTrp: 0.52 ± 0.019
2.581ValTyr: 2.581 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.426TrpAla: 0.426 ± 0.022
0.125TrpCys: 0.125 ± 0.01
0.488TrpAsp: 0.488 ± 0.02
0.548TrpGlu: 0.548 ± 0.023
0.423TrpPhe: 0.423 ± 0.022
0.602TrpGly: 0.602 ± 0.026
0.14TrpHis: 0.14 ± 0.011
0.715TrpIle: 0.715 ± 0.025
0.71TrpLys: 0.71 ± 0.026
0.744TrpLeu: 0.744 ± 0.025
0.255TrpMet: 0.255 ± 0.017
0.594TrpAsn: 0.594 ± 0.025
0.182TrpPro: 0.182 ± 0.013
0.242TrpGln: 0.242 ± 0.014
0.299TrpArg: 0.299 ± 0.017
0.473TrpSer: 0.473 ± 0.023
0.406TrpThr: 0.406 ± 0.018
0.43TrpVal: 0.43 ± 0.02
0.118TrpTrp: 0.118 ± 0.011
0.341TrpTyr: 0.341 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.238TyrAla: 2.238 ± 0.047
0.664TyrCys: 0.664 ± 0.022
2.449TyrAsp: 2.449 ± 0.057
2.904TyrGlu: 2.904 ± 0.045
2.136TyrPhe: 2.136 ± 0.048
2.755TyrGly: 2.755 ± 0.053
0.839TyrHis: 0.839 ± 0.031
3.673TyrIle: 3.673 ± 0.069
3.302TyrLys: 3.302 ± 0.053
4.042TyrLeu: 4.042 ± 0.072
1.064TyrMet: 1.064 ± 0.031
2.418TyrAsn: 2.418 ± 0.05
1.39TyrPro: 1.39 ± 0.04
1.761TyrGln: 1.761 ± 0.039
1.633TyrArg: 1.633 ± 0.037
2.727TyrSer: 2.727 ± 0.064
2.213TyrThr: 2.213 ± 0.052
2.468TyrVal: 2.468 ± 0.046
0.357TyrTrp: 0.357 ± 0.016
2.128TyrTyr: 2.128 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3531 proteins (1128028 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
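A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration, not the database's code: the function names, the fixed seed, and the toy sequences are invented here, and the ± value is assumed to be the standard deviation across replicates, which the note above does not state explicitly.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) in per mille of one dipeptide frequency,
    obtained by resampling whole proteins with replacement."""
    rng = random.Random(seed)
    # Count pairs once per protein and reuse the counts in every replicate.
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # One replicate: draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MKKLLA", "MKLA", "AAKLG", "MLLKK"]  # made-up sequences
mean_kl, err_kl = bootstrap_error(toy, "KL")
print(f"KL: {mean_kl:.3f} ± {err_kl:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.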

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski