Amino acid dipepetide frequency for Leucothrix arctica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.07AlaAla: 8.07 ± 0.115
0.874AlaCys: 0.874 ± 0.027
4.853AlaAsp: 4.853 ± 0.066
5.493AlaGlu: 5.493 ± 0.071
3.325AlaPhe: 3.325 ± 0.064
6.374AlaGly: 6.374 ± 0.095
1.555AlaHis: 1.555 ± 0.038
5.682AlaIle: 5.682 ± 0.082
4.682AlaLys: 4.682 ± 0.073
9.121AlaLeu: 9.121 ± 0.097
2.406AlaMet: 2.406 ± 0.047
3.328AlaAsn: 3.328 ± 0.057
3.01AlaPro: 3.01 ± 0.085
3.233AlaGln: 3.233 ± 0.053
3.555AlaArg: 3.555 ± 0.057
5.832AlaSer: 5.832 ± 0.079
4.895AlaThr: 4.895 ± 0.11
6.045AlaVal: 6.045 ± 0.082
1.073AlaTrp: 1.073 ± 0.03
2.469AlaTyr: 2.469 ± 0.044
0.0AlaXaa: 0.0 ± 0.0
Cys
0.745CysAla: 0.745 ± 0.029
0.132CysCys: 0.132 ± 0.011
0.624CysAsp: 0.624 ± 0.036
0.545CysGlu: 0.545 ± 0.03
0.436CysPhe: 0.436 ± 0.017
0.833CysGly: 0.833 ± 0.027
0.292CysHis: 0.292 ± 0.017
0.608CysIle: 0.608 ± 0.022
0.419CysLys: 0.419 ± 0.019
0.938CysLeu: 0.938 ± 0.03
0.197CysMet: 0.197 ± 0.012
0.37CysAsn: 0.37 ± 0.018
0.465CysPro: 0.465 ± 0.028
0.35CysGln: 0.35 ± 0.018
0.441CysArg: 0.441 ± 0.019
0.701CysSer: 0.701 ± 0.021
0.475CysThr: 0.475 ± 0.019
0.6CysVal: 0.6 ± 0.019
0.122CysTrp: 0.122 ± 0.012
0.31CysTyr: 0.31 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.889AspAla: 4.889 ± 0.079
0.55AspCys: 0.55 ± 0.021
3.6AspAsp: 3.6 ± 0.063
4.067AspGlu: 4.067 ± 0.058
2.569AspPhe: 2.569 ± 0.045
4.215AspGly: 4.215 ± 0.11
1.092AspHis: 1.092 ± 0.029
3.94AspIle: 3.94 ± 0.054
3.216AspLys: 3.216 ± 0.065
5.538AspLeu: 5.538 ± 0.071
1.343AspMet: 1.343 ± 0.032
2.605AspAsn: 2.605 ± 0.06
2.259AspPro: 2.259 ± 0.053
2.156AspGln: 2.156 ± 0.044
2.416AspArg: 2.416 ± 0.043
3.818AspSer: 3.818 ± 0.081
3.412AspThr: 3.412 ± 0.069
4.01AspVal: 4.01 ± 0.061
0.879AspTrp: 0.879 ± 0.027
2.147AspTyr: 2.147 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
5.424GluAla: 5.424 ± 0.073
0.497GluCys: 0.497 ± 0.022
3.103GluAsp: 3.103 ± 0.056
3.592GluGlu: 3.592 ± 0.058
2.188GluPhe: 2.188 ± 0.049
3.771GluGly: 3.771 ± 0.059
1.416GluHis: 1.416 ± 0.041
3.872GluIle: 3.872 ± 0.059
3.789GluLys: 3.789 ± 0.064
6.352GluLeu: 6.352 ± 0.096
1.616GluMet: 1.616 ± 0.035
2.652GluAsn: 2.652 ± 0.046
1.943GluPro: 1.943 ± 0.043
3.095GluGln: 3.095 ± 0.06
3.058GluArg: 3.058 ± 0.055
4.163GluSer: 4.163 ± 0.061
3.426GluThr: 3.426 ± 0.058
4.497GluVal: 4.497 ± 0.066
0.809GluTrp: 0.809 ± 0.026
1.884GluTyr: 1.884 ± 0.037
0.0GluXaa: 0.0 ± 0.0
Phe
3.358PheAla: 3.358 ± 0.054
0.445PheCys: 0.445 ± 0.021
2.708PheAsp: 2.708 ± 0.053
2.437PheGlu: 2.437 ± 0.045
1.783PhePhe: 1.783 ± 0.043
3.043PheGly: 3.043 ± 0.053
0.758PheHis: 0.758 ± 0.023
2.608PheIle: 2.608 ± 0.05
2.065PheLys: 2.065 ± 0.046
3.614PheLeu: 3.614 ± 0.063
1.035PheMet: 1.035 ± 0.027
1.795PheAsn: 1.795 ± 0.037
1.451PhePro: 1.451 ± 0.034
1.22PheGln: 1.22 ± 0.029
1.562PheArg: 1.562 ± 0.034
3.273PheSer: 3.273 ± 0.045
2.436PheThr: 2.436 ± 0.051
2.724PheVal: 2.724 ± 0.047
0.544PheTrp: 0.544 ± 0.02
1.358PheTyr: 1.358 ± 0.035
0.0PheXaa: 0.0 ± 0.0
Gly
5.415GlyAla: 5.415 ± 0.075
0.815GlyCys: 0.815 ± 0.043
3.937GlyAsp: 3.937 ± 0.071
4.203GlyGlu: 4.203 ± 0.07
3.176GlyPhe: 3.176 ± 0.051
5.013GlyGly: 5.013 ± 0.068
1.492GlyHis: 1.492 ± 0.031
4.68GlyIle: 4.68 ± 0.067
4.002GlyLys: 4.002 ± 0.061
7.162GlyLeu: 7.162 ± 0.088
2.012GlyMet: 2.012 ± 0.046
2.897GlyAsn: 2.897 ± 0.074
1.666GlyPro: 1.666 ± 0.04
2.516GlyGln: 2.516 ± 0.05
3.073GlyArg: 3.073 ± 0.057
4.643GlySer: 4.643 ± 0.067
3.933GlyThr: 3.933 ± 0.098
5.374GlyVal: 5.374 ± 0.075
0.956GlyTrp: 0.956 ± 0.028
2.404GlyTyr: 2.404 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.586HisAla: 1.586 ± 0.039
0.262HisCys: 0.262 ± 0.014
1.223HisAsp: 1.223 ± 0.033
1.172HisGlu: 1.172 ± 0.03
0.936HisPhe: 0.936 ± 0.027
1.442HisGly: 1.442 ± 0.04
0.588HisHis: 0.588 ± 0.024
1.315HisIle: 1.315 ± 0.033
1.0HisLys: 1.0 ± 0.03
2.073HisLeu: 2.073 ± 0.042
0.511HisMet: 0.511 ± 0.021
0.815HisAsn: 0.815 ± 0.024
1.093HisPro: 1.093 ± 0.031
0.865HisGln: 0.865 ± 0.028
1.016HisArg: 1.016 ± 0.03
1.376HisSer: 1.376 ± 0.037
1.091HisThr: 1.091 ± 0.034
1.243HisVal: 1.243 ± 0.029
0.342HisTrp: 0.342 ± 0.016
0.815HisTyr: 0.815 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
6.083IleAla: 6.083 ± 0.078
0.643IleCys: 0.643 ± 0.022
4.163IleAsp: 4.163 ± 0.059
4.263IleGlu: 4.263 ± 0.058
2.197IlePhe: 2.197 ± 0.051
4.623IleGly: 4.623 ± 0.07
1.249IleHis: 1.249 ± 0.032
3.832IleIle: 3.832 ± 0.065
3.429IleLys: 3.429 ± 0.054
5.573IleLeu: 5.573 ± 0.074
1.368IleMet: 1.368 ± 0.036
2.941IleAsn: 2.941 ± 0.052
2.934IlePro: 2.934 ± 0.046
2.102IleGln: 2.102 ± 0.04
2.825IleArg: 2.825 ± 0.045
4.714IleSer: 4.714 ± 0.059
4.034IleThr: 4.034 ± 0.075
4.079IleVal: 4.079 ± 0.061
0.661IleTrp: 0.661 ± 0.022
1.713IleTyr: 1.713 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
4.88LysAla: 4.88 ± 0.074
0.308LysCys: 0.308 ± 0.016
3.015LysAsp: 3.015 ± 0.065
3.203LysGlu: 3.203 ± 0.056
1.516LysPhe: 1.516 ± 0.038
3.475LysGly: 3.475 ± 0.061
1.327LysHis: 1.327 ± 0.032
3.129LysIle: 3.129 ± 0.048
3.334LysLys: 3.334 ± 0.076
5.251LysLeu: 5.251 ± 0.073
1.34LysMet: 1.34 ± 0.034
2.316LysAsn: 2.316 ± 0.048
2.499LysPro: 2.499 ± 0.058
2.794LysGln: 2.794 ± 0.049
2.884LysArg: 2.884 ± 0.052
3.658LysSer: 3.658 ± 0.059
3.276LysThr: 3.276 ± 0.049
3.834LysVal: 3.834 ± 0.056
0.554LysTrp: 0.554 ± 0.019
1.46LysTyr: 1.46 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
9.012LeuAla: 9.012 ± 0.1
0.982LeuCys: 0.982 ± 0.029
5.888LeuAsp: 5.888 ± 0.069
6.032LeuGlu: 6.032 ± 0.073
4.167LeuPhe: 4.167 ± 0.066
6.895LeuGly: 6.895 ± 0.088
1.955LeuHis: 1.955 ± 0.038
6.377LeuIle: 6.377 ± 0.079
5.715LeuLys: 5.715 ± 0.074
10.622LeuLeu: 10.622 ± 0.136
2.577LeuMet: 2.577 ± 0.045
4.391LeuAsn: 4.391 ± 0.062
4.541LeuPro: 4.541 ± 0.069
3.525LeuGln: 3.525 ± 0.064
4.457LeuArg: 4.457 ± 0.069
8.284LeuSer: 8.284 ± 0.091
5.997LeuThr: 5.997 ± 0.077
6.789LeuVal: 6.789 ± 0.077
1.119LeuTrp: 1.119 ± 0.03
2.623LeuTyr: 2.623 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.283MetAla: 2.283 ± 0.047
0.182MetCys: 0.182 ± 0.012
1.317MetAsp: 1.317 ± 0.034
1.217MetGlu: 1.217 ± 0.029
0.845MetPhe: 0.845 ± 0.029
1.715MetGly: 1.715 ± 0.038
0.465MetHis: 0.465 ± 0.018
1.557MetIle: 1.557 ± 0.032
1.626MetLys: 1.626 ± 0.034
2.583MetLeu: 2.583 ± 0.05
0.747MetMet: 0.747 ± 0.022
1.097MetAsn: 1.097 ± 0.027
1.184MetPro: 1.184 ± 0.03
1.033MetGln: 1.033 ± 0.027
1.192MetArg: 1.192 ± 0.032
2.039MetSer: 2.039 ± 0.038
1.655MetThr: 1.655 ± 0.036
1.715MetVal: 1.715 ± 0.04
0.213MetTrp: 0.213 ± 0.012
0.513MetTyr: 0.513 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.394AsnAla: 3.394 ± 0.054
0.421AsnCys: 0.421 ± 0.018
2.645AsnAsp: 2.645 ± 0.064
2.393AsnGlu: 2.393 ± 0.043
1.586AsnPhe: 1.586 ± 0.032
3.028AsnGly: 3.028 ± 0.063
0.928AsnHis: 0.928 ± 0.026
2.948AsnIle: 2.948 ± 0.056
2.343AsnLys: 2.343 ± 0.049
3.858AsnLeu: 3.858 ± 0.06
0.936AsnMet: 0.936 ± 0.027
2.063AsnAsn: 2.063 ± 0.048
2.107AsnPro: 2.107 ± 0.052
1.633AsnGln: 1.633 ± 0.033
1.878AsnArg: 1.878 ± 0.037
2.704AsnSer: 2.704 ± 0.053
2.758AsnThr: 2.758 ± 0.068
2.704AsnVal: 2.704 ± 0.057
0.6AsnTrp: 0.6 ± 0.024
1.413AsnTyr: 1.413 ± 0.035
0.0AsnXaa: 0.0 ± 0.0
Pro
3.386ProAla: 3.386 ± 0.089
0.294ProCys: 0.294 ± 0.02
2.548ProAsp: 2.548 ± 0.047
3.361ProGlu: 3.361 ± 0.052
1.685ProPhe: 1.685 ± 0.035
2.101ProGly: 2.101 ± 0.041
0.792ProHis: 0.792 ± 0.026
2.442ProIle: 2.442 ± 0.043
2.053ProLys: 2.053 ± 0.045
3.836ProLeu: 3.836 ± 0.052
1.006ProMet: 1.006 ± 0.027
1.715ProAsn: 1.715 ± 0.038
1.296ProPro: 1.296 ± 0.038
1.363ProGln: 1.363 ± 0.036
1.405ProArg: 1.405 ± 0.039
2.889ProSer: 2.889 ± 0.058
2.398ProThr: 2.398 ± 0.084
3.253ProVal: 3.253 ± 0.057
0.531ProTrp: 0.531 ± 0.023
1.282ProTyr: 1.282 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
3.819GlnAla: 3.819 ± 0.061
0.369GlnCys: 0.369 ± 0.023
1.821GlnAsp: 1.821 ± 0.046
2.056GlnGlu: 2.056 ± 0.046
1.429GlnPhe: 1.429 ± 0.03
2.558GlnGly: 2.558 ± 0.051
0.953GlnHis: 0.953 ± 0.029
2.17GlnIle: 2.17 ± 0.044
1.999GlnLys: 1.999 ± 0.044
4.467GlnLeu: 4.467 ± 0.078
0.902GlnMet: 0.902 ± 0.027
1.413GlnAsn: 1.413 ± 0.034
1.473GlnPro: 1.473 ± 0.036
2.497GlnGln: 2.497 ± 0.071
2.22GlnArg: 2.22 ± 0.044
2.74GlnSer: 2.74 ± 0.046
2.146GlnThr: 2.146 ± 0.046
2.689GlnVal: 2.689 ± 0.045
0.55GlnTrp: 0.55 ± 0.02
1.241GlnTyr: 1.241 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
3.263ArgAla: 3.263 ± 0.052
0.467ArgCys: 0.467 ± 0.018
2.439ArgAsp: 2.439 ± 0.041
2.89ArgGlu: 2.89 ± 0.058
2.162ArgPhe: 2.162 ± 0.036
2.697ArgGly: 2.697 ± 0.046
0.998ArgHis: 0.998 ± 0.029
2.915ArgIle: 2.915 ± 0.053
2.484ArgLys: 2.484 ± 0.045
4.96ArgLeu: 4.96 ± 0.075
1.267ArgMet: 1.267 ± 0.029
1.891ArgAsn: 1.891 ± 0.04
1.601ArgPro: 1.601 ± 0.033
1.978ArgGln: 1.978 ± 0.034
2.246ArgArg: 2.246 ± 0.05
2.803ArgSer: 2.803 ± 0.047
2.237ArgThr: 2.237 ± 0.049
3.174ArgVal: 3.174 ± 0.048
0.683ArgTrp: 0.683 ± 0.022
1.707ArgTyr: 1.707 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
5.675SerAla: 5.675 ± 0.074
0.717SerCys: 0.717 ± 0.039
4.352SerAsp: 4.352 ± 0.069
4.049SerGlu: 4.049 ± 0.053
3.154SerPhe: 3.154 ± 0.06
5.548SerGly: 5.548 ± 0.086
1.472SerHis: 1.472 ± 0.037
4.46SerIle: 4.46 ± 0.061
3.661SerLys: 3.661 ± 0.06
7.429SerLeu: 7.429 ± 0.09
1.807SerMet: 1.807 ± 0.045
3.03SerAsn: 3.03 ± 0.062
2.679SerPro: 2.679 ± 0.045
2.559SerGln: 2.559 ± 0.054
3.161SerArg: 3.161 ± 0.057
5.016SerSer: 5.016 ± 0.081
3.977SerThr: 3.977 ± 0.066
5.086SerVal: 5.086 ± 0.074
0.898SerTrp: 0.898 ± 0.029
2.272SerTyr: 2.272 ± 0.057
0.0SerXaa: 0.0 ± 0.0
Thr
5.169ThrAla: 5.169 ± 0.091
0.473ThrCys: 0.473 ± 0.022
3.608ThrAsp: 3.608 ± 0.077
3.437ThrGlu: 3.437 ± 0.055
2.161ThrPhe: 2.161 ± 0.039
4.412ThrGly: 4.412 ± 0.09
1.238ThrHis: 1.238 ± 0.031
3.623ThrIle: 3.623 ± 0.065
2.559ThrLys: 2.559 ± 0.046
6.636ThrLeu: 6.636 ± 0.088
1.223ThrMet: 1.223 ± 0.031
2.34ThrAsn: 2.34 ± 0.071
3.019ThrPro: 3.019 ± 0.089
2.287ThrGln: 2.287 ± 0.06
2.35ThrArg: 2.35 ± 0.045
3.898ThrSer: 3.898 ± 0.069
3.582ThrThr: 3.582 ± 0.093
4.395ThrVal: 4.395 ± 0.096
0.693ThrTrp: 0.693 ± 0.027
1.693ThrTyr: 1.693 ± 0.055
0.0ThrXaa: 0.0 ± 0.0
Val
6.255ValAla: 6.255 ± 0.094
0.692ValCys: 0.692 ± 0.024
4.287ValAsp: 4.287 ± 0.088
4.391ValGlu: 4.391 ± 0.065
2.93ValPhe: 2.93 ± 0.051
4.746ValGly: 4.746 ± 0.07
1.188ValHis: 1.188 ± 0.027
4.9ValIle: 4.9 ± 0.066
3.532ValLys: 3.532 ± 0.055
7.025ValLeu: 7.025 ± 0.076
1.917ValMet: 1.917 ± 0.041
2.975ValAsn: 2.975 ± 0.062
2.645ValPro: 2.645 ± 0.046
2.042ValGln: 2.042 ± 0.038
2.789ValArg: 2.789 ± 0.047
5.354ValSer: 5.354 ± 0.066
4.612ValThr: 4.612 ± 0.1
5.343ValVal: 5.343 ± 0.076
0.764ValTrp: 0.764 ± 0.023
1.919ValTyr: 1.919 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
0.862TrpAla: 0.862 ± 0.027
0.137TrpCys: 0.137 ± 0.011
0.712TrpAsp: 0.712 ± 0.026
0.647TrpGlu: 0.647 ± 0.025
0.593TrpPhe: 0.593 ± 0.021
0.776TrpGly: 0.776 ± 0.025
0.316TrpHis: 0.316 ± 0.016
0.656TrpIle: 0.656 ± 0.023
0.647TrpLys: 0.647 ± 0.021
1.583TrpLeu: 1.583 ± 0.04
0.379TrpMet: 0.379 ± 0.016
0.488TrpAsn: 0.488 ± 0.019
0.454TrpPro: 0.454 ± 0.019
0.687TrpGln: 0.687 ± 0.026
0.65TrpArg: 0.65 ± 0.022
0.869TrpSer: 0.869 ± 0.028
0.624TrpThr: 0.624 ± 0.028
0.949TrpVal: 0.949 ± 0.029
0.194TrpTrp: 0.194 ± 0.013
0.394TrpTyr: 0.394 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.309TyrAla: 2.309 ± 0.045
0.343TyrCys: 0.343 ± 0.019
1.876TyrAsp: 1.876 ± 0.05
1.698TyrGlu: 1.698 ± 0.039
1.381TyrPhe: 1.381 ± 0.038
2.104TyrGly: 2.104 ± 0.042
0.69TyrHis: 0.69 ± 0.026
1.712TyrIle: 1.712 ± 0.038
1.5TyrLys: 1.5 ± 0.04
3.327TyrLeu: 3.327 ± 0.049
0.592TyrMet: 0.592 ± 0.022
1.211TyrAsn: 1.211 ± 0.033
1.317TyrPro: 1.317 ± 0.031
1.576TyrGln: 1.576 ± 0.041
1.685TyrArg: 1.685 ± 0.031
2.228TyrSer: 2.228 ± 0.05
1.834TyrThr: 1.834 ± 0.046
1.802TyrVal: 1.802 ± 0.044
0.431TyrTrp: 0.431 ± 0.017
1.013TyrTyr: 1.013 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4192 proteins (1404415 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski