Amino acid dipeptide frequency for Donghicola eburneus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
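As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The page does not describe the exact counting procedure, so the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice not to count pairs across protein boundaries are all assumptions made for illustration.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Hypothetical helper: the source page does not document its own procedure.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Donghicola eburneus sequences.
toy_proteins = ["MAAL", "MAG"]
freqs = dipeptide_per_mille(toy_proteins)
```

With the toy input, five dipeptides are counted in total (MA twice, AA, AL and AG once each), so the frequencies sum to 1000‰.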

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue, in the table.
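The arithmetic behind the expected value can be sketched as follows. The three example values are taken from the table on this page; the `classify` helper and the assumption that the colouring threshold is exactly the uniform expectation of 2.5‰ are illustrative, since the page does not state the rule it uses.

```python
# Uniform expectation in per mille, matching the units of the table.
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)  # about 2.268 per mille when Xaa is included

def classify(per_mille, expected=EXPECTED_20):
    """Label a dipeptide frequency relative to the uniform expectation.

    Hypothetical helper: the threshold used by the page is an assumption.
    """
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"AlaAla": 14.552, "CysCys": 0.129, "IleIle": 2.514}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption AlaAla and IleIle count as overrepresented and CysCys as underrepresented.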

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
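For illustration, the unit conversion can be written as two small helpers. The function names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 14.552  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.014552
percent = per_mille_to_percent(ala_ala)    # roughly 1.4552
```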

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.552 ± 0.166
AlaCys: 1.06 ± 0.033
AlaAsp: 6.752 ± 0.08
AlaGlu: 8.339 ± 0.106
AlaPhe: 3.962 ± 0.062
AlaGly: 9.558 ± 0.097
AlaHis: 2.088 ± 0.043
AlaIle: 5.99 ± 0.064
AlaLys: 4.344 ± 0.067
AlaLeu: 12.824 ± 0.123
AlaMet: 3.774 ± 0.058
AlaAsn: 2.943 ± 0.053
AlaPro: 5.407 ± 0.087
AlaGln: 4.322 ± 0.067
AlaArg: 7.211 ± 0.093
AlaSer: 5.674 ± 0.075
AlaThr: 5.781 ± 0.066
AlaVal: 7.763 ± 0.087
AlaTrp: 1.348 ± 0.033
AlaTyr: 2.586 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.102 ± 0.031
CysCys: 0.129 ± 0.012
CysAsp: 0.666 ± 0.029
CysGlu: 0.458 ± 0.022
CysPhe: 0.347 ± 0.02
CysGly: 0.929 ± 0.029
CysHis: 0.276 ± 0.017
CysIle: 0.451 ± 0.018
CysLys: 0.238 ± 0.014
CysLeu: 0.837 ± 0.029
CysMet: 0.216 ± 0.012
CysAsn: 0.228 ± 0.013
CysPro: 0.529 ± 0.021
CysGln: 0.242 ± 0.014
CysArg: 0.512 ± 0.021
CysSer: 0.486 ± 0.021
CysThr: 0.497 ± 0.024
CysVal: 0.65 ± 0.024
CysTrp: 0.132 ± 0.01
CysTyr: 0.216 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.116 ± 0.082
AspCys: 0.534 ± 0.021
AspAsp: 3.539 ± 0.084
AspGlu: 3.624 ± 0.058
AspPhe: 2.358 ± 0.045
AspGly: 5.522 ± 0.08
AspHis: 1.358 ± 0.037
AspIle: 3.337 ± 0.053
AspLys: 1.965 ± 0.044
AspLeu: 6.669 ± 0.083
AspMet: 1.787 ± 0.041
AspAsn: 1.43 ± 0.039
AspPro: 3.38 ± 0.059
AspGln: 2.064 ± 0.045
AspArg: 4.004 ± 0.062
AspSer: 2.438 ± 0.05
AspThr: 3.305 ± 0.075
AspVal: 4.463 ± 0.072
AspTrp: 1.135 ± 0.036
AspTyr: 1.634 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.884 ± 0.107
GluCys: 0.424 ± 0.018
GluAsp: 3.753 ± 0.068
GluGlu: 4.122 ± 0.073
GluPhe: 1.923 ± 0.039
GluGly: 5.071 ± 0.07
GluHis: 1.252 ± 0.035
GluIle: 3.856 ± 0.06
GluLys: 2.366 ± 0.051
GluLeu: 5.43 ± 0.074
GluMet: 2.111 ± 0.047
GluAsn: 2.055 ± 0.041
GluPro: 2.384 ± 0.044
GluGln: 2.245 ± 0.044
GluArg: 4.242 ± 0.067
GluSer: 2.395 ± 0.048
GluThr: 4.003 ± 0.06
GluVal: 4.489 ± 0.063
GluTrp: 0.741 ± 0.025
GluTyr: 1.248 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.29 ± 0.061
PheCys: 0.44 ± 0.019
PheAsp: 2.867 ± 0.049
PheGlu: 2.328 ± 0.044
PhePhe: 1.443 ± 0.037
PheGly: 3.702 ± 0.068
PheHis: 0.743 ± 0.027
PheIle: 1.675 ± 0.04
PheLys: 1.139 ± 0.027
PheLeu: 3.234 ± 0.064
PheMet: 0.91 ± 0.026
PheAsn: 1.065 ± 0.033
PhePro: 1.484 ± 0.035
PheGln: 1.06 ± 0.029
PheArg: 2.082 ± 0.041
PheSer: 2.245 ± 0.046
PheThr: 2.034 ± 0.039
PheVal: 2.787 ± 0.05
PheTrp: 0.54 ± 0.022
PheTyr: 1.0 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.324 ± 0.091
GlyCys: 0.852 ± 0.029
GlyAsp: 4.794 ± 0.086
GlyGlu: 4.691 ± 0.058
GlyPhe: 3.49 ± 0.054
GlyGly: 7.166 ± 0.115
GlyHis: 1.887 ± 0.039
GlyIle: 4.644 ± 0.066
GlyLys: 3.2 ± 0.05
GlyLeu: 8.58 ± 0.098
GlyMet: 2.636 ± 0.051
GlyAsn: 2.4 ± 0.067
GlyPro: 3.251 ± 0.056
GlyGln: 3.255 ± 0.057
GlyArg: 5.196 ± 0.074
GlySer: 4.369 ± 0.07
GlyThr: 4.668 ± 0.063
GlyVal: 6.347 ± 0.071
GlyTrp: 1.391 ± 0.039
GlyTyr: 2.411 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.166 ± 0.044
HisCys: 0.233 ± 0.014
HisAsp: 1.246 ± 0.033
HisGlu: 1.042 ± 0.031
HisPhe: 0.832 ± 0.025
HisGly: 1.861 ± 0.036
HisHis: 0.579 ± 0.028
HisIle: 1.027 ± 0.026
HisLys: 0.66 ± 0.024
HisLeu: 2.094 ± 0.048
HisMet: 0.638 ± 0.024
HisAsn: 0.51 ± 0.019
HisPro: 1.29 ± 0.038
HisGln: 0.619 ± 0.022
HisArg: 1.241 ± 0.034
HisSer: 0.993 ± 0.029
HisThr: 0.938 ± 0.027
HisVal: 1.425 ± 0.034
HisTrp: 0.312 ± 0.015
HisTyr: 0.564 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.983 ± 0.08
IleCys: 0.655 ± 0.023
IleAsp: 3.529 ± 0.057
IleGlu: 3.86 ± 0.064
IlePhe: 1.841 ± 0.041
IleGly: 4.797 ± 0.064
IleHis: 0.998 ± 0.023
IleIle: 2.514 ± 0.05
IleLys: 1.77 ± 0.038
IleLeu: 4.898 ± 0.079
IleMet: 1.238 ± 0.037
IleAsn: 1.496 ± 0.035
IlePro: 2.461 ± 0.045
IleGln: 1.479 ± 0.038
IleArg: 3.065 ± 0.053
IleSer: 3.311 ± 0.053
IleThr: 3.216 ± 0.052
IleVal: 3.826 ± 0.059
IleTrp: 0.727 ± 0.024
IleTyr: 1.258 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.235 ± 0.064
LysCys: 0.227 ± 0.013
LysAsp: 2.049 ± 0.044
LysGlu: 2.071 ± 0.044
LysPhe: 1.126 ± 0.032
LysGly: 2.945 ± 0.052
LysHis: 0.68 ± 0.022
LysIle: 1.871 ± 0.04
LysLys: 1.445 ± 0.039
LysLeu: 3.274 ± 0.049
LysMet: 1.054 ± 0.029
LysAsn: 0.968 ± 0.026
LysPro: 1.877 ± 0.045
LysGln: 1.03 ± 0.03
LysArg: 2.341 ± 0.05
LysSer: 2.072 ± 0.042
LysThr: 2.156 ± 0.043
LysVal: 2.56 ± 0.052
LysTrp: 0.457 ± 0.02
LysTyr: 0.777 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.389 ± 0.103
LeuCys: 0.952 ± 0.03
LeuAsp: 5.985 ± 0.074
LeuGlu: 5.6 ± 0.085
LeuPhe: 3.507 ± 0.069
LeuGly: 8.026 ± 0.086
LeuHis: 1.818 ± 0.043
LeuIle: 5.316 ± 0.079
LeuLys: 3.49 ± 0.059
LeuLeu: 8.518 ± 0.117
LeuMet: 2.816 ± 0.047
LeuAsn: 3.051 ± 0.056
LeuPro: 5.28 ± 0.068
LeuGln: 2.995 ± 0.046
LeuArg: 6.583 ± 0.082
LeuSer: 6.838 ± 0.078
LeuThr: 6.098 ± 0.073
LeuVal: 6.578 ± 0.087
LeuTrp: 1.251 ± 0.035
LeuTyr: 2.061 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.486 ± 0.059
MetCys: 0.232 ± 0.014
MetAsp: 1.603 ± 0.036
MetGlu: 1.5 ± 0.033
MetPhe: 0.882 ± 0.028
MetGly: 2.4 ± 0.05
MetHis: 0.527 ± 0.023
MetIle: 1.679 ± 0.04
MetLys: 1.227 ± 0.032
MetLeu: 2.659 ± 0.051
MetMet: 0.871 ± 0.027
MetAsn: 0.943 ± 0.031
MetPro: 1.567 ± 0.036
MetGln: 1.057 ± 0.027
MetArg: 1.917 ± 0.043
MetSer: 1.9 ± 0.042
MetThr: 2.323 ± 0.042
MetVal: 1.975 ± 0.044
MetTrp: 0.274 ± 0.013
MetTyr: 0.421 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.447 ± 0.058
AsnCys: 0.303 ± 0.016
AsnAsp: 1.813 ± 0.045
AsnGlu: 1.432 ± 0.037
AsnPhe: 1.023 ± 0.027
AsnGly: 2.795 ± 0.045
AsnHis: 0.558 ± 0.02
AsnIle: 1.507 ± 0.035
AsnLys: 0.791 ± 0.027
AsnLeu: 2.686 ± 0.043
AsnMet: 0.806 ± 0.024
AsnAsn: 0.774 ± 0.027
AsnPro: 1.894 ± 0.04
AsnGln: 0.871 ± 0.025
AsnArg: 1.748 ± 0.043
AsnSer: 1.309 ± 0.035
AsnThr: 1.546 ± 0.041
AsnVal: 1.975 ± 0.041
AsnTrp: 0.456 ± 0.019
AsnTyr: 0.697 ± 0.023
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.32 ± 0.084
ProCys: 0.347 ± 0.018
ProAsp: 3.698 ± 0.057
ProGlu: 4.309 ± 0.057
ProPhe: 1.923 ± 0.043
ProGly: 3.223 ± 0.057
ProHis: 1.01 ± 0.028
ProIle: 2.406 ± 0.046
ProLys: 1.885 ± 0.045
ProLeu: 4.378 ± 0.063
ProMet: 1.383 ± 0.036
ProAsn: 1.439 ± 0.035
ProPro: 1.818 ± 0.04
ProGln: 1.624 ± 0.042
ProArg: 2.47 ± 0.052
ProSer: 2.557 ± 0.046
ProThr: 2.563 ± 0.045
ProVal: 3.882 ± 0.05
ProTrp: 0.629 ± 0.025
ProTyr: 1.213 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.863 ± 0.058
GlnCys: 0.226 ± 0.014
GlnAsp: 1.851 ± 0.037
GlnGlu: 1.816 ± 0.04
GlnPhe: 1.146 ± 0.03
GlnGly: 2.717 ± 0.047
GlnHis: 0.635 ± 0.021
GlnIle: 2.384 ± 0.046
GlnLys: 1.218 ± 0.035
GlnLeu: 2.929 ± 0.05
GlnMet: 1.245 ± 0.032
GlnAsn: 1.074 ± 0.03
GlnPro: 1.632 ± 0.04
GlnGln: 1.236 ± 0.043
GlnArg: 2.232 ± 0.042
GlnSer: 2.037 ± 0.04
GlnThr: 2.068 ± 0.044
GlnVal: 2.404 ± 0.044
GlnTrp: 0.408 ± 0.019
GlnTyr: 0.635 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.99 ± 0.078
ArgCys: 0.49 ± 0.021
ArgAsp: 3.907 ± 0.064
ArgGlu: 3.661 ± 0.058
ArgPhe: 2.506 ± 0.049
ArgGly: 4.268 ± 0.062
ArgHis: 1.315 ± 0.031
ArgIle: 3.584 ± 0.056
ArgLys: 2.413 ± 0.048
ArgLeu: 6.592 ± 0.098
ArgMet: 1.946 ± 0.043
ArgAsn: 1.809 ± 0.038
ArgPro: 2.934 ± 0.052
ArgGln: 2.339 ± 0.045
ArgArg: 4.311 ± 0.072
ArgSer: 3.249 ± 0.054
ArgThr: 3.004 ± 0.046
ArgVal: 4.355 ± 0.064
ArgTrp: 0.86 ± 0.026
ArgTyr: 1.58 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.023 ± 0.077
SerCys: 0.452 ± 0.02
SerAsp: 3.475 ± 0.058
SerGlu: 3.249 ± 0.056
SerPhe: 2.345 ± 0.043
SerGly: 5.568 ± 0.073
SerHis: 1.107 ± 0.029
SerIle: 2.747 ± 0.059
SerLys: 1.744 ± 0.04
SerLeu: 5.285 ± 0.066
SerMet: 1.522 ± 0.034
SerAsn: 1.598 ± 0.042
SerPro: 2.409 ± 0.046
SerGln: 1.825 ± 0.042
SerArg: 3.131 ± 0.047
SerSer: 2.875 ± 0.055
SerThr: 2.735 ± 0.044
SerVal: 3.989 ± 0.058
SerTrp: 0.669 ± 0.022
SerTyr: 1.452 ± 0.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.206 ± 0.072
ThrCys: 0.507 ± 0.02
ThrAsp: 3.419 ± 0.059
ThrGlu: 3.385 ± 0.048
ThrPhe: 2.114 ± 0.045
ThrGly: 5.289 ± 0.067
ThrHis: 1.184 ± 0.034
ThrIle: 2.914 ± 0.049
ThrLys: 1.786 ± 0.036
ThrLeu: 6.195 ± 0.07
ThrMet: 1.298 ± 0.034
ThrAsn: 1.466 ± 0.033
ThrPro: 3.356 ± 0.049
ThrGln: 1.833 ± 0.04
ThrArg: 3.303 ± 0.056
ThrSer: 2.998 ± 0.048
ThrThr: 2.901 ± 0.054
ThrVal: 4.347 ± 0.072
ThrTrp: 0.676 ± 0.025
ThrTyr: 1.414 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.034 ± 0.088
ValCys: 0.688 ± 0.025
ValAsp: 4.222 ± 0.062
ValGlu: 4.418 ± 0.063
ValPhe: 2.83 ± 0.05
ValGly: 5.445 ± 0.067
ValHis: 1.369 ± 0.029
ValIle: 4.256 ± 0.066
ValLys: 2.362 ± 0.051
ValLeu: 7.221 ± 0.084
ValMet: 2.194 ± 0.047
ValAsn: 2.056 ± 0.044
ValPro: 3.414 ± 0.049
ValGln: 2.255 ± 0.041
ValArg: 4.035 ± 0.055
ValSer: 4.382 ± 0.061
ValThr: 4.725 ± 0.07
ValVal: 5.533 ± 0.069
ValTrp: 0.909 ± 0.029
ValTyr: 1.584 ± 0.039
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.279 ± 0.035
TrpCys: 0.143 ± 0.011
TrpAsp: 0.75 ± 0.029
TrpGlu: 0.65 ± 0.025
TrpPhe: 0.534 ± 0.022
TrpGly: 1.008 ± 0.03
TrpHis: 0.357 ± 0.016
TrpIle: 0.694 ± 0.025
TrpLys: 0.498 ± 0.021
TrpLeu: 1.523 ± 0.037
TrpMet: 0.438 ± 0.02
TrpAsn: 0.429 ± 0.018
TrpPro: 0.639 ± 0.021
TrpGln: 0.588 ± 0.022
TrpArg: 0.96 ± 0.029
TrpSer: 0.798 ± 0.028
TrpThr: 0.799 ± 0.026
TrpVal: 0.891 ± 0.029
TrpTrp: 0.193 ± 0.014
TrpTyr: 0.311 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.586 ± 0.047
TyrCys: 0.249 ± 0.014
TyrAsp: 1.747 ± 0.039
TyrGlu: 1.439 ± 0.036
TyrPhe: 0.936 ± 0.028
TyrGly: 2.262 ± 0.042
TyrHis: 0.528 ± 0.021
TyrIle: 1.07 ± 0.034
TyrLys: 0.671 ± 0.022
TyrLeu: 2.328 ± 0.045
TyrMet: 0.56 ± 0.021
TyrAsn: 0.699 ± 0.023
TyrPro: 1.072 ± 0.03
TyrGln: 0.82 ± 0.024
TyrArg: 1.529 ± 0.035
TyrSer: 1.275 ± 0.037
TyrThr: 1.229 ± 0.033
TyrVal: 1.693 ± 0.042
TyrTrp: 0.373 ± 0.019
TyrTyr: 0.654 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4047 proteins (1262338 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
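A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across 100 replicates is reported. The page does not say which statistic the ± value represents, so using the standard deviation is an assumption, as are the helper names `pair_frequency` and `bootstrap_error`, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is a standard deviation; the source
    only states that bootstrapping (x100) was done at the protein level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy input, not real Donghicola eburneus sequences.
toy_proteins = ["MAAL", "MAG", "MKAAV", "MLLA"]
point = pair_frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides within one protein stay together in every replicate.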

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski