Amino acid dipepetide frequency for Caenorhabditis remanei (Caenorhabditis vulgaris)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.71AlaAla: 4.71 ± 0.037
1.02AlaCys: 1.02 ± 0.01
2.899AlaAsp: 2.899 ± 0.017
3.98AlaGlu: 3.98 ± 0.034
2.531AlaPhe: 2.531 ± 0.017
2.967AlaGly: 2.967 ± 0.021
1.259AlaHis: 1.259 ± 0.01
3.649AlaIle: 3.649 ± 0.018
3.614AlaLys: 3.614 ± 0.023
5.008AlaLeu: 5.008 ± 0.026
1.528AlaMet: 1.528 ± 0.012
2.506AlaAsn: 2.506 ± 0.017
3.346AlaPro: 3.346 ± 0.045
2.295AlaGln: 2.295 ± 0.021
2.918AlaArg: 2.918 ± 0.022
4.328AlaSer: 4.328 ± 0.028
3.641AlaThr: 3.641 ± 0.02
4.035AlaVal: 4.035 ± 0.02
0.548AlaTrp: 0.548 ± 0.007
1.668AlaTyr: 1.668 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
1.101CysAla: 1.101 ± 0.012
0.525CysCys: 0.525 ± 0.011
1.073CysAsp: 1.073 ± 0.016
1.197CysGlu: 1.197 ± 0.014
1.008CysPhe: 1.008 ± 0.009
1.237CysGly: 1.237 ± 0.014
0.5CysHis: 0.5 ± 0.008
1.148CysIle: 1.148 ± 0.013
1.065CysLys: 1.065 ± 0.012
1.761CysLeu: 1.761 ± 0.018
0.441CysMet: 0.441 ± 0.006
0.879CysAsn: 0.879 ± 0.009
1.041CysPro: 1.041 ± 0.016
0.827CysGln: 0.827 ± 0.011
1.08CysArg: 1.08 ± 0.012
1.69CysSer: 1.69 ± 0.017
1.042CysThr: 1.042 ± 0.012
1.221CysVal: 1.221 ± 0.015
0.222CysTrp: 0.222 ± 0.004
0.673CysTyr: 0.673 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
3.042AspAla: 3.042 ± 0.025
1.001AspCys: 1.001 ± 0.013
3.836AspAsp: 3.836 ± 0.029
4.507AspGlu: 4.507 ± 0.031
2.586AspPhe: 2.586 ± 0.017
3.352AspGly: 3.352 ± 0.053
1.144AspHis: 1.144 ± 0.011
3.172AspIle: 3.172 ± 0.016
2.834AspLys: 2.834 ± 0.019
4.427AspLeu: 4.427 ± 0.024
1.321AspMet: 1.321 ± 0.011
2.288AspAsn: 2.288 ± 0.015
2.289AspPro: 2.289 ± 0.018
1.919AspGln: 1.919 ± 0.015
2.572AspArg: 2.572 ± 0.019
4.079AspSer: 4.079 ± 0.022
2.544AspThr: 2.544 ± 0.015
3.624AspVal: 3.624 ± 0.019
0.663AspTrp: 0.663 ± 0.007
1.885AspTyr: 1.885 ± 0.015
0.0AspXaa: 0.0 ± 0.0
Glu
3.772GluAla: 3.772 ± 0.029
1.143GluCys: 1.143 ± 0.016
4.071GluAsp: 4.071 ± 0.025
6.862GluGlu: 6.862 ± 0.055
2.767GluPhe: 2.767 ± 0.015
2.683GluGly: 2.683 ± 0.02
1.475GluHis: 1.475 ± 0.014
4.407GluIle: 4.407 ± 0.023
6.267GluLys: 6.267 ± 0.041
5.443GluLeu: 5.443 ± 0.032
2.171GluMet: 2.171 ± 0.014
3.973GluAsn: 3.973 ± 0.02
2.466GluPro: 2.466 ± 0.025
2.718GluGln: 2.718 ± 0.041
3.518GluArg: 3.518 ± 0.023
4.547GluSer: 4.547 ± 0.036
4.01GluThr: 4.01 ± 0.044
3.917GluVal: 3.917 ± 0.025
0.781GluTrp: 0.781 ± 0.008
2.168GluTyr: 2.168 ± 0.015
0.001GluXaa: 0.001 ± 0.0
Phe
2.553PheAla: 2.553 ± 0.018
1.091PheCys: 1.091 ± 0.01
2.742PheAsp: 2.742 ± 0.018
2.998PheGlu: 2.998 ± 0.019
2.645PhePhe: 2.645 ± 0.02
2.818PheGly: 2.818 ± 0.018
1.186PheHis: 1.186 ± 0.01
2.922PheIle: 2.922 ± 0.018
2.542PheLys: 2.542 ± 0.018
4.645PheLeu: 4.645 ± 0.026
1.152PheMet: 1.152 ± 0.01
2.259PheAsn: 2.259 ± 0.014
2.095PhePro: 2.095 ± 0.014
2.011PheGln: 2.011 ± 0.014
2.336PheArg: 2.336 ± 0.018
3.873PheSer: 3.873 ± 0.023
2.427PheThr: 2.427 ± 0.015
3.077PheVal: 3.077 ± 0.021
0.608PheTrp: 0.608 ± 0.007
1.779PheTyr: 1.779 ± 0.013
0.0PheXaa: 0.0 ± 0.0
Gly
3.116GlyAla: 3.116 ± 0.026
1.068GlyCys: 1.068 ± 0.013
2.604GlyAsp: 2.604 ± 0.018
3.071GlyGlu: 3.071 ± 0.024
2.519GlyPhe: 2.519 ± 0.02
3.836GlyGly: 3.836 ± 0.049
1.2GlyHis: 1.2 ± 0.012
3.168GlyIle: 3.168 ± 0.017
3.363GlyLys: 3.363 ± 0.024
3.862GlyLeu: 3.862 ± 0.022
1.309GlyMet: 1.309 ± 0.012
2.628GlyAsn: 2.628 ± 0.026
2.118GlyPro: 2.118 ± 0.028
1.889GlyGln: 1.889 ± 0.02
2.713GlyArg: 2.713 ± 0.022
3.966GlySer: 3.966 ± 0.027
2.994GlyThr: 2.994 ± 0.023
3.292GlyVal: 3.292 ± 0.019
0.6GlyTrp: 0.6 ± 0.008
1.951GlyTyr: 1.951 ± 0.017
0.001GlyXaa: 0.001 ± 0.0
His
1.154HisAla: 1.154 ± 0.012
0.502HisCys: 0.502 ± 0.007
1.062HisAsp: 1.062 ± 0.01
1.291HisGlu: 1.291 ± 0.012
1.345HisPhe: 1.345 ± 0.012
1.237HisGly: 1.237 ± 0.011
0.855HisHis: 0.855 ± 0.015
1.361HisIle: 1.361 ± 0.012
1.144HisLys: 1.144 ± 0.011
2.246HisLeu: 2.246 ± 0.015
0.57HisMet: 0.57 ± 0.007
0.999HisAsn: 0.999 ± 0.009
1.218HisPro: 1.218 ± 0.011
1.088HisGln: 1.088 ± 0.011
1.393HisArg: 1.393 ± 0.011
1.783HisSer: 1.783 ± 0.012
1.063HisThr: 1.063 ± 0.009
1.522HisVal: 1.522 ± 0.015
0.326HisTrp: 0.326 ± 0.005
0.818HisTyr: 0.818 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.656IleAla: 3.656 ± 0.019
1.319IleCys: 1.319 ± 0.012
3.647IleAsp: 3.647 ± 0.019
4.176IleGlu: 4.176 ± 0.022
3.206IlePhe: 3.206 ± 0.021
3.344IleGly: 3.344 ± 0.019
1.526IleHis: 1.526 ± 0.012
3.785IleIle: 3.785 ± 0.022
3.27IleLys: 3.27 ± 0.02
5.733IleLeu: 5.733 ± 0.027
1.377IleMet: 1.377 ± 0.011
2.793IleAsn: 2.793 ± 0.017
3.337IlePro: 3.337 ± 0.018
2.608IleGln: 2.608 ± 0.017
3.452IleArg: 3.452 ± 0.017
4.964IleSer: 4.964 ± 0.026
3.334IleThr: 3.334 ± 0.02
4.013IleVal: 4.013 ± 0.023
0.7IleTrp: 0.7 ± 0.009
2.008IleTyr: 2.008 ± 0.017
0.001IleXaa: 0.001 ± 0.0
Lys
3.345LysAla: 3.345 ± 0.023
1.41LysCys: 1.41 ± 0.017
3.135LysAsp: 3.135 ± 0.02
4.998LysGlu: 4.998 ± 0.035
2.708LysPhe: 2.708 ± 0.017
2.436LysGly: 2.436 ± 0.017
1.363LysHis: 1.363 ± 0.012
4.133LysIle: 4.133 ± 0.025
6.404LysLys: 6.404 ± 0.047
5.787LysLeu: 5.787 ± 0.031
2.119LysMet: 2.119 ± 0.015
3.646LysAsn: 3.646 ± 0.018
2.709LysPro: 2.709 ± 0.026
2.439LysGln: 2.439 ± 0.017
3.84LysArg: 3.84 ± 0.021
4.831LysSer: 4.831 ± 0.024
4.172LysThr: 4.172 ± 0.021
3.645LysVal: 3.645 ± 0.021
0.85LysTrp: 0.85 ± 0.009
2.185LysTyr: 2.185 ± 0.014
0.001LysXaa: 0.001 ± 0.0
Leu
5.255LeuAla: 5.255 ± 0.023
1.65LeuCys: 1.65 ± 0.013
4.43LeuAsp: 4.43 ± 0.023
5.997LeuGlu: 5.997 ± 0.035
4.379LeuPhe: 4.379 ± 0.024
3.867LeuGly: 3.867 ± 0.022
2.057LeuHis: 2.057 ± 0.015
5.426LeuIle: 5.426 ± 0.025
6.269LeuLys: 6.269 ± 0.031
8.27LeuLeu: 8.27 ± 0.036
2.208LeuMet: 2.208 ± 0.015
4.314LeuAsn: 4.314 ± 0.024
4.347LeuPro: 4.347 ± 0.036
3.459LeuGln: 3.459 ± 0.021
4.661LeuArg: 4.661 ± 0.025
6.709LeuSer: 6.709 ± 0.026
4.766LeuThr: 4.766 ± 0.024
5.182LeuVal: 5.182 ± 0.023
0.86LeuTrp: 0.86 ± 0.009
2.502LeuTyr: 2.502 ± 0.017
0.001LeuXaa: 0.001 ± 0.0
Met
1.669MetAla: 1.669 ± 0.012
0.487MetCys: 0.487 ± 0.007
1.49MetAsp: 1.49 ± 0.013
1.941MetGlu: 1.941 ± 0.011
1.203MetPhe: 1.203 ± 0.011
1.129MetGly: 1.129 ± 0.012
0.535MetHis: 0.535 ± 0.007
1.66MetIle: 1.66 ± 0.011
2.037MetLys: 2.037 ± 0.015
2.068MetLeu: 2.068 ± 0.015
0.856MetMet: 0.856 ± 0.01
1.465MetAsn: 1.465 ± 0.012
1.117MetPro: 1.117 ± 0.011
0.909MetGln: 0.909 ± 0.008
1.416MetArg: 1.416 ± 0.011
2.237MetSer: 2.237 ± 0.015
1.685MetThr: 1.685 ± 0.012
1.439MetVal: 1.439 ± 0.011
0.251MetTrp: 0.251 ± 0.004
0.765MetTyr: 0.765 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.747AsnAla: 2.747 ± 0.015
1.043AsnCys: 1.043 ± 0.011
2.536AsnAsp: 2.536 ± 0.014
3.257AsnGlu: 3.257 ± 0.018
2.386AsnPhe: 2.386 ± 0.017
3.261AsnGly: 3.261 ± 0.022
1.149AsnHis: 1.149 ± 0.011
2.919AsnIle: 2.919 ± 0.015
2.666AsnLys: 2.666 ± 0.017
4.316AsnLeu: 4.316 ± 0.019
1.283AsnMet: 1.283 ± 0.012
2.58AsnAsn: 2.58 ± 0.019
2.336AsnPro: 2.336 ± 0.016
2.178AsnGln: 2.178 ± 0.017
2.632AsnArg: 2.632 ± 0.015
3.997AsnSer: 3.997 ± 0.022
2.552AsnThr: 2.552 ± 0.015
3.159AsnVal: 3.159 ± 0.019
0.629AsnTrp: 0.629 ± 0.008
1.762AsnTyr: 1.762 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
3.096ProAla: 3.096 ± 0.037
0.664ProCys: 0.664 ± 0.013
2.363ProAsp: 2.363 ± 0.031
3.263ProGlu: 3.263 ± 0.029
2.078ProPhe: 2.078 ± 0.015
2.54ProGly: 2.54 ± 0.036
1.003ProHis: 1.003 ± 0.011
3.074ProIle: 3.074 ± 0.02
2.89ProLys: 2.89 ± 0.023
3.803ProLeu: 3.803 ± 0.03
1.126ProMet: 1.126 ± 0.011
2.263ProAsn: 2.263 ± 0.014
3.906ProPro: 3.906 ± 0.044
1.921ProGln: 1.921 ± 0.02
2.365ProArg: 2.365 ± 0.02
4.199ProSer: 4.199 ± 0.028
3.323ProThr: 3.323 ± 0.04
3.214ProVal: 3.214 ± 0.056
0.424ProTrp: 0.424 ± 0.006
1.378ProTyr: 1.378 ± 0.01
0.001ProXaa: 0.001 ± 0.0
Gln
2.088GlnAla: 2.088 ± 0.027
0.837GlnCys: 0.837 ± 0.012
1.555GlnAsp: 1.555 ± 0.014
2.522GlnGlu: 2.522 ± 0.031
1.905GlnPhe: 1.905 ± 0.013
1.61GlnGly: 1.61 ± 0.017
0.985GlnHis: 0.985 ± 0.011
2.455GlnIle: 2.455 ± 0.014
3.112GlnLys: 3.112 ± 0.02
3.809GlnLeu: 3.809 ± 0.021
1.338GlnMet: 1.338 ± 0.013
2.276GlnAsn: 2.276 ± 0.017
1.942GlnPro: 1.942 ± 0.022
2.58GlnGln: 2.58 ± 0.048
2.179GlnArg: 2.179 ± 0.017
2.767GlnSer: 2.767 ± 0.019
2.165GlnThr: 2.165 ± 0.015
2.193GlnVal: 2.193 ± 0.014
0.486GlnTrp: 0.486 ± 0.007
1.314GlnTyr: 1.314 ± 0.011
0.0GlnXaa: 0.0 ± 0.0
Arg
2.856ArgAla: 2.856 ± 0.018
0.994ArgCys: 0.994 ± 0.012
2.64ArgAsp: 2.64 ± 0.018
3.479ArgGlu: 3.479 ± 0.022
2.479ArgPhe: 2.479 ± 0.015
2.558ArgGly: 2.558 ± 0.022
1.341ArgHis: 1.341 ± 0.011
3.393ArgIle: 3.393 ± 0.018
4.131ArgLys: 4.131 ± 0.024
4.564ArgLeu: 4.564 ± 0.021
1.461ArgMet: 1.461 ± 0.011
2.935ArgAsn: 2.935 ± 0.018
2.305ArgPro: 2.305 ± 0.019
2.189ArgGln: 2.189 ± 0.014
4.137ArgArg: 4.137 ± 0.027
3.864ArgSer: 3.864 ± 0.028
2.768ArgThr: 2.768 ± 0.017
3.082ArgVal: 3.082 ± 0.02
0.561ArgTrp: 0.561 ± 0.007
1.633ArgTyr: 1.633 ± 0.011
0.001ArgXaa: 0.001 ± 0.0
Ser
4.588SerAla: 4.588 ± 0.023
1.452SerCys: 1.452 ± 0.017
4.293SerAsp: 4.293 ± 0.021
5.099SerGlu: 5.099 ± 0.032
3.657SerPhe: 3.657 ± 0.021
4.191SerGly: 4.191 ± 0.028
1.682SerHis: 1.682 ± 0.012
4.957SerIle: 4.957 ± 0.024
4.689SerLys: 4.689 ± 0.025
6.588SerLeu: 6.588 ± 0.032
1.901SerMet: 1.901 ± 0.013
3.817SerAsn: 3.817 ± 0.023
3.884SerPro: 3.884 ± 0.029
2.998SerGln: 2.998 ± 0.018
4.004SerArg: 4.004 ± 0.024
8.448SerSer: 8.448 ± 0.067
5.43SerThr: 5.43 ± 0.047
4.828SerVal: 4.828 ± 0.025
0.816SerTrp: 0.816 ± 0.009
2.284SerTyr: 2.284 ± 0.016
0.001SerXaa: 0.001 ± 0.0
Thr
3.568ThrAla: 3.568 ± 0.018
1.218ThrCys: 1.218 ± 0.015
2.918ThrAsp: 2.918 ± 0.043
3.603ThrGlu: 3.603 ± 0.042
2.683ThrPhe: 2.683 ± 0.016
2.984ThrGly: 2.984 ± 0.02
1.19ThrHis: 1.19 ± 0.009
3.902ThrIle: 3.902 ± 0.02
3.281ThrLys: 3.281 ± 0.018
4.771ThrLeu: 4.771 ± 0.019
1.413ThrMet: 1.413 ± 0.012
2.634ThrAsn: 2.634 ± 0.017
3.468ThrPro: 3.468 ± 0.036
1.948ThrGln: 1.948 ± 0.015
2.717ThrArg: 2.717 ± 0.017
5.233ThrSer: 5.233 ± 0.041
4.916ThrThr: 4.916 ± 0.07
4.2ThrVal: 4.2 ± 0.026
0.638ThrTrp: 0.638 ± 0.007
1.641ThrTyr: 1.641 ± 0.013
0.001ThrXaa: 0.001 ± 0.0
Val
3.877ValAla: 3.877 ± 0.02
1.276ValCys: 1.276 ± 0.012
3.487ValAsp: 3.487 ± 0.018
4.364ValGlu: 4.364 ± 0.026
3.291ValPhe: 3.291 ± 0.019
2.883ValGly: 2.883 ± 0.018
1.441ValHis: 1.441 ± 0.012
3.938ValIle: 3.938 ± 0.024
3.841ValLys: 3.841 ± 0.021
5.56ValLeu: 5.56 ± 0.027
1.578ValMet: 1.578 ± 0.013
2.838ValAsn: 2.838 ± 0.016
3.115ValPro: 3.115 ± 0.03
2.349ValGln: 2.349 ± 0.025
3.02ValArg: 3.02 ± 0.019
4.757ValSer: 4.757 ± 0.018
3.755ValThr: 3.755 ± 0.028
4.31ValVal: 4.31 ± 0.022
0.692ValTrp: 0.692 ± 0.008
2.044ValTyr: 2.044 ± 0.013
0.001ValXaa: 0.001 ± 0.0
Trp
0.567TrpAla: 0.567 ± 0.007
0.223TrpCys: 0.223 ± 0.005
0.542TrpAsp: 0.542 ± 0.007
0.604TrpGlu: 0.604 ± 0.008
0.573TrpPhe: 0.573 ± 0.007
0.43TrpGly: 0.43 ± 0.007
0.235TrpHis: 0.235 ± 0.004
0.884TrpIle: 0.884 ± 0.009
1.003TrpLys: 1.003 ± 0.011
0.983TrpLeu: 0.983 ± 0.009
0.389TrpMet: 0.389 ± 0.006
0.753TrpAsn: 0.753 ± 0.008
0.42TrpPro: 0.42 ± 0.007
0.415TrpGln: 0.415 ± 0.007
0.647TrpArg: 0.647 ± 0.008
0.815TrpSer: 0.815 ± 0.009
0.689TrpThr: 0.689 ± 0.008
0.573TrpVal: 0.573 ± 0.008
0.153TrpTrp: 0.153 ± 0.004
0.369TrpTyr: 0.369 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.689TyrAla: 1.689 ± 0.012
0.811TyrCys: 0.811 ± 0.009
1.76TyrAsp: 1.76 ± 0.012
1.948TyrGlu: 1.948 ± 0.014
1.808TyrPhe: 1.808 ± 0.016
1.908TyrGly: 1.908 ± 0.014
0.852TyrHis: 0.852 ± 0.009
1.894TyrIle: 1.894 ± 0.015
1.735TyrLys: 1.735 ± 0.013
2.965TyrLeu: 2.965 ± 0.02
0.817TyrMet: 0.817 ± 0.009
1.544TyrAsn: 1.544 ± 0.012
1.436TyrPro: 1.436 ± 0.018
1.379TyrGln: 1.379 ± 0.012
1.773TyrArg: 1.773 ± 0.014
2.518TyrSer: 2.518 ± 0.018
1.648TyrThr: 1.648 ± 0.013
1.89TyrVal: 1.89 ± 0.014
0.453TyrTrp: 0.453 ± 0.007
1.288TyrTyr: 1.288 ± 0.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.085XaaXaa: 0.085 ± 0.034
Statistics based on 31252 proteins (12547433 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski