Amino acid dipepetide frequency for Capitella teleta (Polychaete worm)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.625AlaAla: 5.625 ± 0.038
1.489AlaCys: 1.489 ± 0.015
3.658AlaAsp: 3.658 ± 0.022
4.267AlaGlu: 4.267 ± 0.023
2.886AlaPhe: 2.886 ± 0.018
3.703AlaGly: 3.703 ± 0.026
1.544AlaHis: 1.544 ± 0.013
3.735AlaIle: 3.735 ± 0.021
3.794AlaLys: 3.794 ± 0.023
6.309AlaLeu: 6.309 ± 0.033
1.881AlaMet: 1.881 ± 0.015
2.648AlaAsn: 2.648 ± 0.021
3.083AlaPro: 3.083 ± 0.025
2.523AlaGln: 2.523 ± 0.017
3.174AlaArg: 3.174 ± 0.019
5.53AlaSer: 5.53 ± 0.028
3.854AlaThr: 3.854 ± 0.024
4.707AlaVal: 4.707 ± 0.024
0.799AlaTrp: 0.799 ± 0.008
1.879AlaTyr: 1.879 ± 0.013
0.001AlaXaa: 0.001 ± 0.0
Cys
1.397CysAla: 1.397 ± 0.011
0.702CysCys: 0.702 ± 0.014
1.41CysAsp: 1.41 ± 0.017
1.429CysGlu: 1.429 ± 0.02
0.99CysPhe: 0.99 ± 0.011
1.517CysGly: 1.517 ± 0.015
0.746CysHis: 0.746 ± 0.012
1.315CysIle: 1.315 ± 0.014
1.233CysLys: 1.233 ± 0.015
2.253CysLeu: 2.253 ± 0.02
0.594CysMet: 0.594 ± 0.008
1.067CysAsn: 1.067 ± 0.013
1.27CysPro: 1.27 ± 0.038
0.993CysGln: 0.993 ± 0.015
1.27CysArg: 1.27 ± 0.013
2.077CysSer: 2.077 ± 0.02
1.266CysThr: 1.266 ± 0.017
1.603CysVal: 1.603 ± 0.015
0.285CysTrp: 0.285 ± 0.007
0.674CysTyr: 0.674 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
3.783AspAla: 3.783 ± 0.022
1.271AspCys: 1.271 ± 0.016
4.179AspAsp: 4.179 ± 0.034
4.213AspGlu: 4.213 ± 0.025
2.517AspPhe: 2.517 ± 0.016
3.555AspGly: 3.555 ± 0.028
1.479AspHis: 1.479 ± 0.013
3.347AspIle: 3.347 ± 0.02
2.649AspLys: 2.649 ± 0.018
5.377AspLeu: 5.377 ± 0.022
1.373AspMet: 1.373 ± 0.013
2.262AspAsn: 2.262 ± 0.019
2.739AspPro: 2.739 ± 0.019
2.015AspGln: 2.015 ± 0.017
2.731AspArg: 2.731 ± 0.02
4.125AspSer: 4.125 ± 0.025
2.611AspThr: 2.611 ± 0.017
3.954AspVal: 3.954 ± 0.024
0.748AspTrp: 0.748 ± 0.01
1.795AspTyr: 1.795 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
4.564GluAla: 4.564 ± 0.026
1.503GluCys: 1.503 ± 0.023
4.176GluAsp: 4.176 ± 0.027
6.459GluGlu: 6.459 ± 0.072
2.351GluPhe: 2.351 ± 0.015
3.472GluGly: 3.472 ± 0.024
1.518GluHis: 1.518 ± 0.013
3.42GluIle: 3.42 ± 0.022
4.242GluLys: 4.242 ± 0.03
5.28GluLeu: 5.28 ± 0.028
1.864GluMet: 1.864 ± 0.016
2.876GluAsn: 2.876 ± 0.021
2.284GluPro: 2.284 ± 0.021
2.522GluGln: 2.522 ± 0.017
3.577GluArg: 3.577 ± 0.026
4.185GluSer: 4.185 ± 0.023
3.331GluThr: 3.331 ± 0.019
4.132GluVal: 4.132 ± 0.022
0.795GluTrp: 0.795 ± 0.01
1.864GluTyr: 1.864 ± 0.016
0.001GluXaa: 0.001 ± 0.0
Phe
2.596PheAla: 2.596 ± 0.019
1.035PheCys: 1.035 ± 0.012
2.352PheAsp: 2.352 ± 0.015
2.282PheGlu: 2.282 ± 0.018
1.861PhePhe: 1.861 ± 0.015
2.524PheGly: 2.524 ± 0.017
1.144PheHis: 1.144 ± 0.01
2.425PheIle: 2.425 ± 0.018
2.113PheLys: 2.113 ± 0.015
3.897PheLeu: 3.897 ± 0.02
1.06PheMet: 1.06 ± 0.01
1.916PheAsn: 1.916 ± 0.015
1.802PhePro: 1.802 ± 0.014
1.498PheGln: 1.498 ± 0.012
2.014PheArg: 2.014 ± 0.015
3.274PheSer: 3.274 ± 0.021
2.35PheThr: 2.35 ± 0.015
2.837PheVal: 2.837 ± 0.019
0.586PheTrp: 0.586 ± 0.009
1.399PheTyr: 1.399 ± 0.013
0.001PheXaa: 0.001 ± 0.0
Gly
3.641GlyAla: 3.641 ± 0.024
1.407GlyCys: 1.407 ± 0.017
3.293GlyAsp: 3.293 ± 0.021
3.356GlyGlu: 3.356 ± 0.024
2.619GlyPhe: 2.619 ± 0.019
4.206GlyGly: 4.206 ± 0.036
1.597GlyHis: 1.597 ± 0.016
3.192GlyIle: 3.192 ± 0.019
3.208GlyLys: 3.208 ± 0.021
4.959GlyLeu: 4.959 ± 0.026
1.506GlyMet: 1.506 ± 0.013
2.565GlyAsn: 2.565 ± 0.02
2.281GlyPro: 2.281 ± 0.028
2.246GlyGln: 2.246 ± 0.017
3.243GlyArg: 3.243 ± 0.027
4.765GlySer: 4.765 ± 0.028
3.033GlyThr: 3.033 ± 0.025
3.89GlyVal: 3.89 ± 0.023
0.82GlyTrp: 0.82 ± 0.012
2.003GlyTyr: 2.003 ± 0.023
0.001GlyXaa: 0.001 ± 0.0
His
1.578HisAla: 1.578 ± 0.013
0.743HisCys: 0.743 ± 0.009
1.316HisAsp: 1.316 ± 0.015
1.445HisGlu: 1.445 ± 0.013
1.216HisPhe: 1.216 ± 0.011
1.564HisGly: 1.564 ± 0.015
0.912HisHis: 0.912 ± 0.012
1.38HisIle: 1.38 ± 0.014
1.362HisLys: 1.362 ± 0.012
2.859HisLeu: 2.859 ± 0.022
0.724HisMet: 0.724 ± 0.01
1.129HisAsn: 1.129 ± 0.011
1.431HisPro: 1.431 ± 0.013
1.184HisGln: 1.184 ± 0.013
1.601HisArg: 1.601 ± 0.014
2.125HisSer: 2.125 ± 0.016
1.377HisThr: 1.377 ± 0.016
1.729HisVal: 1.729 ± 0.015
0.404HisTrp: 0.404 ± 0.007
0.889HisTyr: 0.889 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
3.686IleAla: 3.686 ± 0.022
1.342IleCys: 1.342 ± 0.013
2.957IleAsp: 2.957 ± 0.018
3.209IleGlu: 3.209 ± 0.02
2.279IlePhe: 2.279 ± 0.019
3.156IleGly: 3.156 ± 0.021
1.443IleHis: 1.443 ± 0.011
3.065IleIle: 3.065 ± 0.019
2.767IleLys: 2.767 ± 0.017
4.962IleLeu: 4.962 ± 0.026
1.336IleMet: 1.336 ± 0.013
2.357IleAsn: 2.357 ± 0.018
2.698IlePro: 2.698 ± 0.016
2.195IleGln: 2.195 ± 0.018
2.865IleArg: 2.865 ± 0.018
4.286IleSer: 4.286 ± 0.022
3.154IleThr: 3.154 ± 0.019
3.511IleVal: 3.511 ± 0.019
0.652IleTrp: 0.652 ± 0.009
1.636IleTyr: 1.636 ± 0.014
0.0IleXaa: 0.0 ± 0.0
Lys
3.847LysAla: 3.847 ± 0.024
1.279LysCys: 1.279 ± 0.015
3.162LysAsp: 3.162 ± 0.022
4.258LysGlu: 4.258 ± 0.033
2.007LysPhe: 2.007 ± 0.014
3.029LysGly: 3.029 ± 0.018
1.499LysHis: 1.499 ± 0.016
2.892LysIle: 2.892 ± 0.018
4.776LysLys: 4.776 ± 0.046
4.807LysLeu: 4.807 ± 0.027
1.583LysMet: 1.583 ± 0.014
2.444LysAsn: 2.444 ± 0.018
2.571LysPro: 2.571 ± 0.021
2.438LysGln: 2.438 ± 0.019
3.71LysArg: 3.71 ± 0.024
4.065LysSer: 4.065 ± 0.027
3.261LysThr: 3.261 ± 0.021
3.526LysVal: 3.526 ± 0.021
0.759LysTrp: 0.759 ± 0.009
1.89LysTyr: 1.89 ± 0.026
0.001LysXaa: 0.001 ± 0.0
Leu
6.112LeuAla: 6.112 ± 0.033
2.27LeuCys: 2.27 ± 0.019
4.912LeuAsp: 4.912 ± 0.026
5.287LeuGlu: 5.287 ± 0.029
3.659LeuPhe: 3.659 ± 0.02
4.679LeuGly: 4.679 ± 0.021
2.72LeuHis: 2.72 ± 0.02
4.649LeuIle: 4.649 ± 0.028
5.651LeuLys: 5.651 ± 0.029
9.306LeuLeu: 9.306 ± 0.046
2.423LeuMet: 2.423 ± 0.015
4.049LeuAsn: 4.049 ± 0.021
4.829LeuPro: 4.829 ± 0.025
4.304LeuGln: 4.304 ± 0.021
5.311LeuArg: 5.311 ± 0.028
7.43LeuSer: 7.43 ± 0.034
5.291LeuThr: 5.291 ± 0.024
5.744LeuVal: 5.744 ± 0.031
1.107LeuTrp: 1.107 ± 0.01
2.647LeuTyr: 2.647 ± 0.018
0.001LeuXaa: 0.001 ± 0.0
Met
2.168MetAla: 2.168 ± 0.017
0.539MetCys: 0.539 ± 0.009
1.536MetAsp: 1.536 ± 0.014
1.753MetGlu: 1.753 ± 0.013
0.964MetPhe: 0.964 ± 0.011
1.351MetGly: 1.351 ± 0.013
0.694MetHis: 0.694 ± 0.009
1.162MetIle: 1.162 ± 0.012
1.718MetLys: 1.718 ± 0.014
2.368MetLeu: 2.368 ± 0.018
0.736MetMet: 0.736 ± 0.01
1.149MetAsn: 1.149 ± 0.011
1.276MetPro: 1.276 ± 0.012
1.195MetGln: 1.195 ± 0.012
1.466MetArg: 1.466 ± 0.014
1.959MetSer: 1.959 ± 0.013
1.68MetThr: 1.68 ± 0.014
1.611MetVal: 1.611 ± 0.014
0.309MetTrp: 0.309 ± 0.006
0.732MetTyr: 0.732 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.901AsnAla: 2.901 ± 0.018
1.069AsnCys: 1.069 ± 0.012
2.323AsnAsp: 2.323 ± 0.018
2.672AsnGlu: 2.672 ± 0.017
1.783AsnPhe: 1.783 ± 0.014
2.804AsnGly: 2.804 ± 0.021
1.122AsnHis: 1.122 ± 0.011
2.663AsnIle: 2.663 ± 0.019
2.331AsnLys: 2.331 ± 0.018
3.856AsnLeu: 3.856 ± 0.022
1.111AsnMet: 1.111 ± 0.011
2.018AsnAsn: 2.018 ± 0.03
2.251AsnPro: 2.251 ± 0.018
1.782AsnGln: 1.782 ± 0.015
2.251AsnArg: 2.251 ± 0.017
3.197AsnSer: 3.197 ± 0.02
2.382AsnThr: 2.382 ± 0.017
2.766AsnVal: 2.766 ± 0.016
0.522AsnTrp: 0.522 ± 0.008
1.352AsnTyr: 1.352 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
3.155ProAla: 3.155 ± 0.024
1.071ProCys: 1.071 ± 0.02
2.811ProAsp: 2.811 ± 0.019
3.11ProGlu: 3.11 ± 0.021
1.812ProPhe: 1.812 ± 0.015
2.933ProGly: 2.933 ± 0.028
1.257ProHis: 1.257 ± 0.013
2.182ProIle: 2.182 ± 0.017
2.63ProLys: 2.63 ± 0.023
4.178ProLeu: 4.178 ± 0.02
1.141ProMet: 1.141 ± 0.012
2.05ProAsn: 2.05 ± 0.018
3.736ProPro: 3.736 ± 0.039
2.015ProGln: 2.015 ± 0.019
2.48ProArg: 2.48 ± 0.015
4.442ProSer: 4.442 ± 0.029
2.811ProThr: 2.811 ± 0.023
3.13ProVal: 3.13 ± 0.023
0.586ProTrp: 0.586 ± 0.008
1.353ProTyr: 1.353 ± 0.013
0.001ProXaa: 0.001 ± 0.0
Gln
2.604GlnAla: 2.604 ± 0.021
1.036GlnCys: 1.036 ± 0.016
2.039GlnAsp: 2.039 ± 0.014
2.791GlnGlu: 2.791 ± 0.025
1.488GlnPhe: 1.488 ± 0.012
2.124GlnGly: 2.124 ± 0.017
1.289GlnHis: 1.289 ± 0.014
1.991GlnIle: 1.991 ± 0.015
2.458GlnLys: 2.458 ± 0.02
3.917GlnLeu: 3.917 ± 0.024
1.219GlnMet: 1.219 ± 0.012
1.761GlnAsn: 1.761 ± 0.014
2.088GlnPro: 2.088 ± 0.019
2.551GlnGln: 2.551 ± 0.045
2.629GlnArg: 2.629 ± 0.021
2.928GlnSer: 2.928 ± 0.019
2.175GlnThr: 2.175 ± 0.018
2.447GlnVal: 2.447 ± 0.017
0.578GlnTrp: 0.578 ± 0.008
1.232GlnTyr: 1.232 ± 0.014
0.0GlnXaa: 0.0 ± 0.0
Arg
3.143ArgAla: 3.143 ± 0.019
1.27ArgCys: 1.27 ± 0.013
2.918ArgAsp: 2.918 ± 0.019
3.454ArgGlu: 3.454 ± 0.022
2.055ArgPhe: 2.055 ± 0.017
3.0ArgGly: 3.0 ± 0.025
1.581ArgHis: 1.581 ± 0.014
2.844ArgIle: 2.844 ± 0.017
3.701ArgLys: 3.701 ± 0.023
5.044ArgLeu: 5.044 ± 0.024
1.534ArgMet: 1.534 ± 0.013
2.443ArgAsn: 2.443 ± 0.017
2.565ArgPro: 2.565 ± 0.019
2.353ArgGln: 2.353 ± 0.018
4.005ArgArg: 4.005 ± 0.03
4.243ArgSer: 4.243 ± 0.027
2.765ArgThr: 2.765 ± 0.021
3.326ArgVal: 3.326 ± 0.023
0.698ArgTrp: 0.698 ± 0.008
1.632ArgTyr: 1.632 ± 0.013
0.001ArgXaa: 0.001 ± 0.0
Ser
5.328SerAla: 5.328 ± 0.026
1.86SerCys: 1.86 ± 0.02
4.421SerAsp: 4.421 ± 0.025
4.507SerGlu: 4.507 ± 0.023
3.275SerPhe: 3.275 ± 0.019
4.746SerGly: 4.746 ± 0.028
2.105SerHis: 2.105 ± 0.014
4.025SerIle: 4.025 ± 0.022
4.112SerLys: 4.112 ± 0.022
7.439SerLeu: 7.439 ± 0.033
2.003SerMet: 2.003 ± 0.016
3.262SerAsn: 3.262 ± 0.02
4.111SerPro: 4.111 ± 0.029
3.092SerGln: 3.092 ± 0.022
4.035SerArg: 4.035 ± 0.027
7.817SerSer: 7.817 ± 0.046
4.775SerThr: 4.775 ± 0.029
5.127SerVal: 5.127 ± 0.025
0.962SerTrp: 0.962 ± 0.011
2.25SerTyr: 2.25 ± 0.017
0.001SerXaa: 0.001 ± 0.0
Thr
3.798ThrAla: 3.798 ± 0.023
1.426ThrCys: 1.426 ± 0.021
3.102ThrAsp: 3.102 ± 0.02
3.558ThrGlu: 3.558 ± 0.024
2.356ThrPhe: 2.356 ± 0.017
3.493ThrGly: 3.493 ± 0.025
1.366ThrHis: 1.366 ± 0.014
2.976ThrIle: 2.976 ± 0.02
2.94ThrLys: 2.94 ± 0.017
5.227ThrLeu: 5.227 ± 0.03
1.358ThrMet: 1.358 ± 0.013
2.271ThrAsn: 2.271 ± 0.017
3.163ThrPro: 3.163 ± 0.028
2.144ThrGln: 2.144 ± 0.019
2.633ThrArg: 2.633 ± 0.019
4.689ThrSer: 4.689 ± 0.026
3.661ThrThr: 3.661 ± 0.052
3.659ThrVal: 3.659 ± 0.028
0.799ThrTrp: 0.799 ± 0.009
1.614ThrTyr: 1.614 ± 0.015
0.001ThrXaa: 0.001 ± 0.0
Val
4.528ValAla: 4.528 ± 0.027
1.703ValCys: 1.703 ± 0.018
3.752ValAsp: 3.752 ± 0.024
3.912ValGlu: 3.912 ± 0.022
2.871ValPhe: 2.871 ± 0.022
3.4ValGly: 3.4 ± 0.026
1.756ValHis: 1.756 ± 0.014
3.95ValIle: 3.95 ± 0.025
3.562ValLys: 3.562 ± 0.024
6.109ValLeu: 6.109 ± 0.027
1.718ValMet: 1.718 ± 0.015
2.868ValAsn: 2.868 ± 0.019
2.933ValPro: 2.933 ± 0.019
2.55ValGln: 2.55 ± 0.017
3.152ValArg: 3.152 ± 0.019
4.826ValSer: 4.826 ± 0.026
3.949ValThr: 3.949 ± 0.025
4.685ValVal: 4.685 ± 0.031
0.813ValTrp: 0.813 ± 0.009
2.06ValTyr: 2.06 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
0.715TrpAla: 0.715 ± 0.009
0.289TrpCys: 0.289 ± 0.007
0.63TrpAsp: 0.63 ± 0.008
0.667TrpGlu: 0.667 ± 0.008
0.531TrpPhe: 0.531 ± 0.009
0.689TrpGly: 0.689 ± 0.011
0.348TrpHis: 0.348 ± 0.006
0.71TrpIle: 0.71 ± 0.01
0.861TrpLys: 0.861 ± 0.01
1.307TrpLeu: 1.307 ± 0.011
0.419TrpMet: 0.419 ± 0.006
0.638TrpAsn: 0.638 ± 0.007
0.49TrpPro: 0.49 ± 0.008
0.553TrpGln: 0.553 ± 0.007
0.818TrpArg: 0.818 ± 0.009
0.983TrpSer: 0.983 ± 0.01
0.799TrpThr: 0.799 ± 0.01
0.773TrpVal: 0.773 ± 0.009
0.221TrpTrp: 0.221 ± 0.005
0.38TrpTyr: 0.38 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.918TyrAla: 1.918 ± 0.014
0.789TyrCys: 0.789 ± 0.024
1.776TyrAsp: 1.776 ± 0.015
1.786TyrGlu: 1.786 ± 0.016
1.445TyrPhe: 1.445 ± 0.013
1.887TyrGly: 1.887 ± 0.015
0.849TyrHis: 0.849 ± 0.01
1.634TyrIle: 1.634 ± 0.014
1.696TyrLys: 1.696 ± 0.025
2.928TyrLeu: 2.928 ± 0.02
0.783TyrMet: 0.783 ± 0.01
1.385TyrAsn: 1.385 ± 0.012
1.282TyrPro: 1.282 ± 0.013
1.189TyrGln: 1.189 ± 0.011
1.606TyrArg: 1.606 ± 0.014
2.302TyrSer: 2.302 ± 0.016
1.707TyrThr: 1.707 ± 0.016
1.938TyrVal: 1.938 ± 0.015
0.38TyrTrp: 0.38 ± 0.007
1.151TyrTyr: 1.151 ± 0.025
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 31134 proteins (11086238 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski