Amino acid dipepetide frequency for Diploscapter pachys

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.053AlaAla: 7.053 ± 0.039
1.195AlaCys: 1.195 ± 0.011
3.869AlaAsp: 3.869 ± 0.022
4.833AlaGlu: 4.833 ± 0.026
2.761AlaPhe: 2.761 ± 0.014
4.403AlaGly: 4.403 ± 0.024
1.623AlaHis: 1.623 ± 0.012
4.32AlaIle: 4.32 ± 0.019
4.073AlaLys: 4.073 ± 0.023
6.376AlaLeu: 6.376 ± 0.028
1.881AlaMet: 1.881 ± 0.013
3.192AlaAsn: 3.192 ± 0.016
3.569AlaPro: 3.569 ± 0.025
3.194AlaGln: 3.194 ± 0.018
4.045AlaArg: 4.045 ± 0.02
5.46AlaSer: 5.46 ± 0.026
4.083AlaThr: 4.083 ± 0.022
4.721AlaVal: 4.721 ± 0.022
0.672AlaTrp: 0.672 ± 0.008
1.91AlaTyr: 1.91 ± 0.013
0.002AlaXaa: 0.002 ± 0.0
Cys
1.18CysAla: 1.18 ± 0.012
0.504CysCys: 0.504 ± 0.009
1.069CysAsp: 1.069 ± 0.016
1.077CysGlu: 1.077 ± 0.013
0.747CysPhe: 0.747 ± 0.007
1.214CysGly: 1.214 ± 0.013
0.455CysHis: 0.455 ± 0.006
1.084CysIle: 1.084 ± 0.013
0.993CysLys: 0.993 ± 0.012
1.62CysLeu: 1.62 ± 0.014
0.441CysMet: 0.441 ± 0.006
0.876CysAsn: 0.876 ± 0.012
1.005CysPro: 1.005 ± 0.015
0.791CysGln: 0.791 ± 0.011
1.136CysArg: 1.136 ± 0.012
1.541CysSer: 1.541 ± 0.016
0.941CysThr: 0.941 ± 0.011
1.144CysVal: 1.144 ± 0.011
0.208CysTrp: 0.208 ± 0.004
0.55CysTyr: 0.55 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
4.003AspAla: 4.003 ± 0.023
0.962AspCys: 0.962 ± 0.014
3.969AspAsp: 3.969 ± 0.024
4.951AspGlu: 4.951 ± 0.027
2.264AspPhe: 2.264 ± 0.014
3.533AspGly: 3.533 ± 0.022
1.173AspHis: 1.173 ± 0.011
2.969AspIle: 2.969 ± 0.016
3.236AspLys: 3.236 ± 0.024
4.637AspLeu: 4.637 ± 0.02
1.315AspMet: 1.315 ± 0.011
2.145AspAsn: 2.145 ± 0.013
2.666AspPro: 2.666 ± 0.016
2.156AspGln: 2.156 ± 0.014
3.381AspArg: 3.381 ± 0.023
4.061AspSer: 4.061 ± 0.021
2.506AspThr: 2.506 ± 0.012
3.438AspVal: 3.438 ± 0.017
0.664AspTrp: 0.664 ± 0.008
1.734AspTyr: 1.734 ± 0.013
0.001AspXaa: 0.001 ± 0.0
Glu
4.713GluAla: 4.713 ± 0.024
1.081GluCys: 1.081 ± 0.012
3.989GluAsp: 3.989 ± 0.019
6.369GluGlu: 6.369 ± 0.039
2.329GluPhe: 2.329 ± 0.013
3.318GluGly: 3.318 ± 0.018
1.527GluHis: 1.527 ± 0.012
3.783GluIle: 3.783 ± 0.022
5.316GluLys: 5.316 ± 0.032
5.569GluLeu: 5.569 ± 0.025
2.009GluMet: 2.009 ± 0.013
3.281GluAsn: 3.281 ± 0.016
2.653GluPro: 2.653 ± 0.024
3.287GluGln: 3.287 ± 0.021
4.251GluArg: 4.251 ± 0.026
4.287GluSer: 4.287 ± 0.023
3.405GluThr: 3.405 ± 0.02
3.56GluVal: 3.56 ± 0.02
0.781GluTrp: 0.781 ± 0.007
1.793GluTyr: 1.793 ± 0.013
0.001GluXaa: 0.001 ± 0.0
Phe
2.793PheAla: 2.793 ± 0.017
0.895PheCys: 0.895 ± 0.009
2.592PheAsp: 2.592 ± 0.014
2.525PheGlu: 2.525 ± 0.015
1.845PhePhe: 1.845 ± 0.014
2.762PheGly: 2.762 ± 0.015
0.958PheHis: 0.958 ± 0.008
2.376PheIle: 2.376 ± 0.015
1.864PheLys: 1.864 ± 0.013
3.522PheLeu: 3.522 ± 0.02
0.953PheMet: 0.953 ± 0.008
1.775PheAsn: 1.775 ± 0.012
1.75PhePro: 1.75 ± 0.012
1.404PheGln: 1.404 ± 0.01
2.032PheArg: 2.032 ± 0.013
3.036PheSer: 3.036 ± 0.017
2.136PheThr: 2.136 ± 0.013
2.77PheVal: 2.77 ± 0.015
0.478PheTrp: 0.478 ± 0.006
1.458PheTyr: 1.458 ± 0.01
0.001PheXaa: 0.001 ± 0.0
Gly
3.986GlyAla: 3.986 ± 0.022
1.083GlyCys: 1.083 ± 0.012
3.274GlyAsp: 3.274 ± 0.019
3.59GlyGlu: 3.59 ± 0.021
2.367GlyPhe: 2.367 ± 0.016
4.578GlyGly: 4.578 ± 0.037
1.457GlyHis: 1.457 ± 0.013
3.29GlyIle: 3.29 ± 0.016
3.515GlyLys: 3.515 ± 0.021
4.595GlyLeu: 4.595 ± 0.02
1.541GlyMet: 1.541 ± 0.014
2.743GlyAsn: 2.743 ± 0.018
2.361GlyPro: 2.361 ± 0.02
2.7GlyGln: 2.7 ± 0.016
3.642GlyArg: 3.642 ± 0.022
4.503GlySer: 4.503 ± 0.023
3.194GlyThr: 3.194 ± 0.02
3.497GlyVal: 3.497 ± 0.018
0.71GlyTrp: 0.71 ± 0.008
1.89GlyTyr: 1.89 ± 0.014
0.001GlyXaa: 0.001 ± 0.0
His
1.518HisAla: 1.518 ± 0.012
0.5HisCys: 0.5 ± 0.008
1.144HisAsp: 1.144 ± 0.009
1.367HisGlu: 1.367 ± 0.012
1.102HisPhe: 1.102 ± 0.008
1.382HisGly: 1.382 ± 0.011
0.826HisHis: 0.826 ± 0.011
1.289HisIle: 1.289 ± 0.01
1.164HisLys: 1.164 ± 0.011
2.293HisLeu: 2.293 ± 0.012
0.548HisMet: 0.548 ± 0.006
1.002HisAsn: 1.002 ± 0.009
1.336HisPro: 1.336 ± 0.011
1.094HisGln: 1.094 ± 0.01
1.732HisArg: 1.732 ± 0.014
1.915HisSer: 1.915 ± 0.012
1.064HisThr: 1.064 ± 0.008
1.407HisVal: 1.407 ± 0.011
0.287HisTrp: 0.287 ± 0.005
0.778HisTyr: 0.778 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
4.316IleAla: 4.316 ± 0.019
1.237IleCys: 1.237 ± 0.011
3.614IleAsp: 3.614 ± 0.016
3.882IleGlu: 3.882 ± 0.019
2.457IlePhe: 2.457 ± 0.014
3.518IleGly: 3.518 ± 0.019
1.332IleHis: 1.332 ± 0.01
3.061IleIle: 3.061 ± 0.017
2.873IleLys: 2.873 ± 0.017
4.847IleLeu: 4.847 ± 0.023
1.202IleMet: 1.202 ± 0.009
2.427IleAsn: 2.427 ± 0.013
2.869IlePro: 2.869 ± 0.016
2.232IleGln: 2.232 ± 0.016
3.3IleArg: 3.3 ± 0.018
4.418IleSer: 4.418 ± 0.023
2.822IleThr: 2.822 ± 0.015
3.809IleVal: 3.809 ± 0.018
0.621IleTrp: 0.621 ± 0.007
1.804IleTyr: 1.804 ± 0.012
0.0IleXaa: 0.0 ± 0.0
Lys
3.837LysAla: 3.837 ± 0.022
1.079LysCys: 1.079 ± 0.013
3.293LysAsp: 3.293 ± 0.028
4.716LysGlu: 4.716 ± 0.028
2.185LysPhe: 2.185 ± 0.013
2.714LysGly: 2.714 ± 0.017
1.237LysHis: 1.237 ± 0.01
3.474LysIle: 3.474 ± 0.021
5.189LysLys: 5.189 ± 0.045
5.022LysLeu: 5.022 ± 0.026
1.721LysMet: 1.721 ± 0.013
2.729LysAsn: 2.729 ± 0.017
2.798LysPro: 2.798 ± 0.033
2.501LysGln: 2.501 ± 0.018
3.757LysArg: 3.757 ± 0.02
4.145LysSer: 4.145 ± 0.021
3.133LysThr: 3.133 ± 0.018
3.095LysVal: 3.095 ± 0.017
0.714LysTrp: 0.714 ± 0.008
1.759LysTyr: 1.759 ± 0.012
0.001LysXaa: 0.001 ± 0.0
Leu
6.585LeuAla: 6.585 ± 0.029
1.685LeuCys: 1.685 ± 0.013
4.7LeuAsp: 4.7 ± 0.02
5.426LeuGlu: 5.426 ± 0.026
3.72LeuPhe: 3.72 ± 0.02
4.497LeuGly: 4.497 ± 0.019
2.247LeuHis: 2.247 ± 0.015
4.872LeuIle: 4.872 ± 0.022
5.087LeuLys: 5.087 ± 0.027
8.338LeuLeu: 8.338 ± 0.039
2.097LeuMet: 2.097 ± 0.013
3.888LeuAsn: 3.888 ± 0.018
4.817LeuPro: 4.817 ± 0.021
3.858LeuGln: 3.858 ± 0.023
5.054LeuArg: 5.054 ± 0.023
6.613LeuSer: 6.613 ± 0.025
4.686LeuThr: 4.686 ± 0.019
5.113LeuVal: 5.113 ± 0.021
0.881LeuTrp: 0.881 ± 0.008
2.389LeuTyr: 2.389 ± 0.012
0.001LeuXaa: 0.001 ± 0.0
Met
1.969MetAla: 1.969 ± 0.014
0.447MetCys: 0.447 ± 0.006
1.356MetAsp: 1.356 ± 0.011
1.624MetGlu: 1.624 ± 0.012
0.93MetPhe: 0.93 ± 0.009
1.397MetGly: 1.397 ± 0.013
0.63MetHis: 0.63 ± 0.007
1.389MetIle: 1.389 ± 0.009
1.659MetLys: 1.659 ± 0.012
2.137MetLeu: 2.137 ± 0.013
0.758MetMet: 0.758 ± 0.009
1.247MetAsn: 1.247 ± 0.01
1.403MetPro: 1.403 ± 0.01
1.284MetGln: 1.284 ± 0.011
1.541MetArg: 1.541 ± 0.011
1.961MetSer: 1.961 ± 0.012
1.501MetThr: 1.501 ± 0.01
1.326MetVal: 1.326 ± 0.009
0.244MetTrp: 0.244 ± 0.004
0.582MetTyr: 0.582 ± 0.007
0.0MetXaa: 0.0 ± 0.0
Asn
3.144AsnAla: 3.144 ± 0.014
0.897AsnCys: 0.897 ± 0.01
2.522AsnAsp: 2.522 ± 0.015
3.182AsnGlu: 3.182 ± 0.018
1.833AsnPhe: 1.833 ± 0.014
3.4AsnGly: 3.4 ± 0.022
0.972AsnHis: 0.972 ± 0.01
2.448AsnIle: 2.448 ± 0.016
2.37AsnLys: 2.37 ± 0.014
3.828AsnLeu: 3.828 ± 0.018
1.099AsnMet: 1.099 ± 0.009
2.136AsnAsn: 2.136 ± 0.016
2.364AsnPro: 2.364 ± 0.017
2.069AsnGln: 2.069 ± 0.015
2.569AsnArg: 2.569 ± 0.014
3.823AsnSer: 3.823 ± 0.02
2.15AsnThr: 2.15 ± 0.013
2.682AsnVal: 2.682 ± 0.014
0.542AsnTrp: 0.542 ± 0.007
1.504AsnTyr: 1.504 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
3.828ProAla: 3.828 ± 0.027
0.702ProCys: 0.702 ± 0.01
2.61ProAsp: 2.61 ± 0.018
3.245ProGlu: 3.245 ± 0.021
1.984ProPhe: 1.984 ± 0.012
2.743ProGly: 2.743 ± 0.023
1.127ProHis: 1.127 ± 0.01
2.861ProIle: 2.861 ± 0.018
2.75ProLys: 2.75 ± 0.028
4.173ProLeu: 4.173 ± 0.018
1.175ProMet: 1.175 ± 0.011
2.433ProAsn: 2.433 ± 0.016
4.144ProPro: 4.144 ± 0.029
2.333ProGln: 2.333 ± 0.018
2.62ProArg: 2.62 ± 0.016
4.843ProSer: 4.843 ± 0.029
3.194ProThr: 3.194 ± 0.019
3.227ProVal: 3.227 ± 0.019
0.475ProTrp: 0.475 ± 0.007
1.431ProTyr: 1.431 ± 0.013
0.0ProXaa: 0.0 ± 0.0
Gln
3.179GlnAla: 3.179 ± 0.021
0.796GlnCys: 0.796 ± 0.015
1.652GlnAsp: 1.652 ± 0.012
2.489GlnGlu: 2.489 ± 0.018
1.749GlnPhe: 1.749 ± 0.012
2.051GlnGly: 2.051 ± 0.016
1.148GlnHis: 1.148 ± 0.01
2.659GlnIle: 2.659 ± 0.016
2.626GlnLys: 2.626 ± 0.016
4.345GlnLeu: 4.345 ± 0.023
1.419GlnMet: 1.419 ± 0.013
2.167GlnAsn: 2.167 ± 0.014
2.744GlnPro: 2.744 ± 0.024
3.495GlnGln: 3.495 ± 0.043
2.858GlnArg: 2.858 ± 0.021
3.062GlnSer: 3.062 ± 0.017
2.456GlnThr: 2.456 ± 0.016
2.393GlnVal: 2.393 ± 0.015
0.477GlnTrp: 0.477 ± 0.007
1.171GlnTyr: 1.171 ± 0.009
0.001GlnXaa: 0.001 ± 0.0
Arg
3.949ArgAla: 3.949 ± 0.021
1.08ArgCys: 1.08 ± 0.013
3.163ArgAsp: 3.163 ± 0.019
3.737ArgGlu: 3.737 ± 0.022
2.285ArgPhe: 2.285 ± 0.013
3.089ArgGly: 3.089 ± 0.021
1.661ArgHis: 1.661 ± 0.014
3.572ArgIle: 3.572 ± 0.017
3.836ArgLys: 3.836 ± 0.019
5.477ArgLeu: 5.477 ± 0.024
1.6ArgMet: 1.6 ± 0.012
2.729ArgAsn: 2.729 ± 0.015
2.9ArgPro: 2.9 ± 0.016
2.914ArgGln: 2.914 ± 0.019
5.004ArgArg: 5.004 ± 0.038
4.155ArgSer: 4.155 ± 0.021
2.933ArgThr: 2.933 ± 0.016
3.283ArgVal: 3.283 ± 0.019
0.637ArgTrp: 0.637 ± 0.007
1.679ArgTyr: 1.679 ± 0.012
0.001ArgXaa: 0.001 ± 0.0
Ser
5.709SerAla: 5.709 ± 0.026
1.342SerCys: 1.342 ± 0.013
4.228SerAsp: 4.228 ± 0.023
4.535SerGlu: 4.535 ± 0.025
2.977SerPhe: 2.977 ± 0.018
4.809SerGly: 4.809 ± 0.025
1.762SerHis: 1.762 ± 0.013
4.339SerIle: 4.339 ± 0.021
4.022SerLys: 4.022 ± 0.024
6.523SerLeu: 6.523 ± 0.026
1.861SerMet: 1.861 ± 0.012
3.675SerAsn: 3.675 ± 0.014
4.345SerPro: 4.345 ± 0.031
3.205SerGln: 3.205 ± 0.023
4.274SerArg: 4.274 ± 0.02
8.493SerSer: 8.493 ± 0.051
5.101SerThr: 5.101 ± 0.034
4.346SerVal: 4.346 ± 0.021
0.745SerTrp: 0.745 ± 0.007
2.101SerTyr: 2.101 ± 0.017
0.001SerXaa: 0.001 ± 0.0
Thr
4.174ThrAla: 4.174 ± 0.02
1.023ThrCys: 1.023 ± 0.013
2.745ThrAsp: 2.745 ± 0.016
3.234ThrGlu: 3.234 ± 0.02
2.094ThrPhe: 2.094 ± 0.013
3.305ThrGly: 3.305 ± 0.017
1.148ThrHis: 1.148 ± 0.009
3.262ThrIle: 3.262 ± 0.015
2.833ThrLys: 2.833 ± 0.016
4.545ThrLeu: 4.545 ± 0.02
1.316ThrMet: 1.316 ± 0.011
2.496ThrAsn: 2.496 ± 0.015
3.282ThrPro: 3.282 ± 0.019
2.162ThrGln: 2.162 ± 0.016
2.725ThrArg: 2.725 ± 0.014
4.768ThrSer: 4.768 ± 0.029
4.096ThrThr: 4.096 ± 0.044
3.607ThrVal: 3.607 ± 0.019
0.535ThrTrp: 0.535 ± 0.006
1.487ThrTyr: 1.487 ± 0.011
0.001ThrXaa: 0.001 ± 0.0
Val
4.68ValAla: 4.68 ± 0.021
1.223ValCys: 1.223 ± 0.012
3.684ValAsp: 3.684 ± 0.019
4.067ValGlu: 4.067 ± 0.022
2.443ValPhe: 2.443 ± 0.014
3.397ValGly: 3.397 ± 0.018
1.484ValHis: 1.484 ± 0.011
3.378ValIle: 3.378 ± 0.016
3.382ValLys: 3.382 ± 0.024
5.035ValLeu: 5.035 ± 0.025
1.377ValMet: 1.377 ± 0.01
2.625ValAsn: 2.625 ± 0.013
3.117ValPro: 3.117 ± 0.018
2.581ValGln: 2.581 ± 0.014
3.301ValArg: 3.301 ± 0.018
4.284ValSer: 4.284 ± 0.019
3.262ValThr: 3.262 ± 0.018
4.082ValVal: 4.082 ± 0.022
0.639ValTrp: 0.639 ± 0.008
1.731ValTyr: 1.731 ± 0.01
0.001ValXaa: 0.001 ± 0.0
Trp
0.688TrpAla: 0.688 ± 0.007
0.188TrpCys: 0.188 ± 0.004
0.547TrpAsp: 0.547 ± 0.006
0.589TrpGlu: 0.589 ± 0.007
0.442TrpPhe: 0.442 ± 0.006
0.511TrpGly: 0.511 ± 0.006
0.263TrpHis: 0.263 ± 0.004
0.688TrpIle: 0.688 ± 0.007
0.754TrpLys: 0.754 ± 0.008
1.052TrpLeu: 1.052 ± 0.01
0.362TrpMet: 0.362 ± 0.005
0.589TrpAsn: 0.589 ± 0.006
0.46TrpPro: 0.46 ± 0.006
0.481TrpGln: 0.481 ± 0.006
0.728TrpArg: 0.728 ± 0.007
0.773TrpSer: 0.773 ± 0.008
0.685TrpThr: 0.685 ± 0.007
0.565TrpVal: 0.565 ± 0.007
0.171TrpTrp: 0.171 ± 0.003
0.322TrpTyr: 0.322 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.929TyrAla: 1.929 ± 0.013
0.657TyrCys: 0.657 ± 0.008
1.744TyrAsp: 1.744 ± 0.012
1.863TyrGlu: 1.863 ± 0.012
1.411TyrPhe: 1.411 ± 0.011
1.894TyrGly: 1.894 ± 0.014
0.718TyrHis: 0.718 ± 0.007
1.527TyrIle: 1.527 ± 0.01
1.555TyrLys: 1.555 ± 0.011
2.621TyrLeu: 2.621 ± 0.014
0.711TyrMet: 0.711 ± 0.007
1.381TyrAsn: 1.381 ± 0.01
1.314TyrPro: 1.314 ± 0.012
1.193TyrGln: 1.193 ± 0.01
1.731TyrArg: 1.731 ± 0.012
2.253TyrSer: 2.253 ± 0.016
1.489TyrThr: 1.489 ± 0.01
1.705TyrVal: 1.705 ± 0.011
0.378TyrTrp: 0.378 ± 0.006
1.177TyrTyr: 1.177 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.074XaaXaa: 0.074 ± 0.007
Statistics based on 33981 proteins (15129975 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski