Amino acid dipeptide frequency for Paragonimus westermani

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
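The counting described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes that dipeptides are counted as overlapping residue pairs within each protein (never across protein boundaries), that sequences use one-letter codes, and that any non-standard letter is mapped to X (Xaa). The names `dipeptide_per_mille` and `representation` are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(value_per_mille):
    """Classify a frequency against the uniform 2.5 per mille expectation."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MALLS", "LLK"])
```

With the values from the table, `representation(5.844)` (AlaAla) gives "over" and `representation(0.17)` (TrpTrp) gives "under", which would correspond to the red and blue markings if the colouring uses the uniform expectation as its threshold.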

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
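As a small worked example of this unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


# AlaAla is listed as 5.844 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(5.844)
ala_ala_percent = per_mille_to_percent(5.844)
```

Here `ala_ala_fraction` is about 0.005844 and `ala_ala_percent` about 0.5844, so AlaAla makes up roughly 0.58% of all dipeptides.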

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.844 ± 0.062
AlaCys: 1.503 ± 0.022
AlaAsp: 3.764 ± 0.032
AlaGlu: 3.961 ± 0.038
AlaPhe: 2.618 ± 0.024
AlaGly: 3.597 ± 0.031
AlaHis: 1.833 ± 0.018
AlaIle: 3.311 ± 0.025
AlaLys: 2.897 ± 0.027
AlaLeu: 6.346 ± 0.043
AlaMet: 1.411 ± 0.018
AlaAsn: 2.951 ± 0.025
AlaPro: 3.124 ± 0.031
AlaGln: 2.695 ± 0.02
AlaArg: 3.875 ± 0.029
AlaSer: 6.214 ± 0.045
AlaThr: 4.271 ± 0.029
AlaVal: 5.025 ± 0.033
AlaTrp: 0.719 ± 0.011
AlaTyr: 1.942 ± 0.019
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.45 ± 0.018
CysCys: 0.568 ± 0.013
CysAsp: 1.113 ± 0.016
CysGlu: 1.087 ± 0.014
CysPhe: 0.898 ± 0.013
CysGly: 1.349 ± 0.02
CysHis: 0.603 ± 0.011
CysIle: 1.033 ± 0.014
CysLys: 0.852 ± 0.016
CysLeu: 2.475 ± 0.026
CysMet: 0.417 ± 0.009
CysAsn: 0.778 ± 0.013
CysPro: 1.245 ± 0.018
CysGln: 0.846 ± 0.012
CysArg: 1.336 ± 0.019
CysSer: 2.069 ± 0.024
CysThr: 1.288 ± 0.015
CysVal: 1.539 ± 0.02
CysTrp: 0.268 ± 0.006
CysTyr: 0.573 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.52 ± 0.023
AspCys: 1.156 ± 0.016
AspAsp: 3.043 ± 0.04
AspGlu: 3.576 ± 0.032
AspPhe: 2.037 ± 0.019
AspGly: 3.092 ± 0.033
AspHis: 1.37 ± 0.015
AspIle: 2.404 ± 0.023
AspLys: 2.101 ± 0.024
AspLeu: 5.352 ± 0.039
AspMet: 1.074 ± 0.016
AspAsn: 1.92 ± 0.019
AspPro: 2.828 ± 0.024
AspGln: 2.123 ± 0.023
AspArg: 3.392 ± 0.027
AspSer: 4.759 ± 0.033
AspThr: 2.868 ± 0.026
AspVal: 3.585 ± 0.03
AspTrp: 0.767 ± 0.013
AspTyr: 1.557 ± 0.022
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.067 ± 0.036
GluCys: 1.054 ± 0.015
GluAsp: 3.158 ± 0.029
GluGlu: 3.969 ± 0.044
GluPhe: 2.059 ± 0.02
GluGly: 2.212 ± 0.025
GluHis: 1.593 ± 0.016
GluIle: 2.531 ± 0.024
GluLys: 2.877 ± 0.032
GluLeu: 6.284 ± 0.055
GluMet: 1.211 ± 0.015
GluAsn: 2.47 ± 0.022
GluPro: 2.606 ± 0.024
GluGln: 2.737 ± 0.03
GluArg: 3.71 ± 0.038
GluSer: 4.327 ± 0.032
GluThr: 3.292 ± 0.027
GluVal: 3.398 ± 0.031
GluTrp: 0.591 ± 0.011
GluTyr: 1.441 ± 0.017
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.516 ± 0.024
PheCys: 0.979 ± 0.013
PheAsp: 2.141 ± 0.02
PheGlu: 1.986 ± 0.022
PhePhe: 1.469 ± 0.018
PheGly: 2.451 ± 0.027
PheHis: 1.114 ± 0.015
PheIle: 1.926 ± 0.02
PheLys: 1.482 ± 0.018
PheLeu: 3.765 ± 0.028
PheMet: 0.759 ± 0.013
PheAsn: 1.539 ± 0.019
PhePro: 1.951 ± 0.021
PheGln: 1.552 ± 0.015
PheArg: 2.392 ± 0.022
PheSer: 3.398 ± 0.029
PheThr: 2.479 ± 0.023
PheVal: 2.78 ± 0.021
PheTrp: 0.464 ± 0.01
PheTyr: 1.137 ± 0.014
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.252 ± 0.032
GlyCys: 1.2 ± 0.015
GlyAsp: 2.789 ± 0.029
GlyGlu: 2.66 ± 0.031
GlyPhe: 2.157 ± 0.021
GlyGly: 3.334 ± 0.041
GlyHis: 1.567 ± 0.018
GlyIle: 2.589 ± 0.03
GlyLys: 2.406 ± 0.026
GlyLeu: 5.55 ± 0.039
GlyMet: 1.123 ± 0.014
GlyAsn: 2.068 ± 0.021
GlyPro: 2.842 ± 0.056
GlyGln: 2.294 ± 0.023
GlyArg: 3.429 ± 0.027
GlySer: 5.167 ± 0.042
GlyThr: 3.355 ± 0.029
GlyVal: 3.5 ± 0.026
GlyTrp: 0.634 ± 0.011
GlyTyr: 1.565 ± 0.023
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.707 ± 0.018
HisCys: 0.703 ± 0.012
HisAsp: 1.102 ± 0.014
HisGlu: 1.336 ± 0.016
HisPhe: 1.174 ± 0.017
HisGly: 1.495 ± 0.018
HisHis: 0.953 ± 0.018
HisIle: 1.333 ± 0.015
HisLys: 1.078 ± 0.013
HisLeu: 3.206 ± 0.025
HisMet: 0.587 ± 0.01
HisAsn: 1.047 ± 0.014
HisPro: 1.86 ± 0.02
HisGln: 1.274 ± 0.017
HisArg: 1.929 ± 0.02
HisSer: 2.916 ± 0.029
HisThr: 1.75 ± 0.021
HisVal: 1.88 ± 0.02
HisTrp: 0.395 ± 0.008
HisTyr: 0.781 ± 0.012
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.119 ± 0.023
IleCys: 1.138 ± 0.016
IleAsp: 2.499 ± 0.022
IleGlu: 2.581 ± 0.024
IlePhe: 1.844 ± 0.021
IleGly: 2.661 ± 0.025
IleHis: 1.421 ± 0.017
IleIle: 2.259 ± 0.124
IleLys: 1.996 ± 0.024
IleLeu: 4.595 ± 0.033
IleMet: 0.924 ± 0.014
IleAsn: 1.966 ± 0.021
IlePro: 2.933 ± 0.025
IleGln: 2.025 ± 0.022
IleArg: 3.27 ± 0.024
IleSer: 4.279 ± 0.029
IleThr: 2.882 ± 0.026
IleVal: 2.996 ± 0.025
IleTrp: 0.56 ± 0.01
IleTyr: 1.398 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.784 ± 0.025
LysCys: 0.925 ± 0.015
LysAsp: 2.022 ± 0.024
LysGlu: 2.525 ± 0.026
LysPhe: 1.496 ± 0.017
LysGly: 1.74 ± 0.035
LysHis: 1.321 ± 0.016
LysIle: 2.009 ± 0.022
LysLys: 2.332 ± 0.029
LysLeu: 4.896 ± 0.037
LysMet: 0.988 ± 0.015
LysAsn: 1.662 ± 0.019
LysPro: 2.734 ± 0.024
LysGln: 2.246 ± 0.022
LysArg: 3.423 ± 0.032
LysSer: 3.689 ± 0.031
LysThr: 2.67 ± 0.024
LysVal: 2.534 ± 0.025
LysTrp: 0.501 ± 0.011
LysTyr: 1.182 ± 0.021
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.105 ± 0.047
LeuCys: 2.236 ± 0.023
LeuAsp: 5.463 ± 0.036
LeuGlu: 5.644 ± 0.051
LeuPhe: 4.166 ± 0.03
LeuGly: 4.983 ± 0.033
LeuHis: 2.927 ± 0.026
LeuIle: 5.048 ± 0.035
LeuLys: 4.61 ± 0.034
LeuLeu: 10.846 ± 0.071
LeuMet: 1.968 ± 0.018
LeuAsn: 4.444 ± 0.034
LeuPro: 5.987 ± 0.042
LeuGln: 4.448 ± 0.036
LeuArg: 6.674 ± 0.04
LeuSer: 9.372 ± 0.05
LeuThr: 6.492 ± 0.037
LeuVal: 6.376 ± 0.037
LeuTrp: 1.074 ± 0.016
LeuTyr: 2.469 ± 0.025
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.408 ± 0.016
MetCys: 0.445 ± 0.009
MetAsp: 1.277 ± 0.016
MetGlu: 1.324 ± 0.02
MetPhe: 0.818 ± 0.014
MetGly: 1.015 ± 0.017
MetHis: 0.587 ± 0.01
MetIle: 0.919 ± 0.012
MetLys: 0.988 ± 0.014
MetLeu: 2.084 ± 0.022
MetMet: 0.446 ± 0.01
MetAsn: 0.979 ± 0.013
MetPro: 1.054 ± 0.014
MetGln: 0.915 ± 0.014
MetArg: 1.283 ± 0.016
MetSer: 1.671 ± 0.017
MetThr: 1.205 ± 0.014
MetVal: 1.271 ± 0.014
MetTrp: 0.21 ± 0.007
MetTyr: 0.499 ± 0.01
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.893 ± 0.024
AsnCys: 0.92 ± 0.014
AsnAsp: 1.96 ± 0.021
AsnGlu: 2.359 ± 0.023
AsnPhe: 1.568 ± 0.019
AsnGly: 2.609 ± 0.024
AsnHis: 1.183 ± 0.015
AsnIle: 1.896 ± 0.019
AsnLys: 1.767 ± 0.018
AsnLeu: 4.266 ± 0.031
AsnMet: 0.883 ± 0.014
AsnAsn: 1.627 ± 0.02
AsnPro: 2.487 ± 0.025
AsnGln: 1.884 ± 0.017
AsnArg: 2.698 ± 0.023
AsnSer: 3.899 ± 0.037
AsnThr: 2.566 ± 0.021
AsnVal: 2.791 ± 0.023
AsnTrp: 0.519 ± 0.01
AsnTyr: 1.192 ± 0.015
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 3.589 ± 0.033
ProCys: 1.005 ± 0.015
ProAsp: 3.059 ± 0.024
ProGlu: 3.077 ± 0.028
ProPhe: 2.047 ± 0.021
ProGly: 3.213 ± 0.063
ProHis: 1.524 ± 0.019
ProIle: 2.792 ± 0.024
ProLys: 2.409 ± 0.023
ProLeu: 5.042 ± 0.032
ProMet: 1.058 ± 0.014
ProAsn: 2.66 ± 0.029
ProPro: 4.187 ± 0.047
ProGln: 2.084 ± 0.019
ProArg: 2.957 ± 0.028
ProSer: 5.906 ± 0.048
ProThr: 4.247 ± 0.038
ProVal: 4.376 ± 0.033
ProTrp: 0.567 ± 0.01
ProTyr: 1.404 ± 0.02
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.777 ± 0.025
GlnCys: 0.894 ± 0.014
GlnAsp: 1.555 ± 0.015
GlnGlu: 1.963 ± 0.025
GlnPhe: 1.69 ± 0.017
GlnGly: 1.506 ± 0.022
GlnHis: 1.319 ± 0.017
GlnIle: 2.016 ± 0.02
GlnLys: 1.927 ± 0.022
GlnLeu: 5.566 ± 0.042
GlnMet: 1.007 ± 0.014
GlnAsn: 1.706 ± 0.018
GlnPro: 2.876 ± 0.026
GlnGln: 2.561 ± 0.048
GlnArg: 2.814 ± 0.025
GlnSer: 3.936 ± 0.034
GlnThr: 2.946 ± 0.031
GlnVal: 2.384 ± 0.02
GlnTrp: 0.504 ± 0.009
GlnTyr: 1.022 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.911 ± 0.03
ArgCys: 1.382 ± 0.021
ArgAsp: 2.747 ± 0.024
ArgGlu: 3.249 ± 0.031
ArgPhe: 2.602 ± 0.024
ArgGly: 2.921 ± 0.032
ArgHis: 1.873 ± 0.019
ArgIle: 3.315 ± 0.025
ArgLys: 3.108 ± 0.026
ArgLeu: 7.627 ± 0.05
ArgMet: 1.47 ± 0.014
ArgAsn: 2.531 ± 0.021
ArgPro: 3.429 ± 0.03
ArgGln: 2.858 ± 0.025
ArgArg: 5.197 ± 0.036
ArgSer: 5.66 ± 0.04
ArgThr: 3.846 ± 0.029
ArgVal: 3.769 ± 0.033
ArgTrp: 0.822 ± 0.014
ArgTyr: 1.734 ± 0.019
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.435 ± 0.046
SerCys: 1.856 ± 0.019
SerAsp: 5.106 ± 0.034
SerGlu: 4.922 ± 0.032
SerPhe: 3.134 ± 0.025
SerGly: 5.652 ± 0.043
SerHis: 2.469 ± 0.028
SerIle: 3.959 ± 0.029
SerLys: 3.841 ± 0.034
SerLeu: 8.516 ± 0.052
SerMet: 1.788 ± 0.018
SerAsn: 4.024 ± 0.033
SerPro: 5.589 ± 0.05
SerGln: 3.697 ± 0.03
SerArg: 5.335 ± 0.036
SerSer: 10.622 ± 0.092
SerThr: 6.744 ± 0.05
SerVal: 6.644 ± 0.04
SerTrp: 0.952 ± 0.012
SerTyr: 2.054 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.534 ± 0.032
ThrCys: 1.306 ± 0.019
ThrAsp: 3.704 ± 0.028
ThrGlu: 3.737 ± 0.034
ThrPhe: 2.242 ± 0.021
ThrGly: 3.953 ± 0.03
ThrHis: 1.693 ± 0.018
ThrIle: 2.843 ± 0.025
ThrLys: 2.686 ± 0.026
ThrLeu: 5.646 ± 0.035
ThrMet: 1.234 ± 0.015
ThrAsn: 2.932 ± 0.029
ThrPro: 3.687 ± 0.032
ThrGln: 2.527 ± 0.025
ThrArg: 3.453 ± 0.022
ThrSer: 6.261 ± 0.046
ThrThr: 4.655 ± 0.046
ThrVal: 4.783 ± 0.034
ThrTrp: 0.626 ± 0.012
ThrTyr: 1.615 ± 0.018
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.504 ± 0.035
ValCys: 1.645 ± 0.022
ValAsp: 4.085 ± 0.03
ValGlu: 3.654 ± 0.028
ValPhe: 2.515 ± 0.027
ValGly: 3.637 ± 0.03
ValHis: 2.034 ± 0.018
ValIle: 3.206 ± 0.029
ValLys: 2.737 ± 0.027
ValLeu: 6.109 ± 0.035
ValMet: 1.24 ± 0.015
ValAsn: 3.037 ± 0.023
ValPro: 3.753 ± 0.031
ValGln: 2.663 ± 0.026
ValArg: 4.162 ± 0.03
ValSer: 5.914 ± 0.035
ValThr: 4.275 ± 0.033
ValVal: 4.37 ± 0.041
ValTrp: 0.751 ± 0.012
ValTyr: 2.01 ± 0.02
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.657 ± 0.011
TrpCys: 0.26 ± 0.008
TrpAsp: 0.577 ± 0.011
TrpGlu: 0.526 ± 0.011
TrpPhe: 0.523 ± 0.01
TrpGly: 0.43 ± 0.009
TrpHis: 0.324 ± 0.008
TrpIle: 0.729 ± 0.013
TrpLys: 0.552 ± 0.011
TrpLeu: 1.344 ± 0.018
TrpMet: 0.258 ± 0.007
TrpAsn: 0.578 ± 0.011
TrpPro: 0.609 ± 0.012
TrpGln: 0.435 ± 0.009
TrpArg: 0.785 ± 0.011
TrpSer: 1.097 ± 0.016
TrpThr: 0.74 ± 0.012
TrpVal: 0.564 ± 0.011
TrpTrp: 0.17 ± 0.006
TrpTyr: 0.31 ± 0.008
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.833 ± 0.018
TyrCys: 0.612 ± 0.011
TyrAsp: 1.359 ± 0.021
TyrGlu: 1.449 ± 0.019
TyrPhe: 1.222 ± 0.016
TyrGly: 1.633 ± 0.025
TyrHis: 0.798 ± 0.011
TyrIle: 1.228 ± 0.017
TyrLys: 1.034 ± 0.016
TyrLeu: 2.818 ± 0.029
TyrMet: 0.551 ± 0.01
TyrAsn: 1.046 ± 0.015
TyrPro: 1.421 ± 0.018
TyrGln: 1.105 ± 0.014
TyrArg: 1.938 ± 0.02
TyrSer: 2.182 ± 0.021
TyrThr: 1.547 ± 0.016
TyrVal: 1.738 ± 0.019
TyrTrp: 0.373 ± 0.008
TyrTyr: 0.869 ± 0.015
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12535 proteins (5687671 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
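A protein-level bootstrap of this kind can be sketched as below. The source states only that bootstrapping with 100 replicates at the protein level was used; this sketch additionally assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy_proteins = ["MALLS", "LLK", "AAGLL", "KKLLA", "MSTV"]
leu_leu_error = bootstrap_error(toy_proteins, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.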

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski