Amino acid dipeptide frequency for Nippostrongylus brasiliensis (Rat hookworm)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones in blue, for better visibility.
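As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping dipeptides in a list of protein sequences and compares them with the uniform expectation. It assumes one-letter amino acid codes, overlapping windows, and that every non-standard symbol is mapped to X (Xaa); the function name dipeptide_permille and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input, not real N. brasiliensis sequences.
toy = ["MALLSA", "LLAK"]
freqs = dipeptide_permille(toy)

# Uniform expectation for the 400 standard dipeptides, in per mille.
uniform = 1000.0 / len(STANDARD) ** 2

# Dipeptides that are more frequent than the uniform expectation.
over = sorted(dp for dp, f in freqs.items() if f > uniform)
```

With real proteome data, the same comparison against the uniform value would separate overrepresented from underrepresented dipeptides; whether the table's colouring uses exactly this threshold is an assumption.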

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
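A small worked conversion may help when reading the table. The sketch below takes the AlaAla value from the table (6.529‰), converts it to a fraction and a percentage, and compares it with the uniform expectation of 1/400; the helper name permille_to_fraction is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3


ala_ala_permille = 6.529                   # AlaAla value from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = ala_ala_fraction * 100   # about 0.65 %

uniform_fraction = 1 / 400                 # 0.25 % for 400 dipeptides
enrichment = ala_ala_fraction / uniform_fraction
print(f"AlaAla: {ala_ala_percent:.4f} % ({enrichment:.2f}x the uniform expectation)")
```

Under these assumptions, AlaAla is roughly 2.6 times more frequent than a uniform distribution over 400 dipeptides would predict.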

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.529 ± 0.047
AlaCys: 1.309 ± 0.016
AlaAsp: 3.798 ± 0.026
AlaGlu: 4.854 ± 0.038
AlaPhe: 2.874 ± 0.024
AlaGly: 3.91 ± 0.034
AlaHis: 1.608 ± 0.018
AlaIle: 3.876 ± 0.029
AlaLys: 4.072 ± 0.034
AlaLeu: 6.715 ± 0.033
AlaMet: 1.91 ± 0.016
AlaAsn: 2.691 ± 0.019
AlaPro: 3.53 ± 0.032
AlaGln: 2.72 ± 0.022
AlaArg: 3.972 ± 0.027
AlaSer: 5.175 ± 0.035
AlaThr: 4.069 ± 0.025
AlaVal: 5.553 ± 0.029
AlaTrp: 0.673 ± 0.011
AlaTyr: 1.892 ± 0.018
AlaXaa: 0.003 ± 0.001
Cys
CysAla: 1.461 ± 0.017
CysCys: 0.579 ± 0.013
CysAsp: 1.237 ± 0.019
CysGlu: 1.304 ± 0.019
CysPhe: 0.944 ± 0.012
CysGly: 1.439 ± 0.018
CysHis: 0.531 ± 0.009
CysIle: 1.067 ± 0.016
CysLys: 1.006 ± 0.013
CysLeu: 1.815 ± 0.019
CysMet: 0.507 ± 0.009
CysAsn: 0.82 ± 0.011
CysPro: 1.206 ± 0.02
CysGln: 0.806 ± 0.012
CysArg: 1.352 ± 0.019
CysSer: 1.902 ± 0.021
CysThr: 1.129 ± 0.014
CysVal: 1.457 ± 0.019
CysTrp: 0.238 ± 0.007
CysTyr: 0.646 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.805 ± 0.027
AspCys: 1.074 ± 0.017
AspAsp: 4.2 ± 0.034
AspGlu: 4.701 ± 0.032
AspPhe: 2.324 ± 0.019
AspGly: 3.691 ± 0.029
AspHis: 1.245 ± 0.017
AspIle: 2.864 ± 0.023
AspLys: 2.628 ± 0.025
AspLeu: 4.671 ± 0.025
AspMet: 1.347 ± 0.015
AspAsn: 2.028 ± 0.022
AspPro: 2.51 ± 0.024
AspGln: 1.922 ± 0.019
AspArg: 3.2 ± 0.023
AspSer: 3.976 ± 0.029
AspThr: 2.513 ± 0.021
AspVal: 4.118 ± 0.027
AspTrp: 0.679 ± 0.01
AspTyr: 1.778 ± 0.016
AspXaa: 0.002 ± 0.001
Glu
GluAla: 4.62 ± 0.032
GluCys: 1.277 ± 0.021
GluAsp: 3.817 ± 0.032
GluGlu: 6.164 ± 0.058
GluPhe: 2.4 ± 0.02
GluGly: 3.259 ± 0.027
GluHis: 1.537 ± 0.017
GluIle: 3.381 ± 0.027
GluLys: 5.158 ± 0.046
GluLeu: 5.873 ± 0.041
GluMet: 1.973 ± 0.017
GluAsn: 2.982 ± 0.023
GluPro: 2.575 ± 0.027
GluGln: 2.854 ± 0.026
GluArg: 4.402 ± 0.031
GluSer: 4.213 ± 0.031
GluThr: 3.334 ± 0.027
GluVal: 4.191 ± 0.03
GluTrp: 0.825 ± 0.013
GluTyr: 1.803 ± 0.019
GluXaa: 0.003 ± 0.001
Phe
PheAla: 2.898 ± 0.022
PheCys: 1.058 ± 0.014
PheAsp: 2.476 ± 0.022
PheGlu: 2.429 ± 0.021
PhePhe: 2.076 ± 0.022
PheGly: 2.692 ± 0.021
PheHis: 1.056 ± 0.015
PheIle: 2.255 ± 0.021
PheLys: 1.839 ± 0.019
PheLeu: 3.978 ± 0.032
PheMet: 1.018 ± 0.013
PheAsn: 1.677 ± 0.018
PhePro: 1.87 ± 0.019
PheGln: 1.427 ± 0.016
PheArg: 2.33 ± 0.022
PheSer: 3.33 ± 0.023
PheThr: 2.273 ± 0.019
PheVal: 3.141 ± 0.026
PheTrp: 0.525 ± 0.011
PheTyr: 1.467 ± 0.019
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 3.896 ± 0.03
GlyCys: 1.238 ± 0.019
GlyAsp: 3.23 ± 0.027
GlyGlu: 3.623 ± 0.027
GlyPhe: 2.499 ± 0.024
GlyGly: 4.425 ± 0.052
GlyHis: 1.362 ± 0.018
GlyIle: 2.93 ± 0.022
GlyLys: 3.369 ± 0.03
GlyLeu: 4.459 ± 0.026
GlyMet: 1.53 ± 0.021
GlyAsn: 2.349 ± 0.021
GlyPro: 2.403 ± 0.036
GlyGln: 2.152 ± 0.026
GlyArg: 3.585 ± 0.028
GlySer: 4.479 ± 0.028
GlyThr: 3.181 ± 0.026
GlyVal: 3.924 ± 0.03
GlyTrp: 0.709 ± 0.011
GlyTyr: 1.944 ± 0.025
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.415 ± 0.016
HisCys: 0.578 ± 0.009
HisAsp: 1.202 ± 0.015
HisGlu: 1.399 ± 0.018
HisPhe: 1.134 ± 0.014
HisGly: 1.421 ± 0.019
HisHis: 0.82 ± 0.017
HisIle: 1.263 ± 0.015
HisLys: 1.095 ± 0.013
HisLeu: 2.343 ± 0.019
HisMet: 0.622 ± 0.01
HisAsn: 0.899 ± 0.013
HisPro: 1.305 ± 0.015
HisGln: 0.933 ± 0.013
HisArg: 1.631 ± 0.017
HisSer: 1.852 ± 0.018
HisThr: 1.177 ± 0.013
HisVal: 1.6 ± 0.015
HisTrp: 0.304 ± 0.007
HisTyr: 0.831 ± 0.013
HisXaa: 0.002 ± 0.001
Ile
IleAla: 3.981 ± 0.028
IleCys: 1.245 ± 0.016
IleAsp: 3.128 ± 0.022
IleGlu: 3.3 ± 0.026
IlePhe: 2.262 ± 0.021
IleGly: 3.133 ± 0.021
IleHis: 1.271 ± 0.014
IleIle: 2.702 ± 0.024
IleLys: 2.411 ± 0.02
IleLeu: 4.586 ± 0.032
IleMet: 1.191 ± 0.013
IleAsn: 2.001 ± 0.018
IlePro: 2.751 ± 0.021
IleGln: 1.847 ± 0.018
IleArg: 3.288 ± 0.024
IleSer: 4.1 ± 0.028
IleThr: 2.939 ± 0.025
IleVal: 3.796 ± 0.026
IleTrp: 0.581 ± 0.009
IleTyr: 1.578 ± 0.017
IleXaa: 0.001 ± 0.0
Lys
LysAla: 3.911 ± 0.033
LysCys: 1.172 ± 0.017
LysAsp: 3.049 ± 0.031
LysGlu: 4.521 ± 0.04
LysPhe: 2.021 ± 0.017
LysGly: 2.82 ± 0.025
LysHis: 1.236 ± 0.015
LysIle: 2.916 ± 0.022
LysLys: 4.841 ± 0.057
LysLeu: 4.87 ± 0.035
LysMet: 1.631 ± 0.016
LysAsn: 2.404 ± 0.022
LysPro: 2.606 ± 0.027
LysGln: 2.244 ± 0.022
LysArg: 3.855 ± 0.027
LysSer: 3.795 ± 0.028
LysThr: 3.079 ± 0.021
LysVal: 3.574 ± 0.026
LysTrp: 0.699 ± 0.011
LysTyr: 1.705 ± 0.019
LysXaa: 0.001 ± 0.001
Leu
LeuAla: 6.471 ± 0.039
LeuCys: 2.013 ± 0.018
LeuAsp: 4.701 ± 0.032
LeuGlu: 5.573 ± 0.044
LeuPhe: 3.933 ± 0.029
LeuGly: 4.436 ± 0.031
LeuHis: 2.276 ± 0.018
LeuIle: 4.598 ± 0.031
LeuLys: 5.125 ± 0.035
LeuLeu: 9.219 ± 0.052
LeuMet: 2.318 ± 0.019
LeuAsn: 3.726 ± 0.028
LeuPro: 4.82 ± 0.032
LeuGln: 3.712 ± 0.029
LeuArg: 5.902 ± 0.037
LeuSer: 7.178 ± 0.035
LeuThr: 4.829 ± 0.029
LeuVal: 5.789 ± 0.026
LeuTrp: 1.047 ± 0.013
LeuTyr: 2.599 ± 0.021
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.976 ± 0.017
MetCys: 0.533 ± 0.009
MetAsp: 1.45 ± 0.016
MetGlu: 1.838 ± 0.019
MetPhe: 1.088 ± 0.013
MetGly: 1.379 ± 0.016
MetHis: 0.561 ± 0.009
MetIle: 1.326 ± 0.015
MetLys: 1.635 ± 0.016
MetLeu: 2.386 ± 0.018
MetMet: 0.818 ± 0.013
MetAsn: 1.146 ± 0.015
MetPro: 1.2 ± 0.015
MetGln: 1.0 ± 0.013
MetArg: 1.678 ± 0.017
MetSer: 2.062 ± 0.018
MetThr: 1.462 ± 0.016
MetVal: 1.703 ± 0.016
MetTrp: 0.296 ± 0.006
MetTyr: 0.725 ± 0.012
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.907 ± 0.02
AsnCys: 0.897 ± 0.012
AsnAsp: 2.343 ± 0.02
AsnGlu: 2.66 ± 0.022
AsnPhe: 1.676 ± 0.019
AsnGly: 2.847 ± 0.023
AsnHis: 0.926 ± 0.012
AsnIle: 2.241 ± 0.019
AsnLys: 1.974 ± 0.021
AsnLeu: 3.415 ± 0.026
AsnMet: 1.042 ± 0.012
AsnAsn: 1.867 ± 0.032
AsnPro: 2.033 ± 0.021
AsnGln: 1.43 ± 0.015
AsnArg: 2.362 ± 0.02
AsnSer: 3.036 ± 0.022
AsnThr: 2.089 ± 0.019
AsnVal: 2.997 ± 0.022
AsnTrp: 0.449 ± 0.009
AsnTyr: 1.414 ± 0.018
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 3.484 ± 0.037
ProCys: 0.836 ± 0.014
ProAsp: 2.508 ± 0.024
ProGlu: 3.127 ± 0.026
ProPhe: 2.029 ± 0.019
ProGly: 2.978 ± 0.059
ProHis: 1.168 ± 0.015
ProIle: 2.476 ± 0.02
ProLys: 2.634 ± 0.022
ProLeu: 4.199 ± 0.031
ProMet: 1.149 ± 0.014
ProAsn: 1.955 ± 0.018
ProPro: 4.236 ± 0.063
ProGln: 2.035 ± 0.026
ProArg: 2.73 ± 0.023
ProSer: 4.604 ± 0.04
ProThr: 3.141 ± 0.029
ProVal: 3.293 ± 0.025
ProTrp: 0.495 ± 0.009
ProTyr: 1.441 ± 0.016
ProXaa: 0.002 ± 0.001
Gln
GlnAla: 2.548 ± 0.022
GlnCys: 0.894 ± 0.015
GlnAsp: 1.486 ± 0.015
GlnGlu: 2.305 ± 0.023
GlnPhe: 1.616 ± 0.017
GlnGly: 1.782 ± 0.019
GlnHis: 1.0 ± 0.014
GlnIle: 2.03 ± 0.018
GlnLys: 2.344 ± 0.023
GlnLeu: 4.145 ± 0.031
GlnMet: 1.215 ± 0.014
GlnAsn: 1.59 ± 0.017
GlnPro: 2.054 ± 0.024
GlnGln: 2.495 ± 0.042
GlnArg: 2.636 ± 0.022
GlnSer: 2.734 ± 0.022
GlnThr: 1.957 ± 0.016
GlnVal: 2.352 ± 0.02
GlnTrp: 0.523 ± 0.009
GlnTyr: 1.219 ± 0.016
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.969 ± 0.026
ArgCys: 1.359 ± 0.02
ArgAsp: 3.103 ± 0.027
ArgGlu: 3.967 ± 0.031
ArgPhe: 2.599 ± 0.022
ArgGly: 3.126 ± 0.025
ArgHis: 1.584 ± 0.018
ArgIle: 3.372 ± 0.024
ArgLys: 4.123 ± 0.028
ArgLeu: 5.783 ± 0.032
ArgMet: 1.695 ± 0.014
ArgAsn: 2.662 ± 0.02
ArgPro: 2.93 ± 0.027
ArgGln: 2.534 ± 0.021
ArgArg: 5.222 ± 0.043
ArgSer: 4.695 ± 0.03
ArgThr: 3.293 ± 0.023
ArgVal: 3.733 ± 0.026
ArgTrp: 0.785 ± 0.01
ArgTyr: 1.805 ± 0.018
ArgXaa: 0.002 ± 0.001
Ser
SerAla: 5.667 ± 0.037
SerCys: 1.641 ± 0.017
SerAsp: 4.293 ± 0.032
SerGlu: 4.613 ± 0.03
SerPhe: 3.192 ± 0.027
SerGly: 4.645 ± 0.03
SerHis: 1.726 ± 0.019
SerIle: 3.916 ± 0.03
SerLys: 3.902 ± 0.028
SerLeu: 6.78 ± 0.035
SerMet: 1.95 ± 0.018
SerAsn: 3.034 ± 0.024
SerPro: 4.057 ± 0.043
SerGln: 2.775 ± 0.021
SerArg: 4.584 ± 0.029
SerSer: 7.896 ± 0.054
SerThr: 5.005 ± 0.042
SerVal: 5.144 ± 0.031
SerTrp: 0.832 ± 0.012
SerTyr: 2.121 ± 0.023
SerXaa: 0.003 ± 0.001
Thr
ThrAla: 4.362 ± 0.032
ThrCys: 1.199 ± 0.017
ThrAsp: 2.782 ± 0.023
ThrGlu: 3.218 ± 0.025
ThrPhe: 2.241 ± 0.02
ThrGly: 3.099 ± 0.028
ThrHis: 1.166 ± 0.014
ThrIle: 3.112 ± 0.023
ThrLys: 2.869 ± 0.025
ThrLeu: 4.844 ± 0.028
ThrMet: 1.426 ± 0.015
ThrAsn: 2.173 ± 0.018
ThrPro: 3.157 ± 0.033
ThrGln: 1.882 ± 0.019
ThrArg: 2.988 ± 0.023
ThrSer: 4.563 ± 0.037
ThrThr: 4.045 ± 0.043
ThrVal: 4.265 ± 0.03
ThrTrp: 0.619 ± 0.011
ThrTyr: 1.541 ± 0.018
ThrXaa: 0.002 ± 0.0
Val
ValAla: 5.161 ± 0.033
ValCys: 1.509 ± 0.016
ValAsp: 4.103 ± 0.031
ValGlu: 4.589 ± 0.036
ValPhe: 3.021 ± 0.024
ValGly: 3.675 ± 0.026
ValHis: 1.693 ± 0.018
ValIle: 3.624 ± 0.025
ValLys: 3.626 ± 0.029
ValLeu: 6.353 ± 0.035
ValMet: 1.739 ± 0.019
ValAsn: 2.724 ± 0.024
ValPro: 3.399 ± 0.027
ValGln: 2.583 ± 0.02
ValArg: 3.969 ± 0.028
ValSer: 5.03 ± 0.031
ValThr: 3.733 ± 0.027
ValVal: 5.58 ± 0.04
ValTrp: 0.761 ± 0.011
ValTyr: 2.009 ± 0.019
ValXaa: 0.002 ± 0.0
Trp
TrpAla: 0.7 ± 0.011
TrpCys: 0.238 ± 0.007
TrpAsp: 0.592 ± 0.01
TrpGlu: 0.634 ± 0.01
TrpPhe: 0.492 ± 0.009
TrpGly: 0.513 ± 0.01
TrpHis: 0.269 ± 0.007
TrpIle: 0.688 ± 0.011
TrpLys: 0.798 ± 0.012
TrpLeu: 1.123 ± 0.015
TrpMet: 0.403 ± 0.008
TrpAsn: 0.57 ± 0.01
TrpPro: 0.485 ± 0.01
TrpGln: 0.472 ± 0.009
TrpArg: 0.788 ± 0.011
TrpSer: 0.896 ± 0.012
TrpThr: 0.771 ± 0.012
TrpVal: 0.648 ± 0.011
TrpTrp: 0.193 ± 0.005
TrpTyr: 0.374 ± 0.008
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.969 ± 0.018
TyrCys: 0.798 ± 0.012
TyrAsp: 1.775 ± 0.018
TyrGlu: 1.818 ± 0.017
TyrPhe: 1.395 ± 0.014
TyrGly: 2.015 ± 0.024
TyrHis: 0.789 ± 0.013
TyrIle: 1.455 ± 0.017
TyrLys: 1.499 ± 0.016
TyrLeu: 2.709 ± 0.021
TyrMet: 0.777 ± 0.013
TyrAsn: 1.306 ± 0.018
TyrPro: 1.393 ± 0.018
TyrGln: 1.101 ± 0.015
TyrArg: 1.864 ± 0.019
TyrSer: 2.258 ± 0.022
TyrThr: 1.534 ± 0.017
TyrVal: 2.019 ± 0.018
TyrTrp: 0.416 ± 0.008
TyrTyr: 1.164 ± 0.017
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.002 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.002 ± 0.001
XaaGlu: 0.002 ± 0.001
XaaPhe: 0.001 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.002 ± 0.001
XaaLys: 0.001 ± 0.0
XaaLeu: 0.003 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.002 ± 0.001
XaaPro: 0.003 ± 0.001
XaaGln: 0.001 ± 0.0
XaaArg: 0.002 ± 0.001
XaaSer: 0.003 ± 0.001
XaaThr: 0.002 ± 0.001
XaaVal: 0.003 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.072 ± 0.028
Statistics based on 22636 proteins (6447329 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
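One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is an illustration under stated assumptions, not the Proteome-pI implementation; the function names, the fixed seed, and the use of the standard deviation as the reported error are all assumptions.

```python
import random
from collections import Counter
from statistics import mean, stdev


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) by resampling whole proteins."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not residues, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input, not real N. brasiliensis sequences.
toy = ["MALLSA", "LLAK", "AAAA"]
ll_mean, ll_error = bootstrap_permille(toy, "LL")
```

Resampling at the protein level keeps residues from the same protein together, so the error reflects variation between proteins rather than between independent positions.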

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski