Amino acid dipeptide frequency for Angiostrongylus costaricensis (Nematode worm)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
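As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name dipeptide_frequencies, the toy sequences, and the choice of overlapping windows are assumptions made for illustration; they are not taken from the Proteome-pI code. One-letter codes are used here, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Assumption: dipeptides are overlapping adjacent residue pairs and
    never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

With real data the sequences would come from a proteome FASTA file, and any non-standard residue would first be mapped to Xaa.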

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
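The uniform expectation and the over/under labelling can be sketched in Python as follows. The helper names are hypothetical, and using exactly 2.5‰ as the colouring threshold is an assumption; the page does not state the precise rule behind the red and blue marks. The observed values are copied from the table.

```python
STANDARD_AMINO_ACIDS = 20


def expected_uniform_permille(n_residues=STANDARD_AMINO_ACIDS):
    """Frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = expected_uniform_permille()       # 20 x 20 = 400 dipeptides
expected_with_xaa = expected_uniform_permille(21)  # 21 x 21 = 441 dipeptides

# A few values (per mille) taken from the table.
observed = {"LeuLeu": 9.57, "AlaAla": 5.432, "TrpTrp": 0.179, "CysCys": 0.642}
labels = {pair: classify(v, expected) for pair, v in observed.items()}
print(expected, labels)
```

A more informative baseline would multiply the two single amino acid frequencies, since rare residues such as Trp are expected to form rare dipeptides; the uniform baseline is used here only because it is the one described in the text.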

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
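For illustration, the unit conversion amounts to the following small Python sketch; the function names are hypothetical, and the example value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_leu = 9.57  # LeuLeu frequency from the table, in per mille
print(permille_to_fraction(leu_leu), permille_to_percent(leu_leu))
```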

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰), grouped by first residue; second residues are listed in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.432 ± 0.052
AlaCys: 1.314 ± 0.019
AlaAsp: 3.324 ± 0.034
AlaGlu: 4.15 ± 0.038
AlaPhe: 2.91 ± 0.029
AlaGly: 3.57 ± 0.035
AlaHis: 1.601 ± 0.02
AlaIle: 3.979 ± 0.033
AlaLys: 3.562 ± 0.033
AlaLeu: 6.409 ± 0.046
AlaMet: 1.76 ± 0.02
AlaAsn: 2.604 ± 0.026
AlaPro: 2.924 ± 0.032
AlaGln: 2.394 ± 0.03
AlaArg: 3.73 ± 0.033
AlaSer: 4.819 ± 0.035
AlaThr: 3.623 ± 0.033
AlaVal: 5.171 ± 0.035
AlaTrp: 0.641 ± 0.011
AlaTyr: 1.8 ± 0.022
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.445 ± 0.022
CysCys: 0.642 ± 0.017
CysAsp: 1.323 ± 0.025
CysGlu: 1.345 ± 0.022
CysPhe: 1.119 ± 0.017
CysGly: 1.515 ± 0.023
CysHis: 0.593 ± 0.014
CysIle: 1.236 ± 0.021
CysLys: 1.037 ± 0.019
CysLeu: 2.066 ± 0.027
CysMet: 0.541 ± 0.011
CysAsn: 0.896 ± 0.017
CysPro: 1.169 ± 0.031
CysGln: 0.815 ± 0.018
CysArg: 1.434 ± 0.025
CysSer: 2.035 ± 0.026
CysThr: 1.145 ± 0.017
CysVal: 1.551 ± 0.024
CysTrp: 0.264 ± 0.008
CysTyr: 0.704 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.34 ± 0.03
AspCys: 1.129 ± 0.022
AspAsp: 3.852 ± 0.048
AspGlu: 4.359 ± 0.039
AspPhe: 2.461 ± 0.023
AspGly: 3.66 ± 0.043
AspHis: 1.248 ± 0.018
AspIle: 3.154 ± 0.028
AspLys: 2.573 ± 0.024
AspLeu: 4.692 ± 0.034
AspMet: 1.3 ± 0.019
AspAsn: 2.022 ± 0.025
AspPro: 2.323 ± 0.026
AspGln: 1.721 ± 0.021
AspArg: 3.071 ± 0.026
AspSer: 3.866 ± 0.033
AspThr: 2.375 ± 0.022
AspVal: 4.062 ± 0.037
AspTrp: 0.674 ± 0.012
AspTyr: 1.75 ± 0.024
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.102 ± 0.039
GluCys: 1.363 ± 0.025
GluAsp: 3.565 ± 0.035
GluGlu: 5.363 ± 0.062
GluPhe: 2.463 ± 0.023
GluGly: 2.955 ± 0.028
GluHis: 1.544 ± 0.023
GluIle: 3.609 ± 0.034
GluLys: 4.752 ± 0.042
GluLeu: 5.878 ± 0.048
GluMet: 1.867 ± 0.023
GluAsn: 3.119 ± 0.032
GluPro: 2.162 ± 0.027
GluGln: 2.697 ± 0.031
GluArg: 4.33 ± 0.045
GluSer: 4.12 ± 0.034
GluThr: 3.172 ± 0.031
GluVal: 4.006 ± 0.034
GluTrp: 0.847 ± 0.016
GluTyr: 1.881 ± 0.02
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.916 ± 0.026
PheCys: 1.163 ± 0.019
PheAsp: 2.605 ± 0.024
PheGlu: 2.573 ± 0.027
PhePhe: 2.484 ± 0.029
PheGly: 2.759 ± 0.03
PheHis: 1.197 ± 0.018
PheIle: 2.653 ± 0.03
PheLys: 2.004 ± 0.022
PheLeu: 4.441 ± 0.036
PheMet: 1.109 ± 0.014
PheAsn: 1.876 ± 0.02
PhePro: 1.92 ± 0.023
PheGln: 1.57 ± 0.018
PheArg: 2.55 ± 0.023
PheSer: 3.776 ± 0.03
PheThr: 2.459 ± 0.022
PheVal: 3.345 ± 0.031
PheTrp: 0.541 ± 0.011
PheTyr: 1.692 ± 0.023
PheXaa: 0.001 ± 0.001
Gly
GlyAla: 3.544 ± 0.038
GlyCys: 1.269 ± 0.026
GlyAsp: 3.107 ± 0.032
GlyGlu: 3.49 ± 0.037
GlyPhe: 2.634 ± 0.027
GlyGly: 3.957 ± 0.056
GlyHis: 1.346 ± 0.02
GlyIle: 3.255 ± 0.031
GlyLys: 3.311 ± 0.031
GlyLeu: 4.715 ± 0.053
GlyMet: 1.426 ± 0.02
GlyAsn: 2.454 ± 0.025
GlyPro: 2.381 ± 0.049
GlyGln: 2.063 ± 0.027
GlyArg: 3.637 ± 0.035
GlySer: 4.272 ± 0.04
GlyThr: 3.137 ± 0.032
GlyVal: 3.986 ± 0.038
GlyTrp: 0.72 ± 0.015
GlyTyr: 1.955 ± 0.027
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.472 ± 0.019
HisCys: 0.647 ± 0.014
HisAsp: 1.178 ± 0.016
HisGlu: 1.429 ± 0.021
HisPhe: 1.229 ± 0.016
HisGly: 1.502 ± 0.019
HisHis: 0.796 ± 0.018
HisIle: 1.448 ± 0.018
HisLys: 1.135 ± 0.016
HisLeu: 2.554 ± 0.028
HisMet: 0.623 ± 0.012
HisAsn: 0.955 ± 0.015
HisPro: 1.333 ± 0.02
HisGln: 0.912 ± 0.015
HisArg: 1.607 ± 0.02
HisSer: 1.977 ± 0.025
HisThr: 1.206 ± 0.018
HisVal: 1.701 ± 0.019
HisTrp: 0.328 ± 0.009
HisTyr: 0.919 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.039 ± 0.034
IleCys: 1.443 ± 0.022
IleAsp: 3.31 ± 0.03
IleGlu: 3.493 ± 0.031
IlePhe: 2.657 ± 0.031
IleGly: 3.537 ± 0.035
IleHis: 1.479 ± 0.018
IleIle: 3.101 ± 0.032
IleLys: 2.622 ± 0.027
IleLeu: 5.09 ± 0.037
IleMet: 1.311 ± 0.019
IleAsn: 2.276 ± 0.022
IlePro: 2.855 ± 0.028
IleGln: 2.012 ± 0.022
IleArg: 3.609 ± 0.034
IleSer: 4.699 ± 0.034
IleThr: 3.172 ± 0.034
IleVal: 4.104 ± 0.032
IleTrp: 0.659 ± 0.013
IleTyr: 1.838 ± 0.022
IleXaa: 0.001 ± 0.001
Lys
LysAla: 3.504 ± 0.035
LysCys: 1.212 ± 0.021
LysAsp: 2.846 ± 0.029
LysGlu: 4.048 ± 0.04
LysPhe: 2.136 ± 0.022
LysGly: 2.657 ± 0.03
LysHis: 1.305 ± 0.018
LysIle: 3.155 ± 0.029
LysLys: 4.201 ± 0.044
LysLeu: 4.98 ± 0.038
LysMet: 1.522 ± 0.018
LysAsn: 2.51 ± 0.026
LysPro: 2.331 ± 0.03
LysGln: 2.191 ± 0.024
LysArg: 3.697 ± 0.034
LysSer: 3.718 ± 0.03
LysThr: 3.0 ± 0.027
LysVal: 3.596 ± 0.031
LysTrp: 0.715 ± 0.014
LysTyr: 1.837 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.37 ± 0.048
LeuCys: 2.228 ± 0.027
LeuAsp: 4.751 ± 0.034
LeuGlu: 5.674 ± 0.058
LeuPhe: 4.5 ± 0.04
LeuGly: 4.502 ± 0.036
LeuHis: 2.495 ± 0.023
LeuIle: 5.103 ± 0.042
LeuLys: 5.219 ± 0.04
LeuLeu: 9.57 ± 0.069
LeuMet: 2.378 ± 0.025
LeuAsn: 4.045 ± 0.029
LeuPro: 4.718 ± 0.044
LeuGln: 3.788 ± 0.034
LeuArg: 6.055 ± 0.042
LeuSer: 7.77 ± 0.046
LeuThr: 5.002 ± 0.037
LeuVal: 5.956 ± 0.042
LeuTrp: 1.084 ± 0.017
LeuTyr: 2.822 ± 0.026
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.755 ± 0.018
MetCys: 0.561 ± 0.011
MetAsp: 1.424 ± 0.019
MetGlu: 1.725 ± 0.019
MetPhe: 1.19 ± 0.019
MetGly: 1.296 ± 0.022
MetHis: 0.584 ± 0.012
MetIle: 1.373 ± 0.019
MetLys: 1.623 ± 0.017
MetLeu: 2.445 ± 0.025
MetMet: 0.784 ± 0.018
MetAsn: 1.234 ± 0.017
MetPro: 1.112 ± 0.017
MetGln: 0.992 ± 0.015
MetArg: 1.615 ± 0.016
MetSer: 1.982 ± 0.019
MetThr: 1.439 ± 0.022
MetVal: 1.703 ± 0.022
MetTrp: 0.311 ± 0.008
MetTyr: 0.704 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.772 ± 0.024
AsnCys: 0.929 ± 0.017
AsnAsp: 2.372 ± 0.027
AsnGlu: 2.889 ± 0.028
AsnPhe: 1.86 ± 0.023
AsnGly: 2.905 ± 0.031
AsnHis: 0.986 ± 0.015
AsnIle: 2.64 ± 0.028
AsnLys: 2.093 ± 0.023
AsnLeu: 3.679 ± 0.03
AsnMet: 1.102 ± 0.018
AsnAsn: 1.806 ± 0.022
AsnPro: 2.006 ± 0.024
AsnGln: 1.408 ± 0.016
AsnArg: 2.46 ± 0.025
AsnSer: 3.196 ± 0.029
AsnThr: 2.221 ± 0.023
AsnVal: 3.159 ± 0.025
AsnTrp: 0.49 ± 0.012
AsnTyr: 1.451 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.779 ± 0.032
ProCys: 0.867 ± 0.016
ProAsp: 2.306 ± 0.026
ProGlu: 2.744 ± 0.027
ProPhe: 2.102 ± 0.023
ProGly: 2.96 ± 0.108
ProHis: 1.162 ± 0.016
ProIle: 2.507 ± 0.026
ProLys: 2.279 ± 0.025
ProLeu: 4.232 ± 0.04
ProMet: 1.024 ± 0.015
ProAsn: 1.953 ± 0.022
ProPro: 3.612 ± 0.061
ProGln: 1.766 ± 0.025
ProArg: 2.463 ± 0.028
ProSer: 4.244 ± 0.042
ProThr: 2.837 ± 0.038
ProVal: 3.104 ± 0.034
ProTrp: 0.474 ± 0.012
ProTyr: 1.462 ± 0.021
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.353 ± 0.028
GlnCys: 0.942 ± 0.023
GlnAsp: 1.404 ± 0.022
GlnGlu: 2.084 ± 0.027
GlnPhe: 1.65 ± 0.02
GlnGly: 1.677 ± 0.023
GlnHis: 1.038 ± 0.017
GlnIle: 2.154 ± 0.022
GlnLys: 2.217 ± 0.022
GlnLeu: 4.184 ± 0.039
GlnMet: 1.17 ± 0.018
GlnAsn: 1.599 ± 0.022
GlnPro: 1.801 ± 0.027
GlnGln: 2.174 ± 0.057
GlnArg: 2.522 ± 0.025
GlnSer: 2.596 ± 0.029
GlnThr: 1.905 ± 0.021
GlnVal: 2.38 ± 0.024
GlnTrp: 0.522 ± 0.015
GlnTyr: 1.208 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.682 ± 0.03
ArgCys: 1.437 ± 0.023
ArgAsp: 2.984 ± 0.031
ArgGlu: 3.845 ± 0.037
ArgPhe: 2.761 ± 0.026
ArgGly: 3.062 ± 0.032
ArgHis: 1.631 ± 0.021
ArgIle: 3.657 ± 0.027
ArgLys: 3.965 ± 0.035
ArgLeu: 5.985 ± 0.041
ArgMet: 1.631 ± 0.019
ArgAsn: 2.815 ± 0.025
ArgPro: 2.679 ± 0.027
ArgGln: 2.493 ± 0.026
ArgArg: 5.012 ± 0.046
ArgSer: 4.609 ± 0.034
ArgThr: 3.311 ± 0.028
ArgVal: 3.694 ± 0.033
ArgTrp: 0.764 ± 0.014
ArgTyr: 1.907 ± 0.022
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 5.108 ± 0.041
SerCys: 1.754 ± 0.025
SerAsp: 4.157 ± 0.034
SerGlu: 4.642 ± 0.04
SerPhe: 3.626 ± 0.034
SerGly: 4.632 ± 0.039
SerHis: 1.853 ± 0.02
SerIle: 4.443 ± 0.039
SerLys: 3.815 ± 0.032
SerLeu: 7.342 ± 0.048
SerMet: 1.924 ± 0.02
SerAsn: 3.133 ± 0.029
SerPro: 3.764 ± 0.049
SerGln: 2.723 ± 0.028
SerArg: 4.542 ± 0.036
SerSer: 7.698 ± 0.071
SerThr: 4.868 ± 0.045
SerVal: 5.35 ± 0.038
SerTrp: 0.86 ± 0.016
SerTyr: 2.259 ± 0.023
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 3.813 ± 0.03
ThrCys: 1.229 ± 0.022
ThrAsp: 2.642 ± 0.029
ThrGlu: 3.037 ± 0.028
ThrPhe: 2.452 ± 0.025
ThrGly: 3.103 ± 0.028
ThrHis: 1.239 ± 0.02
ThrIle: 3.276 ± 0.033
ThrLys: 2.86 ± 0.027
ThrLeu: 5.008 ± 0.036
ThrMet: 1.439 ± 0.019
ThrAsn: 2.296 ± 0.024
ThrPro: 2.767 ± 0.034
ThrGln: 1.79 ± 0.021
ThrArg: 2.903 ± 0.028
ThrSer: 4.627 ± 0.042
ThrThr: 3.571 ± 0.05
ThrVal: 4.208 ± 0.035
ThrTrp: 0.604 ± 0.011
ThrTyr: 1.607 ± 0.022
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.639 ± 0.031
ValCys: 1.654 ± 0.024
ValAsp: 4.06 ± 0.033
ValGlu: 4.337 ± 0.034
ValPhe: 3.279 ± 0.032
ValGly: 3.817 ± 0.035
ValHis: 1.743 ± 0.019
ValIle: 4.097 ± 0.029
ValLys: 3.603 ± 0.033
ValLeu: 6.672 ± 0.041
ValMet: 1.724 ± 0.02
ValAsn: 2.904 ± 0.027
ValPro: 3.162 ± 0.029
ValGln: 2.496 ± 0.023
ValArg: 3.903 ± 0.034
ValSer: 5.163 ± 0.038
ValThr: 3.729 ± 0.033
ValVal: 5.434 ± 0.047
ValTrp: 0.759 ± 0.014
ValTyr: 2.083 ± 0.024
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.684 ± 0.013
TrpCys: 0.259 ± 0.008
TrpAsp: 0.607 ± 0.013
TrpGlu: 0.654 ± 0.012
TrpPhe: 0.554 ± 0.012
TrpGly: 0.505 ± 0.013
TrpHis: 0.273 ± 0.009
TrpIle: 0.771 ± 0.016
TrpLys: 0.775 ± 0.014
TrpLeu: 1.17 ± 0.017
TrpMet: 0.391 ± 0.009
TrpAsn: 0.614 ± 0.013
TrpPro: 0.469 ± 0.011
TrpGln: 0.46 ± 0.009
TrpArg: 0.795 ± 0.015
TrpSer: 0.95 ± 0.014
TrpThr: 0.715 ± 0.013
TrpVal: 0.655 ± 0.012
TrpTrp: 0.179 ± 0.006
TrpTyr: 0.378 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.967 ± 0.022
TyrCys: 0.836 ± 0.016
TyrAsp: 1.815 ± 0.022
TyrGlu: 1.914 ± 0.023
TyrPhe: 1.566 ± 0.02
TyrGly: 2.086 ± 0.029
TyrHis: 0.839 ± 0.016
TyrIle: 1.696 ± 0.024
TyrLys: 1.515 ± 0.022
TyrLeu: 2.917 ± 0.024
TyrMet: 0.826 ± 0.014
TyrAsn: 1.316 ± 0.019
TyrPro: 1.389 ± 0.019
TyrGln: 1.111 ± 0.018
TyrArg: 1.99 ± 0.023
TyrSer: 2.376 ± 0.028
TyrThr: 1.583 ± 0.02
TyrVal: 2.094 ± 0.024
TyrTrp: 0.419 ± 0.01
TyrTyr: 1.224 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.001 ± 0.001
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.006 ± 0.005
Statistics based on 13350 proteins (4341050 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
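A protein-level bootstrap of this kind can be sketched in Python as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:  # skip a resample that contains no dipeptides at all
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome; real input would be the 13350 protein sequences.
toy = ["MKLLA", "LLK", "ALLLG", "MKT"]
mean_ll, err_ll = bootstrap_dipeptide_error(toy, "LL")
print(round(mean_ll, 1), "±", round(err_ll, 1))
```

Resampling whole proteins keeps dipeptides from the same sequence together, so repetitive proteins contribute to the error estimate as a unit rather than as many independent observations.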

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski