Amino acid dipepetide frequency for Venustampulla echinocandica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.672AlaAla: 8.672 ± 0.064
1.033AlaCys: 1.033 ± 0.013
4.099AlaAsp: 4.099 ± 0.03
5.045AlaGlu: 5.045 ± 0.043
3.043AlaPhe: 3.043 ± 0.025
5.671AlaGly: 5.671 ± 0.034
1.682AlaHis: 1.682 ± 0.019
4.487AlaIle: 4.487 ± 0.032
4.227AlaLys: 4.227 ± 0.035
7.612AlaLeu: 7.612 ± 0.042
1.963AlaMet: 1.963 ± 0.022
3.025AlaAsn: 3.025 ± 0.025
4.779AlaPro: 4.779 ± 0.043
3.269AlaGln: 3.269 ± 0.027
4.622AlaArg: 4.622 ± 0.033
7.145AlaSer: 7.145 ± 0.038
5.306AlaThr: 5.306 ± 0.033
5.131AlaVal: 5.131 ± 0.034
1.095AlaTrp: 1.095 ± 0.018
2.092AlaTyr: 2.092 ± 0.018
0.0AlaXaa: 0.0 ± 0.0
Cys
0.879CysAla: 0.879 ± 0.014
0.23CysCys: 0.23 ± 0.007
0.631CysAsp: 0.631 ± 0.011
0.593CysGlu: 0.593 ± 0.012
0.522CysPhe: 0.522 ± 0.012
0.94CysGly: 0.94 ± 0.014
0.315CysHis: 0.315 ± 0.008
0.724CysIle: 0.724 ± 0.013
0.551CysLys: 0.551 ± 0.011
1.203CysLeu: 1.203 ± 0.019
0.255CysMet: 0.255 ± 0.007
0.45CysAsn: 0.45 ± 0.008
0.666CysPro: 0.666 ± 0.014
0.437CysGln: 0.437 ± 0.009
0.721CysArg: 0.721 ± 0.012
0.941CysSer: 0.941 ± 0.017
0.695CysThr: 0.695 ± 0.011
0.779CysVal: 0.779 ± 0.015
0.206CysTrp: 0.206 ± 0.005
0.375CysTyr: 0.375 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.404AspAla: 4.404 ± 0.032
0.617AspCys: 0.617 ± 0.014
4.113AspAsp: 4.113 ± 0.039
4.408AspGlu: 4.408 ± 0.037
2.185AspPhe: 2.185 ± 0.021
4.132AspGly: 4.132 ± 0.032
1.213AspHis: 1.213 ± 0.02
3.355AspIle: 3.355 ± 0.026
2.38AspLys: 2.38 ± 0.024
5.0AspLeu: 5.0 ± 0.031
1.295AspMet: 1.295 ± 0.014
1.955AspAsn: 1.955 ± 0.017
3.292AspPro: 3.292 ± 0.024
1.845AspGln: 1.845 ± 0.019
2.878AspArg: 2.878 ± 0.026
4.182AspSer: 4.182 ± 0.029
2.937AspThr: 2.937 ± 0.026
3.554AspVal: 3.554 ± 0.025
0.864AspTrp: 0.864 ± 0.013
1.545AspTyr: 1.545 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.303GluAla: 5.303 ± 0.042
0.629GluCys: 0.629 ± 0.012
4.212GluAsp: 4.212 ± 0.038
5.342GluGlu: 5.342 ± 0.057
1.979GluPhe: 1.979 ± 0.019
3.937GluGly: 3.937 ± 0.032
1.349GluHis: 1.349 ± 0.017
3.279GluIle: 3.279 ± 0.029
3.784GluLys: 3.784 ± 0.034
5.216GluLeu: 5.216 ± 0.039
1.487GluMet: 1.487 ± 0.017
2.405GluAsn: 2.405 ± 0.023
2.782GluPro: 2.782 ± 0.032
2.378GluGln: 2.378 ± 0.026
3.838GluArg: 3.838 ± 0.038
4.352GluSer: 4.352 ± 0.031
3.383GluThr: 3.383 ± 0.028
3.694GluVal: 3.694 ± 0.029
0.878GluTrp: 0.878 ± 0.013
1.663GluTyr: 1.663 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
2.95PheAla: 2.95 ± 0.028
0.558PheCys: 0.558 ± 0.009
2.289PheAsp: 2.289 ± 0.022
2.186PheGlu: 2.186 ± 0.02
1.487PhePhe: 1.487 ± 0.02
2.872PheGly: 2.872 ± 0.028
0.851PheHis: 0.851 ± 0.012
1.847PheIle: 1.847 ± 0.022
1.607PheLys: 1.607 ± 0.018
3.337PheLeu: 3.337 ± 0.031
0.781PheMet: 0.781 ± 0.014
1.485PheAsn: 1.485 ± 0.015
1.977PhePro: 1.977 ± 0.023
1.441PheGln: 1.441 ± 0.017
1.852PheArg: 1.852 ± 0.02
2.989PheSer: 2.989 ± 0.026
2.155PheThr: 2.155 ± 0.023
2.299PheVal: 2.299 ± 0.023
0.607PheTrp: 0.607 ± 0.011
1.057PheTyr: 1.057 ± 0.016
0.0PheXaa: 0.0 ± 0.0
Gly
5.361GlyAla: 5.361 ± 0.04
0.883GlyCys: 0.883 ± 0.014
3.764GlyAsp: 3.764 ± 0.029
3.743GlyGlu: 3.743 ± 0.034
2.8GlyPhe: 2.8 ± 0.031
6.04GlyGly: 6.04 ± 0.058
1.665GlyHis: 1.665 ± 0.019
3.744GlyIle: 3.744 ± 0.03
3.726GlyLys: 3.726 ± 0.03
6.043GlyLeu: 6.043 ± 0.038
1.675GlyMet: 1.675 ± 0.018
2.8GlyAsn: 2.8 ± 0.028
3.447GlyPro: 3.447 ± 0.029
2.512GlyGln: 2.512 ± 0.026
4.12GlyArg: 4.12 ± 0.035
5.854GlySer: 5.854 ± 0.04
4.051GlyThr: 4.051 ± 0.032
4.333GlyVal: 4.333 ± 0.03
1.148GlyTrp: 1.148 ± 0.016
2.141GlyTyr: 2.141 ± 0.026
0.0GlyXaa: 0.0 ± 0.0
His
1.703HisAla: 1.703 ± 0.022
0.311HisCys: 0.311 ± 0.008
1.241HisAsp: 1.241 ± 0.017
1.266HisGlu: 1.266 ± 0.016
0.887HisPhe: 0.887 ± 0.014
1.672HisGly: 1.672 ± 0.019
0.791HisHis: 0.791 ± 0.016
1.267HisIle: 1.267 ± 0.017
0.947HisLys: 0.947 ± 0.014
2.14HisLeu: 2.14 ± 0.021
0.471HisMet: 0.471 ± 0.011
0.89HisAsn: 0.89 ± 0.014
1.664HisPro: 1.664 ± 0.02
0.982HisGln: 0.982 ± 0.014
1.445HisArg: 1.445 ± 0.016
1.808HisSer: 1.808 ± 0.019
1.253HisThr: 1.253 ± 0.017
1.302HisVal: 1.302 ± 0.016
0.322HisTrp: 0.322 ± 0.007
0.664HisTyr: 0.664 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
4.298IleAla: 4.298 ± 0.033
0.783IleCys: 0.783 ± 0.014
2.986IleAsp: 2.986 ± 0.023
3.033IleGlu: 3.033 ± 0.03
2.094IlePhe: 2.094 ± 0.023
3.386IleGly: 3.386 ± 0.029
1.23IleHis: 1.23 ± 0.015
2.78IleIle: 2.78 ± 0.026
2.443IleLys: 2.443 ± 0.024
4.777IleLeu: 4.777 ± 0.032
1.065IleMet: 1.065 ± 0.014
1.954IleAsn: 1.954 ± 0.02
3.321IlePro: 3.321 ± 0.03
2.026IleGln: 2.026 ± 0.019
2.811IleArg: 2.811 ± 0.024
4.366IleSer: 4.366 ± 0.03
3.057IleThr: 3.057 ± 0.025
3.206IleVal: 3.206 ± 0.03
0.752IleTrp: 0.752 ± 0.013
1.495IleTyr: 1.495 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
4.436LysAla: 4.436 ± 0.035
0.55LysCys: 0.55 ± 0.011
2.908LysAsp: 2.908 ± 0.028
3.531LysGlu: 3.531 ± 0.035
1.552LysPhe: 1.552 ± 0.018
3.316LysGly: 3.316 ± 0.028
1.144LysHis: 1.144 ± 0.016
2.492LysIle: 2.492 ± 0.026
3.47LysLys: 3.47 ± 0.049
4.324LysLeu: 4.324 ± 0.034
1.061LysMet: 1.061 ± 0.013
1.866LysAsn: 1.866 ± 0.02
2.895LysPro: 2.895 ± 0.028
1.862LysGln: 1.862 ± 0.023
3.417LysArg: 3.417 ± 0.032
3.877LysSer: 3.877 ± 0.033
2.939LysThr: 2.939 ± 0.027
2.937LysVal: 2.937 ± 0.024
0.709LysTrp: 0.709 ± 0.012
1.467LysTyr: 1.467 ± 0.017
0.0LysXaa: 0.0 ± 0.0
Leu
7.681LeuAla: 7.681 ± 0.044
1.184LeuCys: 1.184 ± 0.017
5.14LeuAsp: 5.14 ± 0.032
5.675LeuGlu: 5.675 ± 0.042
3.177LeuPhe: 3.177 ± 0.029
5.964LeuGly: 5.964 ± 0.036
2.121LeuHis: 2.121 ± 0.022
3.996LeuIle: 3.996 ± 0.032
4.484LeuLys: 4.484 ± 0.033
8.261LeuLeu: 8.261 ± 0.055
1.751LeuMet: 1.751 ± 0.018
3.263LeuAsn: 3.263 ± 0.027
5.552LeuPro: 5.552 ± 0.037
3.881LeuGln: 3.881 ± 0.035
5.507LeuArg: 5.507 ± 0.038
7.26LeuSer: 7.26 ± 0.039
4.724LeuThr: 4.724 ± 0.032
5.271LeuVal: 5.271 ± 0.037
1.161LeuTrp: 1.161 ± 0.016
2.289LeuTyr: 2.289 ± 0.022
0.0LeuXaa: 0.0 ± 0.0
Met
2.263MetAla: 2.263 ± 0.02
0.233MetCys: 0.233 ± 0.007
1.3MetAsp: 1.3 ± 0.014
1.358MetGlu: 1.358 ± 0.016
0.736MetPhe: 0.736 ± 0.012
1.522MetGly: 1.522 ± 0.02
0.458MetHis: 0.458 ± 0.009
1.023MetIle: 1.023 ± 0.015
1.091MetLys: 1.091 ± 0.015
1.825MetLeu: 1.825 ± 0.019
0.565MetMet: 0.565 ± 0.011
0.822MetAsn: 0.822 ± 0.012
1.27MetPro: 1.27 ± 0.016
0.845MetGln: 0.845 ± 0.012
1.22MetArg: 1.22 ± 0.016
1.889MetSer: 1.889 ± 0.019
1.229MetThr: 1.229 ± 0.015
1.298MetVal: 1.298 ± 0.016
0.253MetTrp: 0.253 ± 0.007
0.514MetTyr: 0.514 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
3.136AsnAla: 3.136 ± 0.023
0.486AsnCys: 0.486 ± 0.011
2.028AsnAsp: 2.028 ± 0.021
2.076AsnGlu: 2.076 ± 0.021
1.455AsnPhe: 1.455 ± 0.017
3.174AsnGly: 3.174 ± 0.033
0.883AsnHis: 0.883 ± 0.014
2.303AsnIle: 2.303 ± 0.022
1.66AsnLys: 1.66 ± 0.02
3.39AsnLeu: 3.39 ± 0.028
0.873AsnMet: 0.873 ± 0.014
1.619AsnAsn: 1.619 ± 0.021
2.654AsnPro: 2.654 ± 0.025
1.394AsnGln: 1.394 ± 0.018
1.968AsnArg: 1.968 ± 0.02
3.089AsnSer: 3.089 ± 0.028
2.345AsnThr: 2.345 ± 0.021
2.28AsnVal: 2.28 ± 0.021
0.561AsnTrp: 0.561 ± 0.01
1.072AsnTyr: 1.072 ± 0.014
0.0AsnXaa: 0.0 ± 0.0
Pro
5.078ProAla: 5.078 ± 0.044
0.502ProCys: 0.502 ± 0.01
3.089ProAsp: 3.089 ± 0.025
3.793ProGlu: 3.793 ± 0.033
2.048ProPhe: 2.048 ± 0.018
4.015ProGly: 4.015 ± 0.032
1.328ProHis: 1.328 ± 0.016
2.828ProIle: 2.828 ± 0.024
2.92ProLys: 2.92 ± 0.03
4.82ProLeu: 4.82 ± 0.03
1.069ProMet: 1.069 ± 0.016
2.389ProAsn: 2.389 ± 0.025
5.362ProPro: 5.362 ± 0.071
2.615ProGln: 2.615 ± 0.027
3.374ProArg: 3.374 ± 0.026
6.305ProSer: 6.305 ± 0.048
4.251ProThr: 4.251 ± 0.04
3.443ProVal: 3.443 ± 0.03
0.725ProTrp: 0.725 ± 0.012
1.494ProTyr: 1.494 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
3.338GlnAla: 3.338 ± 0.026
0.437GlnCys: 0.437 ± 0.01
2.055GlnAsp: 2.055 ± 0.023
2.394GlnGlu: 2.394 ± 0.021
1.308GlnPhe: 1.308 ± 0.017
2.408GlnGly: 2.408 ± 0.025
1.033GlnHis: 1.033 ± 0.015
1.969GlnIle: 1.969 ± 0.021
2.055GlnLys: 2.055 ± 0.024
3.471GlnLeu: 3.471 ± 0.03
0.874GlnMet: 0.874 ± 0.016
1.673GlnAsn: 1.673 ± 0.02
2.497GlnPro: 2.497 ± 0.031
2.304GlnGln: 2.304 ± 0.04
2.558GlnArg: 2.558 ± 0.026
3.136GlnSer: 3.136 ± 0.025
2.302GlnThr: 2.302 ± 0.021
2.16GlnVal: 2.16 ± 0.024
0.537GlnTrp: 0.537 ± 0.01
1.18GlnTyr: 1.18 ± 0.018
0.0GlnXaa: 0.0 ± 0.0
Arg
4.456ArgAla: 4.456 ± 0.033
0.679ArgCys: 0.679 ± 0.014
3.233ArgAsp: 3.233 ± 0.031
3.752ArgGlu: 3.752 ± 0.032
2.021ArgPhe: 2.021 ± 0.023
3.796ArgGly: 3.796 ± 0.035
1.445ArgHis: 1.445 ± 0.016
2.926ArgIle: 2.926 ± 0.024
3.482ArgLys: 3.482 ± 0.031
5.17ArgLeu: 5.17 ± 0.044
1.268ArgMet: 1.268 ± 0.019
2.315ArgAsn: 2.315 ± 0.022
3.428ArgPro: 3.428 ± 0.031
2.482ArgGln: 2.482 ± 0.026
4.765ArgArg: 4.765 ± 0.049
4.728ArgSer: 4.728 ± 0.044
3.187ArgThr: 3.187 ± 0.025
3.194ArgVal: 3.194 ± 0.022
0.865ArgTrp: 0.865 ± 0.014
1.588ArgTyr: 1.588 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.53SerAla: 6.53 ± 0.036
0.894SerCys: 0.894 ± 0.015
4.183SerAsp: 4.183 ± 0.031
4.208SerGlu: 4.208 ± 0.03
3.02SerPhe: 3.02 ± 0.029
5.7SerGly: 5.7 ± 0.035
1.941SerHis: 1.941 ± 0.022
4.374SerIle: 4.374 ± 0.033
4.113SerLys: 4.113 ± 0.031
7.221SerLeu: 7.221 ± 0.045
1.789SerMet: 1.789 ± 0.019
3.367SerAsn: 3.367 ± 0.028
5.829SerPro: 5.829 ± 0.056
3.409SerGln: 3.409 ± 0.031
4.974SerArg: 4.974 ± 0.037
9.146SerSer: 9.146 ± 0.073
5.842SerThr: 5.842 ± 0.043
4.429SerVal: 4.429 ± 0.034
1.13SerTrp: 1.13 ± 0.015
2.131SerTyr: 2.131 ± 0.021
0.0SerXaa: 0.0 ± 0.0
Thr
5.178ThrAla: 5.178 ± 0.031
0.738ThrCys: 0.738 ± 0.012
2.753ThrAsp: 2.753 ± 0.023
3.121ThrGlu: 3.121 ± 0.027
2.279ThrPhe: 2.279 ± 0.023
4.178ThrGly: 4.178 ± 0.03
1.256ThrHis: 1.256 ± 0.017
3.299ThrIle: 3.299 ± 0.027
2.826ThrLys: 2.826 ± 0.025
5.254ThrLeu: 5.254 ± 0.034
1.202ThrMet: 1.202 ± 0.013
2.219ThrAsn: 2.219 ± 0.022
4.487ThrPro: 4.487 ± 0.04
2.068ThrGln: 2.068 ± 0.021
3.071ThrArg: 3.071 ± 0.024
5.573ThrSer: 5.573 ± 0.039
4.429ThrThr: 4.429 ± 0.042
3.655ThrVal: 3.655 ± 0.03
0.814ThrTrp: 0.814 ± 0.015
1.594ThrTyr: 1.594 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
5.18ValAla: 5.18 ± 0.033
0.762ValCys: 0.762 ± 0.013
3.638ValAsp: 3.638 ± 0.025
3.983ValGlu: 3.983 ± 0.029
2.309ValPhe: 2.309 ± 0.023
4.116ValGly: 4.116 ± 0.032
1.282ValHis: 1.282 ± 0.015
2.961ValIle: 2.961 ± 0.022
3.007ValLys: 3.007 ± 0.024
5.383ValLeu: 5.383 ± 0.035
1.301ValMet: 1.301 ± 0.016
2.168ValAsn: 2.168 ± 0.022
3.465ValPro: 3.465 ± 0.025
2.278ValGln: 2.278 ± 0.021
3.198ValArg: 3.198 ± 0.022
4.46ValSer: 4.46 ± 0.032
3.388ValThr: 3.388 ± 0.029
4.226ValVal: 4.226 ± 0.034
0.821ValTrp: 0.821 ± 0.013
1.667ValTyr: 1.667 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.121TrpAla: 1.121 ± 0.017
0.185TrpCys: 0.185 ± 0.006
0.896TrpAsp: 0.896 ± 0.013
0.852TrpGlu: 0.852 ± 0.015
0.518TrpPhe: 0.518 ± 0.01
0.941TrpGly: 0.941 ± 0.014
0.325TrpHis: 0.325 ± 0.009
0.766TrpIle: 0.766 ± 0.014
0.821TrpLys: 0.821 ± 0.015
1.266TrpLeu: 1.266 ± 0.018
0.347TrpMet: 0.347 ± 0.008
0.634TrpAsn: 0.634 ± 0.011
0.594TrpPro: 0.594 ± 0.011
0.542TrpGln: 0.542 ± 0.01
0.894TrpArg: 0.894 ± 0.014
1.013TrpSer: 1.013 ± 0.015
0.877TrpThr: 0.877 ± 0.013
0.86TrpVal: 0.86 ± 0.013
0.266TrpTrp: 0.266 ± 0.008
0.418TrpTyr: 0.418 ± 0.009
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.03TyrAla: 2.03 ± 0.02
0.421TyrCys: 0.421 ± 0.009
1.596TyrAsp: 1.596 ± 0.016
1.536TyrGlu: 1.536 ± 0.016
1.208TyrPhe: 1.208 ± 0.015
2.071TyrGly: 2.071 ± 0.023
0.716TyrHis: 0.716 ± 0.013
1.443TyrIle: 1.443 ± 0.019
1.157TyrLys: 1.157 ± 0.015
2.628TyrLeu: 2.628 ± 0.021
0.62TyrMet: 0.62 ± 0.012
1.144TyrAsn: 1.144 ± 0.016
1.486TyrPro: 1.486 ± 0.019
1.117TyrGln: 1.117 ± 0.018
1.547TyrArg: 1.547 ± 0.019
2.115TyrSer: 2.115 ± 0.02
1.645TyrThr: 1.645 ± 0.016
1.541TyrVal: 1.541 ± 0.017
0.423TyrTrp: 0.423 ± 0.01
0.899TyrTyr: 0.899 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10697 proteins (5497940 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski