Amino acid dipepetide frequency for Pseudozobellia thermophila

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.846AlaAla: 4.846 ± 0.074
0.646AlaCys: 0.646 ± 0.023
3.993AlaAsp: 3.993 ± 0.057
4.442AlaGlu: 4.442 ± 0.062
3.346AlaPhe: 3.346 ± 0.048
4.612AlaGly: 4.612 ± 0.074
1.418AlaHis: 1.418 ± 0.035
5.029AlaIle: 5.029 ± 0.065
4.567AlaLys: 4.567 ± 0.066
7.23AlaLeu: 7.23 ± 0.078
1.764AlaMet: 1.764 ± 0.034
3.318AlaAsn: 3.318 ± 0.052
2.548AlaPro: 2.548 ± 0.048
2.787AlaGln: 2.787 ± 0.044
2.544AlaArg: 2.544 ± 0.047
4.475AlaSer: 4.475 ± 0.059
3.853AlaThr: 3.853 ± 0.064
4.472AlaVal: 4.472 ± 0.057
0.751AlaTrp: 0.751 ± 0.022
2.931AlaTyr: 2.931 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.508CysAla: 0.508 ± 0.019
0.092CysCys: 0.092 ± 0.008
0.447CysAsp: 0.447 ± 0.02
0.47CysGlu: 0.47 ± 0.02
0.386CysPhe: 0.386 ± 0.018
0.613CysGly: 0.613 ± 0.024
0.192CysHis: 0.192 ± 0.013
0.492CysIle: 0.492 ± 0.02
0.414CysLys: 0.414 ± 0.017
0.662CysLeu: 0.662 ± 0.024
0.171CysMet: 0.171 ± 0.011
0.359CysAsn: 0.359 ± 0.019
0.316CysPro: 0.316 ± 0.019
0.224CysGln: 0.224 ± 0.01
0.252CysArg: 0.252 ± 0.012
0.565CysSer: 0.565 ± 0.021
0.486CysThr: 0.486 ± 0.023
0.409CysVal: 0.409 ± 0.018
0.074CysTrp: 0.074 ± 0.008
0.3CysTyr: 0.3 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.024AspAla: 4.024 ± 0.061
0.403AspCys: 0.403 ± 0.019
3.481AspAsp: 3.481 ± 0.063
3.959AspGlu: 3.959 ± 0.063
3.589AspPhe: 3.589 ± 0.052
4.864AspGly: 4.864 ± 0.105
1.223AspHis: 1.223 ± 0.029
4.238AspIle: 4.238 ± 0.053
3.842AspLys: 3.842 ± 0.056
5.559AspLeu: 5.559 ± 0.073
1.332AspMet: 1.332 ± 0.027
2.933AspAsn: 2.933 ± 0.05
2.503AspPro: 2.503 ± 0.047
1.917AspGln: 1.917 ± 0.039
2.716AspArg: 2.716 ± 0.044
3.261AspSer: 3.261 ± 0.058
3.089AspThr: 3.089 ± 0.049
3.555AspVal: 3.555 ± 0.056
0.93AspTrp: 0.93 ± 0.025
2.945AspTyr: 2.945 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
5.052GluAla: 5.052 ± 0.07
0.342GluCys: 0.342 ± 0.016
3.702GluAsp: 3.702 ± 0.057
4.999GluGlu: 4.999 ± 0.074
2.728GluPhe: 2.728 ± 0.045
4.719GluGly: 4.719 ± 0.057
1.282GluHis: 1.282 ± 0.031
4.881GluIle: 4.881 ± 0.068
5.455GluLys: 5.455 ± 0.077
6.188GluLeu: 6.188 ± 0.081
1.628GluMet: 1.628 ± 0.038
4.194GluAsn: 4.194 ± 0.054
1.919GluPro: 1.919 ± 0.035
2.237GluGln: 2.237 ± 0.037
3.12GluArg: 3.12 ± 0.056
3.243GluSer: 3.243 ± 0.056
3.575GluThr: 3.575 ± 0.051
4.734GluVal: 4.734 ± 0.066
0.763GluTrp: 0.763 ± 0.024
2.373GluTyr: 2.373 ± 0.037
0.0GluXaa: 0.0 ± 0.0
Phe
3.079PheAla: 3.079 ± 0.047
0.386PheCys: 0.386 ± 0.017
3.468PheAsp: 3.468 ± 0.056
3.341PheGlu: 3.341 ± 0.041
2.557PhePhe: 2.557 ± 0.046
3.637PheGly: 3.637 ± 0.055
0.842PheHis: 0.842 ± 0.024
3.022PheIle: 3.022 ± 0.054
3.12PheLys: 3.12 ± 0.056
4.6PheLeu: 4.6 ± 0.067
1.146PheMet: 1.146 ± 0.026
2.531PheAsn: 2.531 ± 0.045
1.812PhePro: 1.812 ± 0.032
1.412PheGln: 1.412 ± 0.029
1.846PheArg: 1.846 ± 0.037
3.581PheSer: 3.581 ± 0.052
2.922PheThr: 2.922 ± 0.053
3.005PheVal: 3.005 ± 0.041
0.608PheTrp: 0.608 ± 0.023
2.099PheTyr: 2.099 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
4.884GlyAla: 4.884 ± 0.072
0.635GlyCys: 0.635 ± 0.032
4.129GlyAsp: 4.129 ± 0.074
4.403GlyGlu: 4.403 ± 0.056
3.734GlyPhe: 3.734 ± 0.057
5.215GlyGly: 5.215 ± 0.098
1.524GlyHis: 1.524 ± 0.027
5.419GlyIle: 5.419 ± 0.07
5.267GlyLys: 5.267 ± 0.069
6.731GlyLeu: 6.731 ± 0.075
1.778GlyMet: 1.778 ± 0.037
3.722GlyAsn: 3.722 ± 0.062
2.053GlyPro: 2.053 ± 0.04
2.493GlyGln: 2.493 ± 0.045
2.934GlyArg: 2.934 ± 0.042
4.644GlySer: 4.644 ± 0.077
4.494GlyThr: 4.494 ± 0.089
4.73GlyVal: 4.73 ± 0.061
0.968GlyTrp: 0.968 ± 0.028
3.241GlyTyr: 3.241 ± 0.051
0.0GlyXaa: 0.0 ± 0.0
His
1.142HisAla: 1.142 ± 0.03
0.188HisCys: 0.188 ± 0.011
0.926HisAsp: 0.926 ± 0.023
1.049HisGlu: 1.049 ± 0.032
1.156HisPhe: 1.156 ± 0.027
1.338HisGly: 1.338 ± 0.029
0.534HisHis: 0.534 ± 0.02
1.449HisIle: 1.449 ± 0.037
1.164HisLys: 1.164 ± 0.031
1.937HisLeu: 1.937 ± 0.041
0.422HisMet: 0.422 ± 0.019
0.898HisAsn: 0.898 ± 0.028
1.015HisPro: 1.015 ± 0.028
0.667HisGln: 0.667 ± 0.024
0.889HisArg: 0.889 ± 0.027
1.084HisSer: 1.084 ± 0.025
1.105HisThr: 1.105 ± 0.029
1.075HisVal: 1.075 ± 0.028
0.292HisTrp: 0.292 ± 0.014
0.861HisTyr: 0.861 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
5.211IleAla: 5.211 ± 0.072
0.565IleCys: 0.565 ± 0.022
4.831IleAsp: 4.831 ± 0.05
4.807IleGlu: 4.807 ± 0.061
2.87IlePhe: 2.87 ± 0.051
5.068IleGly: 5.068 ± 0.071
1.164IleHis: 1.164 ± 0.033
4.086IleIle: 4.086 ± 0.071
4.417IleLys: 4.417 ± 0.074
5.887IleLeu: 5.887 ± 0.08
1.224IleMet: 1.224 ± 0.032
3.405IleAsn: 3.405 ± 0.049
3.001IlePro: 3.001 ± 0.047
1.941IleGln: 1.941 ± 0.039
2.791IleArg: 2.791 ± 0.041
4.572IleSer: 4.572 ± 0.059
3.992IleThr: 3.992 ± 0.068
4.475IleVal: 4.475 ± 0.059
0.729IleTrp: 0.729 ± 0.022
2.433IleTyr: 2.433 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
5.032LysAla: 5.032 ± 0.08
0.31LysCys: 0.31 ± 0.016
3.994LysAsp: 3.994 ± 0.06
5.589LysGlu: 5.589 ± 0.083
2.503LysPhe: 2.503 ± 0.046
4.93LysGly: 4.93 ± 0.066
1.235LysHis: 1.235 ± 0.03
4.715LysIle: 4.715 ± 0.069
5.989LysLys: 5.989 ± 0.078
5.641LysLeu: 5.641 ± 0.067
1.72LysMet: 1.72 ± 0.037
4.16LysAsn: 4.16 ± 0.058
2.321LysPro: 2.321 ± 0.038
2.037LysGln: 2.037 ± 0.042
3.044LysArg: 3.044 ± 0.054
3.737LysSer: 3.737 ± 0.053
4.035LysThr: 4.035 ± 0.061
4.515LysVal: 4.515 ± 0.07
0.873LysTrp: 0.873 ± 0.029
2.663LysTyr: 2.663 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
6.861LeuAla: 6.861 ± 0.071
0.741LeuCys: 0.741 ± 0.024
5.433LeuAsp: 5.433 ± 0.067
6.062LeuGlu: 6.062 ± 0.071
4.775LeuPhe: 4.775 ± 0.067
6.797LeuGly: 6.797 ± 0.076
1.616LeuHis: 1.616 ± 0.034
5.705LeuIle: 5.705 ± 0.091
6.917LeuLys: 6.917 ± 0.08
8.949LeuLeu: 8.949 ± 0.117
2.174LeuMet: 2.174 ± 0.045
4.954LeuAsn: 4.954 ± 0.063
3.926LeuPro: 3.926 ± 0.049
3.067LeuGln: 3.067 ± 0.053
3.95LeuArg: 3.95 ± 0.053
6.423LeuSer: 6.423 ± 0.083
5.04LeuThr: 5.04 ± 0.065
6.083LeuVal: 6.083 ± 0.07
0.987LeuTrp: 0.987 ± 0.03
3.398LeuTyr: 3.398 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.177MetAla: 2.177 ± 0.039
0.134MetCys: 0.134 ± 0.011
1.356MetAsp: 1.356 ± 0.032
1.602MetGlu: 1.602 ± 0.03
0.827MetPhe: 0.827 ± 0.022
1.853MetGly: 1.853 ± 0.042
0.421MetHis: 0.421 ± 0.016
1.211MetIle: 1.211 ± 0.028
1.942MetLys: 1.942 ± 0.042
1.933MetLeu: 1.933 ± 0.041
0.522MetMet: 0.522 ± 0.021
1.093MetAsn: 1.093 ± 0.025
0.967MetPro: 0.967 ± 0.026
0.809MetGln: 0.809 ± 0.027
0.972MetArg: 0.972 ± 0.024
1.188MetSer: 1.188 ± 0.027
1.1MetThr: 1.1 ± 0.028
1.702MetVal: 1.702 ± 0.039
0.19MetTrp: 0.19 ± 0.01
0.703MetTyr: 0.703 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.466AsnAla: 3.466 ± 0.051
0.396AsnCys: 0.396 ± 0.019
3.087AsnAsp: 3.087 ± 0.062
3.095AsnGlu: 3.095 ± 0.048
2.623AsnPhe: 2.623 ± 0.05
4.278AsnGly: 4.278 ± 0.072
0.931AsnHis: 0.931 ± 0.025
3.699AsnIle: 3.699 ± 0.052
3.108AsnLys: 3.108 ± 0.052
4.784AsnLeu: 4.784 ± 0.068
1.132AsnMet: 1.132 ± 0.027
2.706AsnAsn: 2.706 ± 0.056
2.547AsnPro: 2.547 ± 0.041
1.72AsnGln: 1.72 ± 0.034
2.306AsnArg: 2.306 ± 0.041
3.055AsnSer: 3.055 ± 0.05
3.116AsnThr: 3.116 ± 0.055
2.982AsnVal: 2.982 ± 0.051
0.685AsnTrp: 0.685 ± 0.024
2.386AsnTyr: 2.386 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
2.25ProAla: 2.25 ± 0.045
0.225ProCys: 0.225 ± 0.018
2.771ProAsp: 2.771 ± 0.047
3.392ProGlu: 3.392 ± 0.056
1.946ProPhe: 1.946 ± 0.034
2.536ProGly: 2.536 ± 0.046
0.763ProHis: 0.763 ± 0.025
2.648ProIle: 2.648 ± 0.04
2.731ProLys: 2.731 ± 0.05
3.452ProLeu: 3.452 ± 0.043
0.917ProMet: 0.917 ± 0.027
2.142ProAsn: 2.142 ± 0.042
1.163ProPro: 1.163 ± 0.03
1.275ProGln: 1.275 ± 0.031
1.29ProArg: 1.29 ± 0.032
2.486ProSer: 2.486 ± 0.041
2.094ProThr: 2.094 ± 0.045
2.775ProVal: 2.775 ± 0.05
0.457ProTrp: 0.457 ± 0.02
1.607ProTyr: 1.607 ± 0.031
0.0ProXaa: 0.0 ± 0.0
Gln
2.164GlnAla: 2.164 ± 0.042
0.173GlnCys: 0.173 ± 0.012
1.688GlnAsp: 1.688 ± 0.035
2.358GlnGlu: 2.358 ± 0.041
1.43GlnPhe: 1.43 ± 0.031
2.307GlnGly: 2.307 ± 0.036
0.591GlnHis: 0.591 ± 0.022
2.284GlnIle: 2.284 ± 0.039
2.546GlnLys: 2.546 ± 0.047
3.383GlnLeu: 3.383 ± 0.053
0.857GlnMet: 0.857 ± 0.023
1.845GlnAsn: 1.845 ± 0.037
1.106GlnPro: 1.106 ± 0.027
1.214GlnGln: 1.214 ± 0.028
1.481GlnArg: 1.481 ± 0.031
1.751GlnSer: 1.751 ± 0.031
1.65GlnThr: 1.65 ± 0.037
2.198GlnVal: 2.198 ± 0.038
0.478GlnTrp: 0.478 ± 0.02
1.25GlnTyr: 1.25 ± 0.032
0.0GlnXaa: 0.0 ± 0.0
Arg
2.692ArgAla: 2.692 ± 0.045
0.221ArgCys: 0.221 ± 0.014
2.134ArgAsp: 2.134 ± 0.046
2.769ArgGlu: 2.769 ± 0.045
2.312ArgPhe: 2.312 ± 0.047
2.412ArgGly: 2.412 ± 0.047
0.876ArgHis: 0.876 ± 0.023
3.068ArgIle: 3.068 ± 0.053
3.099ArgLys: 3.099 ± 0.054
4.093ArgLeu: 4.093 ± 0.055
1.057ArgMet: 1.057 ± 0.028
2.306ArgAsn: 2.306 ± 0.042
1.83ArgPro: 1.83 ± 0.036
1.467ArgGln: 1.467 ± 0.03
1.702ArgArg: 1.702 ± 0.04
2.407ArgSer: 2.407 ± 0.042
2.195ArgThr: 2.195 ± 0.034
2.458ArgVal: 2.458 ± 0.042
0.509ArgTrp: 0.509 ± 0.019
1.963ArgTyr: 1.963 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
3.987SerAla: 3.987 ± 0.056
0.637SerCys: 0.637 ± 0.02
3.789SerAsp: 3.789 ± 0.049
3.934SerGlu: 3.934 ± 0.049
3.33SerPhe: 3.33 ± 0.047
4.857SerGly: 4.857 ± 0.079
1.139SerHis: 1.139 ± 0.029
4.435SerIle: 4.435 ± 0.059
3.921SerLys: 3.921 ± 0.056
5.936SerLeu: 5.936 ± 0.074
1.268SerMet: 1.268 ± 0.03
2.984SerAsn: 2.984 ± 0.049
2.426SerPro: 2.426 ± 0.057
1.889SerGln: 1.889 ± 0.033
2.398SerArg: 2.398 ± 0.041
3.876SerSer: 3.876 ± 0.063
3.313SerThr: 3.313 ± 0.058
4.081SerVal: 4.081 ± 0.058
0.723SerTrp: 0.723 ± 0.024
2.654SerTyr: 2.654 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
3.962ThrAla: 3.962 ± 0.062
0.353ThrCys: 0.353 ± 0.017
3.639ThrAsp: 3.639 ± 0.061
3.535ThrGlu: 3.535 ± 0.053
2.69ThrPhe: 2.69 ± 0.05
4.383ThrGly: 4.383 ± 0.071
1.017ThrHis: 1.017 ± 0.026
3.924ThrIle: 3.924 ± 0.055
3.182ThrLys: 3.182 ± 0.048
5.353ThrLeu: 5.353 ± 0.061
1.054ThrMet: 1.054 ± 0.022
2.493ThrAsn: 2.493 ± 0.054
2.663ThrPro: 2.663 ± 0.05
1.754ThrGln: 1.754 ± 0.034
1.904ThrArg: 1.904 ± 0.036
3.482ThrSer: 3.482 ± 0.052
3.211ThrThr: 3.211 ± 0.064
4.172ThrVal: 4.172 ± 0.075
0.619ThrTrp: 0.619 ± 0.021
2.438ThrTyr: 2.438 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
4.695ValAla: 4.695 ± 0.064
0.574ValCys: 0.574 ± 0.02
4.133ValAsp: 4.133 ± 0.054
4.068ValGlu: 4.068 ± 0.051
3.369ValPhe: 3.369 ± 0.056
4.462ValGly: 4.462 ± 0.057
1.247ValHis: 1.247 ± 0.029
4.05ValIle: 4.05 ± 0.054
3.955ValLys: 3.955 ± 0.057
6.608ValLeu: 6.608 ± 0.072
1.427ValMet: 1.427 ± 0.028
3.191ValAsn: 3.191 ± 0.049
2.756ValPro: 2.756 ± 0.042
1.983ValGln: 1.983 ± 0.038
2.705ValArg: 2.705 ± 0.047
4.535ValSer: 4.535 ± 0.056
3.491ValThr: 3.491 ± 0.069
4.741ValVal: 4.741 ± 0.07
0.741ValTrp: 0.741 ± 0.027
2.537ValTyr: 2.537 ± 0.044
0.0ValXaa: 0.0 ± 0.0
Trp
0.813TrpAla: 0.813 ± 0.026
0.103TrpCys: 0.103 ± 0.009
0.773TrpAsp: 0.773 ± 0.023
0.798TrpGlu: 0.798 ± 0.025
0.578TrpPhe: 0.578 ± 0.02
0.928TrpGly: 0.928 ± 0.027
0.293TrpHis: 0.293 ± 0.015
0.661TrpIle: 0.661 ± 0.024
0.851TrpLys: 0.851 ± 0.026
1.083TrpLeu: 1.083 ± 0.029
0.338TrpMet: 0.338 ± 0.018
0.687TrpAsn: 0.687 ± 0.022
0.409TrpPro: 0.409 ± 0.02
0.46TrpGln: 0.46 ± 0.018
0.542TrpArg: 0.542 ± 0.021
0.722TrpSer: 0.722 ± 0.026
0.611TrpThr: 0.611 ± 0.023
0.787TrpVal: 0.787 ± 0.024
0.196TrpTrp: 0.196 ± 0.012
0.479TrpTyr: 0.479 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.726TyrAla: 2.726 ± 0.045
0.309TyrCys: 0.309 ± 0.016
2.589TyrAsp: 2.589 ± 0.048
2.461TyrGlu: 2.461 ± 0.044
2.266TyrPhe: 2.266 ± 0.041
3.193TyrGly: 3.193 ± 0.055
0.883TyrHis: 0.883 ± 0.028
2.45TyrIle: 2.45 ± 0.036
2.556TyrLys: 2.556 ± 0.05
3.851TyrLeu: 3.851 ± 0.06
0.771TyrMet: 0.771 ± 0.023
2.179TyrAsn: 2.179 ± 0.045
1.639TyrPro: 1.639 ± 0.039
1.402TyrGln: 1.402 ± 0.032
2.183TyrArg: 2.183 ± 0.044
2.489TyrSer: 2.489 ± 0.048
2.454TyrThr: 2.454 ± 0.055
2.32TyrVal: 2.32 ± 0.046
0.54TyrTrp: 0.54 ± 0.019
1.83TyrTyr: 1.83 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4169 proteins (1506813 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski