Amino acid dipeptide frequency for Parabacteroides distasonis (strain ATCC 8503 / DSM 20701 / CIP 104284 / JCM 5825 / NCTC 11152)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all 20 amino acids equally frequent), each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
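The uniform expectation is simply 1/400 = 0.25%, or 2.5‰. As a minimal sketch (not the Proteome-pI implementation), dipeptide frequencies could be obtained by counting overlapping residue pairs within each protein. The function name `dipeptide_per_mille`, the toy sequences, and the use of one-letter codes are assumptions made for illustration. The page does not state the exact baseline behind the red/blue marking, so the uniform 2.5‰ value is used here only as an example threshold.

```python
from collections import Counter

# Uniform baseline: 1 out of 400 possible dipeptides, expressed in per mille.
UNIFORM_EXPECTED_PER_MILLE = 1000 / 400  # 2.5

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the proteome.
toy = ["MALLA", "ALLK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented vs. uniform)")
```

On a toy input this small every observed pair is far above 2.5‰; the comparison only becomes meaningful on a full proteome.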

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
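As a small worked example of this conversion, the AlaAla entry from the table (5.205‰) corresponds to a fraction of 0.005205, roughly twice the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

ala_ala = 5.205  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
ratio_to_uniform = ala_ala / 2.5  # 2.5 per mille = 1/400

print(f"fraction: {fraction:.6f}")
print(f"ratio to uniform expectation: {ratio_to_uniform:.2f}")
```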

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.205 ± 0.084
AlaCys: 0.956 ± 0.027
AlaAsp: 3.95 ± 0.055
AlaGlu: 4.413 ± 0.068
AlaPhe: 3.313 ± 0.054
AlaGly: 5.127 ± 0.065
AlaHis: 1.111 ± 0.029
AlaIle: 4.887 ± 0.073
AlaLys: 4.091 ± 0.063
AlaLeu: 6.739 ± 0.078
AlaMet: 1.878 ± 0.035
AlaAsn: 3.232 ± 0.049
AlaPro: 2.311 ± 0.041
AlaGln: 2.47 ± 0.043
AlaArg: 3.053 ± 0.042
AlaSer: 4.609 ± 0.057
AlaThr: 3.77 ± 0.062
AlaVal: 4.517 ± 0.065
AlaTrp: 0.875 ± 0.026
AlaTyr: 3.005 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.739 ± 0.025
CysCys: 0.211 ± 0.014
CysAsp: 0.619 ± 0.019
CysGlu: 0.672 ± 0.021
CysPhe: 0.625 ± 0.023
CysGly: 0.986 ± 0.03
CysHis: 0.279 ± 0.014
CysIle: 0.823 ± 0.03
CysLys: 0.632 ± 0.02
CysLeu: 1.154 ± 0.026
CysMet: 0.355 ± 0.017
CysAsn: 0.483 ± 0.02
CysPro: 0.531 ± 0.021
CysGln: 0.307 ± 0.018
CysArg: 0.556 ± 0.02
CysSer: 0.764 ± 0.023
CysThr: 0.599 ± 0.02
CysVal: 0.787 ± 0.027
CysTrp: 0.171 ± 0.012
CysTyr: 0.497 ± 0.019
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.773 ± 0.062
AspCys: 0.584 ± 0.021
AspAsp: 2.546 ± 0.051
AspGlu: 3.752 ± 0.053
AspPhe: 2.936 ± 0.051
AspGly: 3.928 ± 0.062
AspHis: 0.943 ± 0.024
AspIle: 4.183 ± 0.052
AspLys: 3.953 ± 0.059
AspLeu: 5.218 ± 0.065
AspMet: 1.671 ± 0.029
AspAsn: 2.627 ± 0.044
AspPro: 2.235 ± 0.046
AspGln: 1.524 ± 0.03
AspArg: 2.619 ± 0.045
AspSer: 3.074 ± 0.046
AspThr: 2.795 ± 0.047
AspVal: 3.46 ± 0.057
AspTrp: 0.9 ± 0.026
AspTyr: 2.88 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.007 ± 0.067
GluCys: 0.586 ± 0.022
GluAsp: 3.385 ± 0.054
GluGlu: 5.083 ± 0.079
GluPhe: 2.338 ± 0.041
GluGly: 4.175 ± 0.059
GluHis: 1.125 ± 0.029
GluIle: 4.692 ± 0.066
GluLys: 5.023 ± 0.075
GluLeu: 5.906 ± 0.075
GluMet: 1.888 ± 0.037
GluAsn: 3.603 ± 0.049
GluPro: 1.842 ± 0.042
GluGln: 2.436 ± 0.048
GluArg: 3.473 ± 0.061
GluSer: 3.321 ± 0.051
GluThr: 3.179 ± 0.048
GluVal: 4.432 ± 0.063
GluTrp: 0.821 ± 0.026
GluTyr: 2.703 ± 0.041
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.091 ± 0.051
PheCys: 0.656 ± 0.026
PheAsp: 2.784 ± 0.047
PheGlu: 2.52 ± 0.045
PhePhe: 2.28 ± 0.053
PheGly: 3.275 ± 0.055
PheHis: 0.897 ± 0.023
PheIle: 3.268 ± 0.061
PheLys: 2.57 ± 0.046
PheLeu: 4.284 ± 0.065
PheMet: 1.342 ± 0.031
PheAsn: 2.44 ± 0.047
PhePro: 1.884 ± 0.037
PheGln: 1.305 ± 0.034
PheArg: 2.108 ± 0.04
PheSer: 3.557 ± 0.065
PheThr: 2.855 ± 0.044
PheVal: 3.016 ± 0.053
PheTrp: 0.586 ± 0.021
PheTyr: 2.011 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.396 ± 0.065
GlyCys: 0.887 ± 0.027
GlyAsp: 3.626 ± 0.052
GlyGlu: 4.234 ± 0.059
GlyPhe: 3.306 ± 0.057
GlyGly: 4.814 ± 0.078
GlyHis: 1.209 ± 0.03
GlyIle: 5.312 ± 0.072
GlyLys: 5.246 ± 0.071
GlyLeu: 6.115 ± 0.074
GlyMet: 2.155 ± 0.04
GlyAsn: 3.455 ± 0.062
GlyPro: 1.367 ± 0.035
GlyGln: 2.061 ± 0.039
GlyArg: 2.925 ± 0.05
GlySer: 4.123 ± 0.065
GlyThr: 4.156 ± 0.052
GlyVal: 4.824 ± 0.069
GlyTrp: 0.993 ± 0.03
GlyTyr: 3.421 ± 0.056
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.135 ± 0.026
HisCys: 0.282 ± 0.013
HisAsp: 0.868 ± 0.028
HisGlu: 1.02 ± 0.027
HisPhe: 1.02 ± 0.027
HisGly: 1.158 ± 0.03
HisHis: 0.437 ± 0.019
HisIle: 1.414 ± 0.034
HisLys: 1.031 ± 0.027
HisLeu: 1.777 ± 0.039
HisMet: 0.409 ± 0.018
HisAsn: 0.845 ± 0.022
HisPro: 1.036 ± 0.028
HisGln: 0.556 ± 0.019
HisArg: 0.825 ± 0.024
HisSer: 1.095 ± 0.028
HisThr: 1.043 ± 0.028
HisVal: 1.027 ± 0.027
HisTrp: 0.246 ± 0.014
HisTyr: 0.824 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.331 ± 0.074
IleCys: 0.991 ± 0.028
IleAsp: 4.154 ± 0.057
IleGlu: 4.457 ± 0.054
IlePhe: 2.781 ± 0.052
IleGly: 4.809 ± 0.08
IleHis: 1.479 ± 0.034
IleIle: 4.364 ± 0.069
IleLys: 4.072 ± 0.059
IleLeu: 6.247 ± 0.083
IleMet: 1.568 ± 0.032
IleAsn: 3.243 ± 0.05
IlePro: 3.363 ± 0.048
IleGln: 2.457 ± 0.041
IleArg: 3.679 ± 0.055
IleSer: 4.809 ± 0.06
IleThr: 4.023 ± 0.06
IleVal: 4.448 ± 0.068
IleTrp: 0.724 ± 0.024
IleTyr: 2.826 ± 0.045
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.793 ± 0.069
LysCys: 0.53 ± 0.021
LysAsp: 4.128 ± 0.064
LysGlu: 5.768 ± 0.079
LysPhe: 2.057 ± 0.034
LysGly: 4.503 ± 0.063
LysHis: 1.128 ± 0.023
LysIle: 4.199 ± 0.064
LysLys: 4.669 ± 0.074
LysLeu: 5.013 ± 0.064
LysMet: 1.972 ± 0.037
LysAsn: 3.477 ± 0.054
LysPro: 2.186 ± 0.04
LysGln: 2.418 ± 0.039
LysArg: 3.212 ± 0.05
LysSer: 3.379 ± 0.047
LysThr: 3.242 ± 0.053
LysVal: 4.271 ± 0.058
LysTrp: 0.727 ± 0.023
LysTyr: 2.804 ± 0.054
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.452 ± 0.08
LeuCys: 1.307 ± 0.029
LeuAsp: 4.981 ± 0.067
LeuGlu: 5.145 ± 0.067
LeuPhe: 4.873 ± 0.081
LeuGly: 5.925 ± 0.085
LeuHis: 1.585 ± 0.033
LeuIle: 5.895 ± 0.086
LeuLys: 6.082 ± 0.069
LeuLeu: 9.479 ± 0.105
LeuMet: 2.459 ± 0.044
LeuAsn: 4.637 ± 0.06
LeuPro: 4.311 ± 0.055
LeuGln: 2.961 ± 0.047
LeuArg: 4.93 ± 0.054
LeuSer: 7.227 ± 0.075
LeuThr: 5.078 ± 0.067
LeuVal: 5.317 ± 0.07
LeuTrp: 0.986 ± 0.025
LeuTyr: 3.808 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.066 ± 0.036
MetCys: 0.249 ± 0.014
MetAsp: 1.621 ± 0.033
MetGlu: 1.91 ± 0.04
MetPhe: 0.982 ± 0.028
MetGly: 1.849 ± 0.038
MetHis: 0.379 ± 0.015
MetIle: 1.768 ± 0.039
MetLys: 2.226 ± 0.036
MetLeu: 2.448 ± 0.05
MetMet: 0.801 ± 0.027
MetAsn: 1.638 ± 0.032
MetPro: 1.189 ± 0.03
MetGln: 0.917 ± 0.026
MetArg: 1.6 ± 0.034
MetSer: 1.582 ± 0.032
MetThr: 1.361 ± 0.029
MetVal: 1.618 ± 0.029
MetTrp: 0.232 ± 0.014
MetTyr: 0.924 ± 0.023
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.472 ± 0.053
AsnCys: 0.469 ± 0.021
AsnAsp: 2.529 ± 0.048
AsnGlu: 3.012 ± 0.055
AsnPhe: 2.192 ± 0.042
AsnGly: 3.776 ± 0.062
AsnHis: 0.946 ± 0.026
AsnIle: 3.594 ± 0.062
AsnLys: 3.072 ± 0.053
AsnLeu: 4.392 ± 0.065
AsnMet: 1.315 ± 0.029
AsnAsn: 2.525 ± 0.055
AsnPro: 2.541 ± 0.038
AsnGln: 1.638 ± 0.036
AsnArg: 2.429 ± 0.05
AsnSer: 2.776 ± 0.046
AsnThr: 2.802 ± 0.052
AsnVal: 3.192 ± 0.053
AsnTrp: 0.66 ± 0.024
AsnTyr: 2.355 ± 0.049
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.782 ± 0.044
ProCys: 0.361 ± 0.016
ProAsp: 2.728 ± 0.046
ProGlu: 3.56 ± 0.055
ProPhe: 2.035 ± 0.042
ProGly: 2.575 ± 0.05
ProHis: 0.695 ± 0.023
ProIle: 2.566 ± 0.039
ProLys: 2.074 ± 0.036
ProLeu: 3.439 ± 0.047
ProMet: 0.979 ± 0.027
ProAsn: 1.825 ± 0.037
ProPro: 0.91 ± 0.027
ProGln: 1.278 ± 0.03
ProArg: 1.374 ± 0.039
ProSer: 2.523 ± 0.043
ProThr: 2.063 ± 0.037
ProVal: 3.085 ± 0.05
ProTrp: 0.458 ± 0.018
ProTyr: 1.786 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.466 ± 0.044
GlnCys: 0.264 ± 0.014
GlnAsp: 1.63 ± 0.033
GlnGlu: 2.29 ± 0.041
GlnPhe: 1.296 ± 0.031
GlnGly: 2.083 ± 0.04
GlnHis: 0.522 ± 0.019
GlnIle: 2.395 ± 0.04
GlnLys: 2.267 ± 0.044
GlnLeu: 3.208 ± 0.053
GlnMet: 0.961 ± 0.027
GlnAsn: 1.579 ± 0.03
GlnPro: 1.273 ± 0.03
GlnGln: 1.373 ± 0.039
GlnArg: 1.562 ± 0.031
GlnSer: 1.896 ± 0.037
GlnThr: 1.92 ± 0.038
GlnVal: 2.167 ± 0.036
GlnTrp: 0.401 ± 0.017
GlnTyr: 1.42 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.715 ± 0.05
ArgCys: 0.47 ± 0.019
ArgAsp: 2.267 ± 0.042
ArgGlu: 3.188 ± 0.057
ArgPhe: 2.483 ± 0.045
ArgGly: 2.502 ± 0.045
ArgHis: 0.927 ± 0.024
ArgIle: 3.859 ± 0.05
ArgLys: 3.53 ± 0.053
ArgLeu: 4.844 ± 0.061
ArgMet: 1.552 ± 0.031
ArgAsn: 2.484 ± 0.045
ArgPro: 1.751 ± 0.035
ArgGln: 1.8 ± 0.035
ArgArg: 2.311 ± 0.045
ArgSer: 2.662 ± 0.045
ArgThr: 2.514 ± 0.038
ArgVal: 2.747 ± 0.041
ArgTrp: 0.66 ± 0.023
ArgTyr: 2.382 ± 0.046
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.115 ± 0.057
SerCys: 0.811 ± 0.026
SerAsp: 3.539 ± 0.05
SerGlu: 3.568 ± 0.047
SerPhe: 3.56 ± 0.051
SerGly: 4.764 ± 0.069
SerHis: 1.139 ± 0.03
SerIle: 4.47 ± 0.062
SerLys: 3.438 ± 0.048
SerLeu: 6.472 ± 0.076
SerMet: 1.568 ± 0.034
SerAsn: 2.614 ± 0.046
SerPro: 2.591 ± 0.04
SerGln: 1.823 ± 0.039
SerArg: 2.813 ± 0.046
SerSer: 4.124 ± 0.07
SerThr: 3.305 ± 0.055
SerVal: 4.482 ± 0.057
SerTrp: 0.797 ± 0.025
SerTyr: 3.057 ± 0.057
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.815 ± 0.066
ThrCys: 0.562 ± 0.019
ThrAsp: 3.294 ± 0.054
ThrGlu: 3.204 ± 0.052
ThrPhe: 2.85 ± 0.045
ThrGly: 4.254 ± 0.063
ThrHis: 1.051 ± 0.029
ThrIle: 3.913 ± 0.06
ThrLys: 2.698 ± 0.052
ThrLeu: 5.424 ± 0.07
ThrMet: 1.213 ± 0.03
ThrAsn: 2.498 ± 0.05
ThrPro: 2.779 ± 0.042
ThrGln: 1.769 ± 0.036
ThrArg: 2.257 ± 0.036
ThrSer: 3.392 ± 0.047
ThrThr: 3.08 ± 0.054
ThrVal: 3.937 ± 0.06
ThrTrp: 0.636 ± 0.024
ThrTyr: 2.434 ± 0.045
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.517 ± 0.069
ValCys: 0.934 ± 0.026
ValAsp: 3.659 ± 0.057
ValGlu: 4.046 ± 0.057
ValPhe: 3.035 ± 0.054
ValGly: 4.18 ± 0.062
ValHis: 1.033 ± 0.029
ValIle: 4.48 ± 0.068
ValLys: 4.129 ± 0.058
ValLeu: 5.982 ± 0.071
ValMet: 1.712 ± 0.042
ValAsn: 3.323 ± 0.057
ValPro: 2.648 ± 0.046
ValGln: 1.826 ± 0.036
ValArg: 3.107 ± 0.047
ValSer: 4.72 ± 0.069
ValThr: 3.827 ± 0.06
ValVal: 4.472 ± 0.066
ValTrp: 0.748 ± 0.024
ValTyr: 2.764 ± 0.046
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.737 ± 0.022
TrpCys: 0.154 ± 0.011
TrpAsp: 0.72 ± 0.024
TrpGlu: 0.759 ± 0.023
TrpPhe: 0.597 ± 0.022
TrpGly: 0.931 ± 0.025
TrpHis: 0.264 ± 0.014
TrpIle: 0.861 ± 0.025
TrpLys: 0.897 ± 0.027
TrpLeu: 1.179 ± 0.031
TrpMet: 0.431 ± 0.021
TrpAsn: 0.739 ± 0.026
TrpPro: 0.297 ± 0.014
TrpGln: 0.462 ± 0.019
TrpArg: 0.59 ± 0.021
TrpSer: 0.665 ± 0.023
TrpThr: 0.641 ± 0.021
TrpVal: 0.763 ± 0.025
TrpTrp: 0.188 ± 0.012
TrpTyr: 0.537 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.903 ± 0.045
TyrCys: 0.527 ± 0.021
TyrAsp: 2.574 ± 0.047
TyrGlu: 2.418 ± 0.041
TyrPhe: 2.172 ± 0.036
TyrGly: 3.009 ± 0.059
TyrHis: 0.897 ± 0.024
TyrIle: 2.882 ± 0.045
TyrLys: 2.777 ± 0.046
TyrLeu: 4.12 ± 0.05
TyrMet: 1.13 ± 0.026
TyrAsn: 2.422 ± 0.051
TyrPro: 2.05 ± 0.038
TyrGln: 1.591 ± 0.034
TyrArg: 2.226 ± 0.042
TyrSer: 2.753 ± 0.056
TyrThr: 2.808 ± 0.05
TyrVal: 2.578 ± 0.044
TyrTrp: 0.602 ± 0.022
TyrTyr: 2.075 ± 0.04
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3830 proteins (1437791 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
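The page states only that the error comes from 100 bootstrap replicates at the protein level. One plausible reading, sketched below as an assumption rather than the actual Proteome-pI code, is to resample whole proteins with replacement, recompute the per-mille frequency in each replicate, and report the standard deviation across replicates. The names `pair_counts` and `bootstrap_error`, the fixed seed, and the one-letter toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Std. dev. of one dipeptide's per-mille frequency across replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Hypothetical toy proteome; a real run would use all 3830 proteins.
toy = ["MALLA", "ALLK", "GGGG", "MKLLAA"]
print(f"LL error estimate: {bootstrap_error(toy, 'LL'):.1f} per mille")
```

Resampling proteins rather than individual residues keeps the pairs of each protein together, so the estimate reflects variation between proteins.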

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski