Amino acid dipeptide frequency for Dysgonomonas gadei ATCC BAA-286

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
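As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes (the table on this page uses three-letter codes), and the toy sequences are assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids.
# Assumption: every other symbol is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input; "U" is treated as non-standard and becomes X.
toy = ["MKKLA", "MKAU"]
freqs = dipeptide_permille(toy)
print(len(freqs))             # 441
print(round(freqs["MK"], 1))  # 285.7 (2 of 7 pairs)
```

Reporting all 441 keys, rather than only the observed pairs, mirrors the table below, where dipeptides that never occur still appear with a value of 0.0.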

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
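The arithmetic behind the expected value can be sketched as follows: with 20 standard amino acids there are 20 × 20 = 400 dipeptides, so a uniform distribution gives 1/400 = 0.25% = 2.5‰ each. The `classify` helper and the choice of the uniform expectation as the over/under threshold are assumptions made for illustration; the page does not state the exact rule behind the red and blue marking. The example values are taken from the table below.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) of all dipeptides, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD**2  # 2.5


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed values (in per mille) copied from the table on this page.
examples = {"LeuLeu": 8.345, "AlaAla": 4.684, "CysCys": 0.143, "TrpTrp": 0.197}

for name, value in examples.items():
    fold = value / expected_permille  # ratio of observed to expected
    print(f"{name}: {value} per mille, {classify(value)}, {fold:.2f}x expected")
```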

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (rows and columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.684 ± 0.072
AlaCys: 0.719 ± 0.022
AlaAsp: 3.85 ± 0.052
AlaGlu: 3.987 ± 0.06
AlaPhe: 2.988 ± 0.051
AlaGly: 4.779 ± 0.067
AlaHis: 0.991 ± 0.03
AlaIle: 5.159 ± 0.067
AlaLys: 4.367 ± 0.054
AlaLeu: 5.869 ± 0.084
AlaMet: 1.592 ± 0.038
AlaAsn: 3.315 ± 0.053
AlaPro: 1.931 ± 0.035
AlaGln: 2.359 ± 0.039
AlaArg: 2.439 ± 0.042
AlaSer: 4.273 ± 0.065
AlaThr: 3.489 ± 0.054
AlaVal: 4.104 ± 0.063
AlaTrp: 0.74 ± 0.024
AlaTyr: 2.953 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.546 ± 0.02
CysCys: 0.143 ± 0.011
CysAsp: 0.566 ± 0.02
CysGlu: 0.508 ± 0.02
CysPhe: 0.516 ± 0.019
CysGly: 0.772 ± 0.027
CysHis: 0.186 ± 0.013
CysIle: 0.85 ± 0.027
CysLys: 0.571 ± 0.02
CysLeu: 0.849 ± 0.026
CysMet: 0.226 ± 0.013
CysAsn: 0.485 ± 0.019
CysPro: 0.384 ± 0.016
CysGln: 0.221 ± 0.013
CysArg: 0.404 ± 0.019
CysSer: 0.687 ± 0.025
CysThr: 0.503 ± 0.017
CysVal: 0.528 ± 0.017
CysTrp: 0.134 ± 0.008
CysTyr: 0.406 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.746 ± 0.062
AspCys: 0.508 ± 0.02
AspAsp: 2.89 ± 0.05
AspGlu: 3.703 ± 0.052
AspPhe: 3.195 ± 0.046
AspGly: 4.224 ± 0.065
AspHis: 0.785 ± 0.025
AspIle: 5.147 ± 0.062
AspLys: 4.994 ± 0.056
AspLeu: 4.748 ± 0.056
AspMet: 1.643 ± 0.041
AspAsn: 3.549 ± 0.054
AspPro: 1.995 ± 0.039
AspGln: 1.326 ± 0.031
AspArg: 2.342 ± 0.035
AspSer: 3.266 ± 0.046
AspThr: 2.711 ± 0.047
AspVal: 3.502 ± 0.047
AspTrp: 0.881 ± 0.027
AspTyr: 3.044 ± 0.048
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.098 ± 0.056
GluCys: 0.447 ± 0.021
GluAsp: 3.409 ± 0.047
GluGlu: 4.277 ± 0.073
GluPhe: 2.524 ± 0.037
GluGly: 3.734 ± 0.055
GluHis: 1.005 ± 0.023
GluIle: 4.885 ± 0.067
GluLys: 5.391 ± 0.071
GluLeu: 5.584 ± 0.072
GluMet: 1.752 ± 0.042
GluAsn: 3.918 ± 0.052
GluPro: 1.738 ± 0.036
GluGln: 2.291 ± 0.047
GluArg: 2.686 ± 0.05
GluSer: 3.461 ± 0.047
GluThr: 3.224 ± 0.053
GluVal: 3.809 ± 0.053
GluTrp: 0.79 ± 0.024
GluTyr: 2.968 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.946 ± 0.053
PheCys: 0.564 ± 0.02
PheAsp: 3.018 ± 0.046
PheGlu: 2.696 ± 0.043
PhePhe: 2.53 ± 0.053
PheGly: 3.345 ± 0.051
PheHis: 0.772 ± 0.023
PheIle: 3.776 ± 0.064
PheLys: 2.802 ± 0.046
PheLeu: 4.08 ± 0.065
PheMet: 1.199 ± 0.027
PheAsn: 2.662 ± 0.046
PhePro: 1.658 ± 0.032
PheGln: 1.226 ± 0.028
PheArg: 2.011 ± 0.036
PheSer: 3.934 ± 0.051
PheThr: 2.892 ± 0.043
PheVal: 2.857 ± 0.049
PheTrp: 0.633 ± 0.022
PheTyr: 2.147 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.075 ± 0.063
GlyCys: 0.696 ± 0.023
GlyAsp: 3.548 ± 0.049
GlyGlu: 3.923 ± 0.052
GlyPhe: 3.412 ± 0.047
GlyGly: 4.724 ± 0.081
GlyHis: 1.153 ± 0.03
GlyIle: 5.622 ± 0.07
GlyLys: 5.173 ± 0.065
GlyLeu: 5.84 ± 0.065
GlyMet: 1.775 ± 0.035
GlyAsn: 3.893 ± 0.062
GlyPro: 1.246 ± 0.031
GlyGln: 2.141 ± 0.041
GlyArg: 2.553 ± 0.042
GlySer: 4.079 ± 0.065
GlyThr: 4.125 ± 0.065
GlyVal: 4.378 ± 0.062
GlyTrp: 1.061 ± 0.027
GlyTyr: 3.617 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.925 ± 0.025
HisCys: 0.18 ± 0.011
HisAsp: 0.864 ± 0.024
HisGlu: 0.851 ± 0.025
HisPhe: 0.924 ± 0.025
HisGly: 1.071 ± 0.026
HisHis: 0.381 ± 0.018
HisIle: 1.393 ± 0.029
HisLys: 1.063 ± 0.027
HisLeu: 1.496 ± 0.037
HisMet: 0.366 ± 0.014
HisAsn: 0.967 ± 0.026
HisPro: 0.819 ± 0.027
HisGln: 0.529 ± 0.017
HisArg: 0.733 ± 0.021
HisSer: 0.999 ± 0.027
HisThr: 0.972 ± 0.027
HisVal: 0.806 ± 0.023
HisTrp: 0.226 ± 0.013
HisTyr: 0.853 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.382 ± 0.073
IleCys: 0.878 ± 0.028
IleAsp: 4.82 ± 0.069
IleGlu: 5.036 ± 0.066
IlePhe: 3.501 ± 0.066
IleGly: 5.016 ± 0.07
IleHis: 1.314 ± 0.032
IleIle: 5.906 ± 0.071
IleLys: 5.27 ± 0.066
IleLeu: 7.03 ± 0.092
IleMet: 1.553 ± 0.032
IleAsn: 4.326 ± 0.056
IlePro: 3.466 ± 0.046
IleGln: 2.326 ± 0.038
IleArg: 3.313 ± 0.047
IleSer: 5.82 ± 0.069
IleThr: 4.593 ± 0.055
IleVal: 4.725 ± 0.061
IleTrp: 0.807 ± 0.028
IleTyr: 3.335 ± 0.053
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.777 ± 0.066
LysCys: 0.443 ± 0.017
LysAsp: 5.001 ± 0.069
LysGlu: 5.871 ± 0.072
LysPhe: 2.51 ± 0.039
LysGly: 4.753 ± 0.059
LysHis: 1.179 ± 0.029
LysIle: 5.289 ± 0.071
LysLys: 5.865 ± 0.067
LysLeu: 5.561 ± 0.058
LysMet: 2.107 ± 0.037
LysAsn: 4.408 ± 0.058
LysPro: 2.388 ± 0.041
LysGln: 2.561 ± 0.043
LysArg: 2.939 ± 0.046
LysSer: 4.21 ± 0.058
LysThr: 4.174 ± 0.054
LysVal: 4.497 ± 0.057
LysTrp: 0.876 ± 0.025
LysTyr: 3.567 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.623 ± 0.075
LeuCys: 0.974 ± 0.027
LeuAsp: 4.843 ± 0.053
LeuGlu: 4.789 ± 0.067
LeuPhe: 4.583 ± 0.071
LeuGly: 5.296 ± 0.076
LeuHis: 1.46 ± 0.032
LeuIle: 6.547 ± 0.084
LeuLys: 6.6 ± 0.072
LeuLeu: 8.345 ± 0.109
LeuMet: 2.102 ± 0.039
LeuAsn: 5.09 ± 0.061
LeuPro: 3.696 ± 0.047
LeuGln: 2.9 ± 0.049
LeuArg: 3.586 ± 0.046
LeuSer: 7.019 ± 0.083
LeuThr: 5.091 ± 0.053
LeuVal: 4.76 ± 0.065
LeuTrp: 1.019 ± 0.028
LeuTyr: 3.784 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.694 ± 0.039
MetCys: 0.211 ± 0.012
MetAsp: 1.39 ± 0.029
MetGlu: 1.529 ± 0.031
MetPhe: 1.032 ± 0.03
MetGly: 1.64 ± 0.035
MetHis: 0.42 ± 0.019
MetIle: 1.652 ± 0.035
MetLys: 2.427 ± 0.043
MetLeu: 2.188 ± 0.042
MetMet: 0.658 ± 0.022
MetAsn: 1.497 ± 0.028
MetPro: 1.019 ± 0.024
MetGln: 0.958 ± 0.026
MetArg: 1.149 ± 0.034
MetSer: 1.553 ± 0.029
MetThr: 1.418 ± 0.029
MetVal: 1.407 ± 0.033
MetTrp: 0.248 ± 0.014
MetTyr: 0.907 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.418 ± 0.05
AsnCys: 0.452 ± 0.019
AsnAsp: 3.053 ± 0.044
AsnGlu: 3.273 ± 0.046
AsnPhe: 2.438 ± 0.04
AsnGly: 3.971 ± 0.066
AsnHis: 0.946 ± 0.028
AsnIle: 5.098 ± 0.073
AsnLys: 4.272 ± 0.052
AsnLeu: 4.841 ± 0.068
AsnMet: 1.515 ± 0.028
AsnAsn: 3.609 ± 0.062
AsnPro: 2.769 ± 0.05
AsnGln: 1.786 ± 0.038
AsnArg: 2.458 ± 0.045
AsnSer: 3.52 ± 0.056
AsnThr: 3.413 ± 0.053
AsnVal: 3.262 ± 0.049
AsnTrp: 0.74 ± 0.025
AsnTyr: 2.715 ± 0.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.565 ± 0.038
ProCys: 0.26 ± 0.012
ProAsp: 2.579 ± 0.042
ProGlu: 3.05 ± 0.045
ProPhe: 1.844 ± 0.037
ProGly: 2.215 ± 0.04
ProHis: 0.593 ± 0.021
ProIle: 2.361 ± 0.038
ProLys: 2.135 ± 0.036
ProLeu: 3.115 ± 0.05
ProMet: 0.819 ± 0.022
ProAsn: 1.781 ± 0.042
ProPro: 0.902 ± 0.026
ProGln: 1.295 ± 0.032
ProArg: 1.146 ± 0.029
ProSer: 2.284 ± 0.043
ProThr: 1.82 ± 0.036
ProVal: 3.025 ± 0.053
ProTrp: 0.438 ± 0.016
ProTyr: 1.69 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.182 ± 0.042
GlnCys: 0.204 ± 0.012
GlnAsp: 1.646 ± 0.036
GlnGlu: 2.051 ± 0.039
GlnPhe: 1.33 ± 0.033
GlnGly: 1.911 ± 0.036
GlnHis: 0.511 ± 0.021
GlnIle: 2.542 ± 0.043
GlnLys: 2.559 ± 0.041
GlnLeu: 2.918 ± 0.045
GlnMet: 0.874 ± 0.022
GlnAsn: 1.902 ± 0.036
GlnPro: 1.139 ± 0.031
GlnGln: 1.355 ± 0.035
GlnArg: 1.365 ± 0.032
GlnSer: 2.08 ± 0.035
GlnThr: 1.987 ± 0.035
GlnVal: 1.876 ± 0.035
GlnTrp: 0.439 ± 0.017
GlnTyr: 1.481 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.287 ± 0.044
ArgCys: 0.311 ± 0.015
ArgAsp: 2.113 ± 0.038
ArgGlu: 2.615 ± 0.051
ArgPhe: 2.144 ± 0.044
ArgGly: 2.277 ± 0.045
ArgHis: 0.7 ± 0.021
ArgIle: 3.489 ± 0.05
ArgLys: 3.202 ± 0.051
ArgLeu: 3.738 ± 0.052
ArgMet: 1.202 ± 0.024
ArgAsn: 2.439 ± 0.041
ArgPro: 1.289 ± 0.035
ArgGln: 1.427 ± 0.029
ArgArg: 1.711 ± 0.038
ArgSer: 2.237 ± 0.038
ArgThr: 2.168 ± 0.041
ArgVal: 2.334 ± 0.039
ArgTrp: 0.596 ± 0.021
ArgTyr: 2.067 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.23 ± 0.061
SerCys: 0.733 ± 0.024
SerAsp: 3.861 ± 0.052
SerGlu: 3.686 ± 0.054
SerPhe: 3.571 ± 0.05
SerGly: 5.009 ± 0.073
SerHis: 1.132 ± 0.027
SerIle: 5.093 ± 0.067
SerLys: 4.112 ± 0.048
SerLeu: 6.296 ± 0.07
SerMet: 1.476 ± 0.033
SerAsn: 3.385 ± 0.063
SerPro: 2.373 ± 0.041
SerGln: 2.203 ± 0.036
SerArg: 2.537 ± 0.046
SerSer: 4.425 ± 0.079
SerThr: 3.452 ± 0.05
SerVal: 4.588 ± 0.058
SerTrp: 0.811 ± 0.022
SerTyr: 3.161 ± 0.05
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.733 ± 0.061
ThrCys: 0.447 ± 0.017
ThrAsp: 3.667 ± 0.055
ThrGlu: 3.299 ± 0.05
ThrPhe: 2.785 ± 0.048
ThrGly: 4.581 ± 0.059
ThrHis: 0.933 ± 0.024
ThrIle: 4.463 ± 0.054
ThrLys: 3.406 ± 0.047
ThrLeu: 5.062 ± 0.056
ThrMet: 1.149 ± 0.029
ThrAsn: 3.016 ± 0.052
ThrPro: 2.537 ± 0.044
ThrGln: 1.728 ± 0.036
ThrArg: 1.932 ± 0.04
ThrSer: 3.53 ± 0.054
ThrThr: 3.267 ± 0.055
ThrVal: 3.827 ± 0.053
ThrTrp: 0.674 ± 0.022
ThrTyr: 2.594 ± 0.044
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.093 ± 0.062
ValCys: 0.718 ± 0.02
ValAsp: 3.553 ± 0.055
ValGlu: 3.682 ± 0.056
ValPhe: 3.024 ± 0.046
ValGly: 3.689 ± 0.059
ValHis: 0.927 ± 0.026
ValIle: 4.561 ± 0.064
ValLys: 4.509 ± 0.054
ValLeu: 5.339 ± 0.071
ValMet: 1.468 ± 0.035
ValAsn: 3.474 ± 0.05
ValPro: 2.289 ± 0.042
ValGln: 1.76 ± 0.036
ValArg: 2.462 ± 0.04
ValSer: 4.716 ± 0.06
ValThr: 3.591 ± 0.057
ValVal: 4.04 ± 0.063
ValTrp: 0.719 ± 0.021
ValTyr: 2.791 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.773 ± 0.025
TrpCys: 0.146 ± 0.009
TrpAsp: 0.823 ± 0.026
TrpGlu: 0.707 ± 0.023
TrpPhe: 0.557 ± 0.022
TrpGly: 1.005 ± 0.028
TrpHis: 0.221 ± 0.013
TrpIle: 0.935 ± 0.025
TrpLys: 0.87 ± 0.024
TrpLeu: 1.128 ± 0.03
TrpMet: 0.396 ± 0.016
TrpAsn: 0.82 ± 0.023
TrpPro: 0.288 ± 0.015
TrpGln: 0.468 ± 0.014
TrpArg: 0.546 ± 0.021
TrpSer: 0.75 ± 0.025
TrpThr: 0.738 ± 0.022
TrpVal: 0.736 ± 0.025
TrpTrp: 0.197 ± 0.012
TrpTyr: 0.567 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.806 ± 0.038
TyrCys: 0.452 ± 0.016
TyrAsp: 2.702 ± 0.048
TyrGlu: 2.459 ± 0.042
TyrPhe: 2.339 ± 0.042
TyrGly: 3.024 ± 0.048
TyrHis: 0.811 ± 0.025
TyrIle: 3.647 ± 0.052
TyrLys: 3.391 ± 0.049
TyrLeu: 4.081 ± 0.053
TyrMet: 1.127 ± 0.025
TyrAsn: 3.014 ± 0.058
TyrPro: 1.993 ± 0.04
TyrGln: 1.492 ± 0.033
TyrArg: 2.033 ± 0.037
TyrSer: 3.292 ± 0.049
TyrThr: 3.001 ± 0.05
TyrVal: 2.343 ± 0.039
TyrTrp: 0.641 ± 0.018
TyrTyr: 2.382 ± 0.045
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4153 proteins (1514011 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
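A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function name `bootstrap_permille`, the fixed seed, and the use of the standard deviation as the reported error are assumptions made for illustration; the page does not specify which spread statistic the ± values represent.

```python
import random
import statistics


def bootstrap_permille(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return (mean, standard
    deviation) of the per-mille frequency of one dipeptide."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        hits = total = 0
        for seq in sample:
            pairs = len(seq) - 1  # number of overlapping adjacent pairs
            if pairs < 1:
                continue
            total += pairs
            hits += sum(1 for i in range(pairs) if seq[i:i + 2] == dipeptide)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MLLKA", "MKLLLA", "MAAKK", "MLKAL"]
mean, sd = bootstrap_permille(toy_proteome, "LL")
print(f"LL: {mean:.1f} +/- {sd:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.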

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski