Amino acid dipeptide frequency for Geobacter uraniireducens (strain Rf4) (Geobacter uraniumreducens)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
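A minimal sketch of how such a frequency could be computed is shown below. It assumes one-letter residue codes and overlapping adjacent pairs that never span two proteins; the page does not state exactly how the database counts pairs, so the function name `dipeptide_frequencies` and the toy sequences are purely illustrative.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are overlapping and are counted within each
    protein only (no pair is formed across two proteins).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MAAL", "AAL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The pairs are ordered, so AlaCys and CysAla are counted separately, which matches the two distinct entries in the table.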

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red and underrepresented ones are marked in blue in the table.
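The uniform baseline and the over/under classification can be sketched as follows. This assumes the marking simply compares each observed value with the uniform expectation; the exact rule used for the colouring is not given on this page, and the helper names are hypothetical.

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 possible dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 if Xaa is included

# Two values taken from the table on this page.
ala_ala = classify(10.444, expected_20)
cys_cys = classify(0.235, expected_20)
print(expected_20, round(expected_21, 3), ala_ala, cys_cys)
```

With 20 residue types the baseline is 2.5‰, so AlaAla (10.444‰) is overrepresented and CysCys (0.235‰) is underrepresented.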

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
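As a small worked example of this unit conversion, the sketch below turns a table value into a fraction and a rough pair count. The total number of dipeptides is an assumption: it is derived from the page's summary statistics (4255 proteins, 1447873 amino acids) by supposing that a protein of length n contributes n - 1 overlapping pairs.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# Summary statistics quoted on this page.
n_proteins = 4255
n_residues = 1447873

# Assumption: each protein of length n yields n - 1 overlapping pairs.
n_dipeptides = n_residues - n_proteins

ala_ala_fraction = per_mille_to_fraction(10.444)  # AlaAla from the table
ala_ala_count = ala_ala_fraction * n_dipeptides   # approximate, not exact

print(f"fraction: {ala_ala_fraction:.6f}")
print(f"approximate AlaAla count: {ala_ala_count:.0f}")
```

The resulting count is only an estimate, because the table values are rounded and the pair-counting convention is assumed rather than documented here.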

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.444AlaAla: 10.444 ± 0.126
1.28AlaCys: 1.28 ± 0.042
5.055AlaAsp: 5.055 ± 0.06
6.252AlaGlu: 6.252 ± 0.107
3.579AlaPhe: 3.579 ± 0.05
8.465AlaGly: 8.465 ± 0.107
1.555AlaHis: 1.555 ± 0.032
5.461AlaIle: 5.461 ± 0.061
4.549AlaLys: 4.549 ± 0.062
9.188AlaLeu: 9.188 ± 0.1
2.414AlaMet: 2.414 ± 0.047
2.946AlaAsn: 2.946 ± 0.069
3.415AlaPro: 3.415 ± 0.059
2.565AlaGln: 2.565 ± 0.059
4.985AlaArg: 4.985 ± 0.065
4.975AlaSer: 4.975 ± 0.083
5.171AlaThr: 5.171 ± 0.103
7.242AlaVal: 7.242 ± 0.079
0.905AlaTrp: 0.905 ± 0.027
2.475AlaTyr: 2.475 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
0.989CysAla: 0.989 ± 0.029
0.235CysCys: 0.235 ± 0.014
0.646CysAsp: 0.646 ± 0.023
0.591CysGlu: 0.591 ± 0.024
0.485CysPhe: 0.485 ± 0.018
1.272CysGly: 1.272 ± 0.035
1.074CysHis: 1.074 ± 0.103
0.662CysIle: 0.662 ± 0.019
0.474CysLys: 0.474 ± 0.018
1.1CysLeu: 1.1 ± 0.03
0.279CysMet: 0.279 ± 0.016
0.475CysAsn: 0.475 ± 0.023
0.64CysPro: 0.64 ± 0.027
0.34CysGln: 0.34 ± 0.016
0.813CysArg: 0.813 ± 0.027
0.834CysSer: 0.834 ± 0.032
0.648CysThr: 0.648 ± 0.027
0.698CysVal: 0.698 ± 0.024
0.124CysTrp: 0.124 ± 0.009
0.421CysTyr: 0.421 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.515AspAla: 4.515 ± 0.059
0.704AspCys: 0.704 ± 0.028
2.692AspAsp: 2.692 ± 0.049
3.491AspGlu: 3.491 ± 0.06
2.292AspPhe: 2.292 ± 0.04
4.399AspGly: 4.399 ± 0.085
0.945AspHis: 0.945 ± 0.026
3.672AspIle: 3.672 ± 0.051
2.699AspLys: 2.699 ± 0.048
5.333AspLeu: 5.333 ± 0.066
1.304AspMet: 1.304 ± 0.028
2.018AspAsn: 2.018 ± 0.048
2.557AspPro: 2.557 ± 0.038
1.372AspGln: 1.372 ± 0.033
3.056AspArg: 3.056 ± 0.053
2.737AspSer: 2.737 ± 0.055
2.554AspThr: 2.554 ± 0.053
3.6AspVal: 3.6 ± 0.053
0.605AspTrp: 0.605 ± 0.024
1.979AspTyr: 1.979 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
5.678GluAla: 5.678 ± 0.088
0.586GluCys: 0.586 ± 0.023
2.619GluAsp: 2.619 ± 0.05
4.951GluGlu: 4.951 ± 0.088
2.305GluPhe: 2.305 ± 0.045
4.126GluGly: 4.126 ± 0.057
1.222GluHis: 1.222 ± 0.031
4.549GluIle: 4.549 ± 0.07
4.681GluLys: 4.681 ± 0.074
6.726GluLeu: 6.726 ± 0.094
1.902GluMet: 1.902 ± 0.042
2.317GluAsn: 2.317 ± 0.044
2.184GluPro: 2.184 ± 0.048
2.443GluGln: 2.443 ± 0.052
4.413GluArg: 4.413 ± 0.079
3.137GluSer: 3.137 ± 0.053
3.358GluThr: 3.358 ± 0.052
4.371GluVal: 4.371 ± 0.071
0.653GluTrp: 0.653 ± 0.023
1.739GluTyr: 1.739 ± 0.033
0.0GluXaa: 0.0 ± 0.0
Phe
3.753PheAla: 3.753 ± 0.057
0.585PheCys: 0.585 ± 0.019
2.494PheAsp: 2.494 ± 0.039
2.096PheGlu: 2.096 ± 0.045
1.931PhePhe: 1.931 ± 0.039
3.201PheGly: 3.201 ± 0.053
0.861PheHis: 0.861 ± 0.02
2.501PheIle: 2.501 ± 0.039
1.794PheLys: 1.794 ± 0.042
3.934PheLeu: 3.934 ± 0.069
0.912PheMet: 0.912 ± 0.025
1.656PheAsn: 1.656 ± 0.037
1.836PhePro: 1.836 ± 0.044
1.14PheGln: 1.14 ± 0.03
2.33PheArg: 2.33 ± 0.048
3.012PheSer: 3.012 ± 0.055
2.605PheThr: 2.605 ± 0.051
2.76PheVal: 2.76 ± 0.048
0.429PheTrp: 0.429 ± 0.019
1.27PheTyr: 1.27 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
6.631GlyAla: 6.631 ± 0.086
1.288GlyCys: 1.288 ± 0.042
3.987GlyAsp: 3.987 ± 0.076
4.942GlyGlu: 4.942 ± 0.063
3.398GlyPhe: 3.398 ± 0.046
6.514GlyGly: 6.514 ± 0.129
1.518GlyHis: 1.518 ± 0.041
5.582GlyIle: 5.582 ± 0.062
5.268GlyLys: 5.268 ± 0.073
6.976GlyLeu: 6.976 ± 0.072
2.296GlyMet: 2.296 ± 0.04
3.381GlyAsn: 3.381 ± 0.082
2.16GlyPro: 2.16 ± 0.047
2.142GlyGln: 2.142 ± 0.044
4.469GlyArg: 4.469 ± 0.064
4.89GlySer: 4.89 ± 0.103
4.81GlyThr: 4.81 ± 0.114
5.985GlyVal: 5.985 ± 0.077
0.97GlyTrp: 0.97 ± 0.025
2.801GlyTyr: 2.801 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
1.648HisAla: 1.648 ± 0.038
0.293HisCys: 0.293 ± 0.014
1.132HisAsp: 1.132 ± 0.032
1.153HisGlu: 1.153 ± 0.03
0.929HisPhe: 0.929 ± 0.025
1.831HisGly: 1.831 ± 0.049
0.489HisHis: 0.489 ± 0.02
1.138HisIle: 1.138 ± 0.026
0.885HisLys: 0.885 ± 0.024
2.04HisLeu: 2.04 ± 0.036
0.453HisMet: 0.453 ± 0.02
0.787HisAsn: 0.787 ± 0.027
1.211HisPro: 1.211 ± 0.033
0.608HisGln: 0.608 ± 0.022
1.161HisArg: 1.161 ± 0.031
1.119HisSer: 1.119 ± 0.034
0.946HisThr: 0.946 ± 0.03
1.236HisVal: 1.236 ± 0.032
0.213HisTrp: 0.213 ± 0.013
0.652HisTyr: 0.652 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
5.987IleAla: 5.987 ± 0.063
0.789IleCys: 0.789 ± 0.026
3.714IleAsp: 3.714 ± 0.049
3.83IleGlu: 3.83 ± 0.054
2.542IlePhe: 2.542 ± 0.042
4.76IleGly: 4.76 ± 0.055
1.198IleHis: 1.198 ± 0.028
3.882IleIle: 3.882 ± 0.061
3.154IleLys: 3.154 ± 0.057
5.63IleLeu: 5.63 ± 0.078
1.366IleMet: 1.366 ± 0.034
2.57IleAsn: 2.57 ± 0.043
3.086IlePro: 3.086 ± 0.047
1.656IleGln: 1.656 ± 0.04
3.356IleArg: 3.356 ± 0.053
4.17IleSer: 4.17 ± 0.061
3.902IleThr: 3.902 ± 0.066
4.351IleVal: 4.351 ± 0.061
0.512IleTrp: 0.512 ± 0.018
1.807IleTyr: 1.807 ± 0.036
0.0IleXaa: 0.0 ± 0.0
Lys
4.809LysAla: 4.809 ± 0.079
0.519LysCys: 0.519 ± 0.023
2.832LysAsp: 2.832 ± 0.042
4.172LysGlu: 4.172 ± 0.076
1.55LysPhe: 1.55 ± 0.035
4.398LysGly: 4.398 ± 0.064
0.952LysHis: 0.952 ± 0.029
3.453LysIle: 3.453 ± 0.053
3.872LysLys: 3.872 ± 0.07
4.872LysLeu: 4.872 ± 0.061
1.466LysMet: 1.466 ± 0.035
2.179LysAsn: 2.179 ± 0.041
2.279LysPro: 2.279 ± 0.046
1.644LysGln: 1.644 ± 0.038
3.105LysArg: 3.105 ± 0.052
3.111LysSer: 3.111 ± 0.044
3.107LysThr: 3.107 ± 0.046
3.83LysVal: 3.83 ± 0.062
0.5LysTrp: 0.5 ± 0.019
1.51LysTyr: 1.51 ± 0.04
0.0LysXaa: 0.0 ± 0.0
Leu
10.029LeuAla: 10.029 ± 0.107
1.156LeuCys: 1.156 ± 0.036
5.302LeuAsp: 5.302 ± 0.072
6.109LeuGlu: 6.109 ± 0.085
4.247LeuPhe: 4.247 ± 0.065
6.879LeuGly: 6.879 ± 0.073
1.959LeuHis: 1.959 ± 0.042
5.416LeuIle: 5.416 ± 0.08
5.851LeuLys: 5.851 ± 0.072
10.447LeuLeu: 10.447 ± 0.138
2.19LeuMet: 2.19 ± 0.041
3.547LeuAsn: 3.547 ± 0.056
4.819LeuPro: 4.819 ± 0.057
3.154LeuGln: 3.154 ± 0.047
5.554LeuArg: 5.554 ± 0.077
6.469LeuSer: 6.469 ± 0.078
5.702LeuThr: 5.702 ± 0.078
6.802LeuVal: 6.802 ± 0.084
0.876LeuTrp: 0.876 ± 0.027
2.591LeuTyr: 2.591 ± 0.042
0.0LeuXaa: 0.0 ± 0.0
Met
2.568MetAla: 2.568 ± 0.042
0.213MetCys: 0.213 ± 0.012
1.235MetAsp: 1.235 ± 0.03
1.775MetGlu: 1.775 ± 0.044
0.78MetPhe: 0.78 ± 0.022
1.885MetGly: 1.885 ± 0.041
0.458MetHis: 0.458 ± 0.02
1.296MetIle: 1.296 ± 0.037
1.735MetLys: 1.735 ± 0.04
2.354MetLeu: 2.354 ± 0.043
0.611MetMet: 0.611 ± 0.021
0.988MetAsn: 0.988 ± 0.027
1.171MetPro: 1.171 ± 0.028
0.842MetGln: 0.842 ± 0.023
1.417MetArg: 1.417 ± 0.031
1.395MetSer: 1.395 ± 0.031
1.502MetThr: 1.502 ± 0.033
1.862MetVal: 1.862 ± 0.038
0.155MetTrp: 0.155 ± 0.011
0.452MetTyr: 0.452 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
3.117AsnAla: 3.117 ± 0.062
0.545AsnCys: 0.545 ± 0.025
1.877AsnAsp: 1.877 ± 0.053
1.861AsnGlu: 1.861 ± 0.037
1.454AsnPhe: 1.454 ± 0.036
3.341AsnGly: 3.341 ± 0.081
0.706AsnHis: 0.706 ± 0.024
2.506AsnIle: 2.506 ± 0.044
1.691AsnLys: 1.691 ± 0.037
3.9AsnLeu: 3.9 ± 0.079
0.825AsnMet: 0.825 ± 0.025
1.555AsnAsn: 1.555 ± 0.047
2.234AsnPro: 2.234 ± 0.044
1.046AsnGln: 1.046 ± 0.032
2.263AsnArg: 2.263 ± 0.042
2.227AsnSer: 2.227 ± 0.057
1.924AsnThr: 1.924 ± 0.053
2.636AsnVal: 2.636 ± 0.049
0.446AsnTrp: 0.446 ± 0.018
1.229AsnTyr: 1.229 ± 0.032
0.0AsnXaa: 0.0 ± 0.0
Pro
4.516ProAla: 4.516 ± 0.071
0.437ProCys: 0.437 ± 0.017
2.657ProAsp: 2.657 ± 0.043
3.229ProGlu: 3.229 ± 0.05
2.02ProPhe: 2.02 ± 0.041
3.555ProGly: 3.555 ± 0.057
0.89ProHis: 0.89 ± 0.028
2.077ProIle: 2.077 ± 0.039
1.808ProLys: 1.808 ± 0.043
4.572ProLeu: 4.572 ± 0.066
0.937ProMet: 0.937 ± 0.027
1.321ProAsn: 1.321 ± 0.03
2.046ProPro: 2.046 ± 0.046
1.427ProGln: 1.427 ± 0.03
1.96ProArg: 1.96 ± 0.042
2.369ProSer: 2.369 ± 0.051
2.3ProThr: 2.3 ± 0.054
3.804ProVal: 3.804 ± 0.062
0.49ProTrp: 0.49 ± 0.019
1.33ProTyr: 1.33 ± 0.03
0.0ProXaa: 0.0 ± 0.0
Gln
2.964GlnAla: 2.964 ± 0.056
0.316GlnCys: 0.316 ± 0.019
1.341GlnAsp: 1.341 ± 0.035
2.12GlnGlu: 2.12 ± 0.049
1.12GlnPhe: 1.12 ± 0.03
2.278GlnGly: 2.278 ± 0.043
0.568GlnHis: 0.568 ± 0.02
1.892GlnIle: 1.892 ± 0.038
1.819GlnLys: 1.819 ± 0.04
2.953GlnLeu: 2.953 ± 0.048
0.861GlnMet: 0.861 ± 0.025
1.111GlnAsn: 1.111 ± 0.029
1.274GlnPro: 1.274 ± 0.032
1.2GlnGln: 1.2 ± 0.035
1.908GlnArg: 1.908 ± 0.038
1.722GlnSer: 1.722 ± 0.039
1.566GlnThr: 1.566 ± 0.036
2.299GlnVal: 2.299 ± 0.045
0.316GlnTrp: 0.316 ± 0.015
0.782GlnTyr: 0.782 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
4.021ArgAla: 4.021 ± 0.066
0.695ArgCys: 0.695 ± 0.023
2.984ArgAsp: 2.984 ± 0.047
4.581ArgGlu: 4.581 ± 0.08
2.692ArgPhe: 2.692 ± 0.053
3.678ArgGly: 3.678 ± 0.053
1.247ArgHis: 1.247 ± 0.027
4.029ArgIle: 4.029 ± 0.054
3.299ArgLys: 3.299 ± 0.058
6.34ArgLeu: 6.34 ± 0.085
1.601ArgMet: 1.601 ± 0.038
2.133ArgAsn: 2.133 ± 0.036
2.079ArgPro: 2.079 ± 0.043
2.154ArgGln: 2.154 ± 0.05
3.498ArgArg: 3.498 ± 0.069
3.05ArgSer: 3.05 ± 0.052
2.6ArgThr: 2.6 ± 0.037
3.875ArgVal: 3.875 ± 0.063
0.648ArgTrp: 0.648 ± 0.024
1.908ArgTyr: 1.908 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
5.426SerAla: 5.426 ± 0.086
0.926SerCys: 0.926 ± 0.037
3.001SerAsp: 3.001 ± 0.05
3.074SerGlu: 3.074 ± 0.057
2.826SerPhe: 2.826 ± 0.05
5.751SerGly: 5.751 ± 0.099
1.254SerHis: 1.254 ± 0.034
3.703SerIle: 3.703 ± 0.056
2.346SerLys: 2.346 ± 0.035
6.171SerLeu: 6.171 ± 0.066
1.408SerMet: 1.408 ± 0.032
1.982SerAsn: 1.982 ± 0.055
2.77SerPro: 2.77 ± 0.05
1.799SerGln: 1.799 ± 0.035
3.49SerArg: 3.49 ± 0.059
3.84SerSer: 3.84 ± 0.078
3.143SerThr: 3.143 ± 0.073
4.086SerVal: 4.086 ± 0.065
0.673SerTrp: 0.673 ± 0.023
1.863SerTyr: 1.863 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
5.674ThrAla: 5.674 ± 0.111
0.728ThrCys: 0.728 ± 0.036
2.855ThrAsp: 2.855 ± 0.061
2.866ThrGlu: 2.866 ± 0.055
2.389ThrPhe: 2.389 ± 0.054
5.363ThrGly: 5.363 ± 0.099
0.918ThrHis: 0.918 ± 0.028
3.839ThrIle: 3.839 ± 0.069
2.185ThrLys: 2.185 ± 0.044
5.681ThrLeu: 5.681 ± 0.071
1.231ThrMet: 1.231 ± 0.032
1.937ThrAsn: 1.937 ± 0.057
3.054ThrPro: 3.054 ± 0.06
1.284ThrGln: 1.284 ± 0.033
2.557ThrArg: 2.557 ± 0.05
3.326ThrSer: 3.326 ± 0.068
3.417ThrThr: 3.417 ± 0.111
4.73ThrVal: 4.73 ± 0.091
0.579ThrTrp: 0.579 ± 0.025
1.533ThrTyr: 1.533 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
7.056ValAla: 7.056 ± 0.065
0.912ValCys: 0.912 ± 0.024
3.862ValAsp: 3.862 ± 0.056
4.695ValGlu: 4.695 ± 0.071
2.771ValPhe: 2.771 ± 0.045
4.996ValGly: 4.996 ± 0.07
1.234ValHis: 1.234 ± 0.032
4.636ValIle: 4.636 ± 0.056
4.207ValLys: 4.207 ± 0.058
6.473ValLeu: 6.473 ± 0.075
1.803ValMet: 1.803 ± 0.041
2.826ValAsn: 2.826 ± 0.05
3.085ValPro: 3.085 ± 0.048
2.049ValGln: 2.049 ± 0.037
3.981ValArg: 3.981 ± 0.061
4.64ValSer: 4.64 ± 0.07
4.771ValThr: 4.771 ± 0.094
5.655ValVal: 5.655 ± 0.085
0.67ValTrp: 0.67 ± 0.024
1.933ValTyr: 1.933 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.684TrpAla: 0.684 ± 0.028
0.126TrpCys: 0.126 ± 0.01
0.515TrpAsp: 0.515 ± 0.02
0.614TrpGlu: 0.614 ± 0.022
0.414TrpPhe: 0.414 ± 0.018
0.846TrpGly: 0.846 ± 0.031
0.238TrpHis: 0.238 ± 0.012
0.527TrpIle: 0.527 ± 0.019
0.559TrpLys: 0.559 ± 0.026
1.169TrpLeu: 1.169 ± 0.033
0.266TrpMet: 0.266 ± 0.015
0.452TrpAsn: 0.452 ± 0.021
0.375TrpPro: 0.375 ± 0.019
0.491TrpGln: 0.491 ± 0.02
0.66TrpArg: 0.66 ± 0.022
0.671TrpSer: 0.671 ± 0.026
0.524TrpThr: 0.524 ± 0.027
0.657TrpVal: 0.657 ± 0.023
0.148TrpTrp: 0.148 ± 0.009
0.334TrpTyr: 0.334 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.413TyrAla: 2.413 ± 0.045
0.465TyrCys: 0.465 ± 0.024
1.723TyrAsp: 1.723 ± 0.045
1.558TyrGlu: 1.558 ± 0.033
1.379TyrPhe: 1.379 ± 0.037
2.471TyrGly: 2.471 ± 0.047
0.64TyrHis: 0.64 ± 0.021
1.438TyrIle: 1.438 ± 0.033
1.284TyrLys: 1.284 ± 0.037
3.241TyrLeu: 3.241 ± 0.047
0.566TyrMet: 0.566 ± 0.02
1.204TyrAsn: 1.204 ± 0.033
1.471TyrPro: 1.471 ± 0.036
1.057TyrGln: 1.057 ± 0.027
2.138TyrArg: 2.138 ± 0.046
1.838TyrSer: 1.838 ± 0.041
1.595TyrThr: 1.595 ± 0.052
1.775TyrVal: 1.775 ± 0.039
0.356TyrTrp: 0.356 ± 0.016
1.085TyrTyr: 1.085 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4255 proteins (1447873 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
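A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille
    frequency over bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
mean_aa, err_aa = bootstrap_error(["MAAL", "AALK", "GGAA", "LKAA"], "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate.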

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski