Amino acid dipeptide frequency for Geobacter sp. OR-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
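As a minimal sketch of how such frequencies could be computed, the Python below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the page does not describe the exact procedure.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent residue pair exactly once
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real Geobacter data
freqs = dipeptide_frequencies(["MAAL", "AAG"])
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

With real data the input would be the list of protein sequences of the proteome; the frequencies of all observed dipeptides then sum to 1000‰.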

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
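The uniform expectation can be worked out directly. The sketch below computes it for 20 and 21 symbols and labels a few values copied from the table. The helper names and the plain greater-than/less-than comparison are assumptions; the page does not state the exact rule used for the colouring, and the comparison here ignores the error estimates.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation_permille(20)  # 1000 / 400, i.e. 0.25%
# Values copied from the table (per mille)
for name, value in [("LeuLeu", 11.11), ("AlaAla", 10.514), ("MetTrp", 0.139)]:
    print(name, value, classify(value, expected))
```

Passing `alphabet_size=21` gives the expectation when Xaa is counted as a 21st symbol (1000 / 441).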

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
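For illustration, the unit conversion can be written as two small helpers (hypothetical names, not part of any Proteome-pI tooling), applied here to the AlaAla value from the table:

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent."""
    return value_permille / 10.0

ala_ala = 10.514  # AlaAla from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```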

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Each entry reads dipeptide: frequency ± error (‰); entries are grouped by the first residue.

Ala
AlaAla: 10.514 ± 0.123
AlaCys: 1.214 ± 0.038
AlaAsp: 4.845 ± 0.062
AlaGlu: 6.04 ± 0.097
AlaPhe: 3.442 ± 0.057
AlaGly: 8.401 ± 0.099
AlaHis: 1.508 ± 0.039
AlaIle: 5.991 ± 0.072
AlaLys: 4.393 ± 0.062
AlaLeu: 9.156 ± 0.097
AlaMet: 2.464 ± 0.054
AlaAsn: 3.113 ± 0.066
AlaPro: 3.58 ± 0.063
AlaGln: 2.562 ± 0.044
AlaArg: 4.93 ± 0.069
AlaSer: 5.281 ± 0.071
AlaThr: 5.267 ± 0.104
AlaVal: 7.031 ± 0.089
AlaTrp: 1.008 ± 0.03
AlaTyr: 2.27 ± 0.039
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.007 ± 0.036
CysCys: 0.258 ± 0.016
CysAsp: 0.661 ± 0.022
CysGlu: 0.593 ± 0.022
CysPhe: 0.504 ± 0.021
CysGly: 1.277 ± 0.04
CysHis: 1.08 ± 0.127
CysIle: 0.683 ± 0.024
CysLys: 0.475 ± 0.018
CysLeu: 1.189 ± 0.034
CysMet: 0.301 ± 0.016
CysAsn: 0.528 ± 0.026
CysPro: 0.651 ± 0.031
CysGln: 0.369 ± 0.017
CysArg: 0.924 ± 0.034
CysSer: 1.065 ± 0.033
CysThr: 0.652 ± 0.028
CysVal: 0.663 ± 0.023
CysTrp: 0.154 ± 0.01
CysTyr: 0.438 ± 0.018
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.31 ± 0.056
AspCys: 0.779 ± 0.033
AspAsp: 2.742 ± 0.049
AspGlu: 3.404 ± 0.065
AspPhe: 2.295 ± 0.049
AspGly: 4.131 ± 0.083
AspHis: 0.998 ± 0.03
AspIle: 3.589 ± 0.057
AspLys: 2.455 ± 0.055
AspLeu: 5.413 ± 0.077
AspMet: 1.17 ± 0.029
AspAsn: 1.966 ± 0.047
AspPro: 2.732 ± 0.053
AspGln: 1.615 ± 0.032
AspArg: 3.503 ± 0.059
AspSer: 3.123 ± 0.052
AspThr: 2.535 ± 0.048
AspVal: 3.171 ± 0.057
AspTrp: 0.611 ± 0.022
AspTyr: 1.752 ± 0.037
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.365 ± 0.082
GluCys: 0.642 ± 0.024
GluAsp: 2.488 ± 0.051
GluGlu: 4.386 ± 0.085
GluPhe: 2.329 ± 0.039
GluGly: 3.851 ± 0.053
GluHis: 1.173 ± 0.03
GluIle: 4.617 ± 0.068
GluLys: 3.895 ± 0.069
GluLeu: 6.997 ± 0.098
GluMet: 1.865 ± 0.04
GluAsn: 2.19 ± 0.038
GluPro: 2.326 ± 0.045
GluGln: 2.638 ± 0.053
GluArg: 4.258 ± 0.068
GluSer: 3.323 ± 0.054
GluThr: 3.259 ± 0.057
GluVal: 3.989 ± 0.065
GluTrp: 0.626 ± 0.021
GluTyr: 1.78 ± 0.039
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.581 ± 0.058
PheCys: 0.633 ± 0.022
PheAsp: 2.411 ± 0.047
PheGlu: 2.032 ± 0.041
PhePhe: 1.841 ± 0.04
PheGly: 3.345 ± 0.053
PheHis: 0.786 ± 0.026
PheIle: 2.476 ± 0.043
PheLys: 1.657 ± 0.038
PheLeu: 3.859 ± 0.06
PheMet: 0.949 ± 0.026
PheAsn: 1.637 ± 0.039
PhePro: 1.825 ± 0.035
PheGln: 1.031 ± 0.023
PheArg: 2.435 ± 0.047
PheSer: 3.249 ± 0.053
PheThr: 2.356 ± 0.044
PheVal: 2.701 ± 0.049
PheTrp: 0.472 ± 0.016
PheTyr: 1.176 ± 0.03
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.431 ± 0.086
GlyCys: 1.39 ± 0.042
GlyAsp: 3.929 ± 0.071
GlyGlu: 4.806 ± 0.064
GlyPhe: 3.399 ± 0.059
GlyGly: 6.16 ± 0.117
GlyHis: 1.552 ± 0.031
GlyIle: 5.764 ± 0.064
GlyLys: 4.837 ± 0.067
GlyLeu: 6.977 ± 0.081
GlyMet: 2.315 ± 0.043
GlyAsn: 3.241 ± 0.073
GlyPro: 2.167 ± 0.043
GlyGln: 2.26 ± 0.045
GlyArg: 4.483 ± 0.061
GlySer: 5.242 ± 0.097
GlyThr: 4.864 ± 0.118
GlyVal: 5.763 ± 0.065
GlyTrp: 0.977 ± 0.029
GlyTyr: 2.791 ± 0.047
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.626 ± 0.046
HisCys: 0.293 ± 0.015
HisAsp: 1.144 ± 0.031
HisGlu: 1.18 ± 0.032
HisPhe: 0.875 ± 0.025
HisGly: 1.782 ± 0.05
HisHis: 0.506 ± 0.026
HisIle: 1.095 ± 0.028
HisLys: 0.828 ± 0.032
HisLeu: 2.104 ± 0.042
HisMet: 0.402 ± 0.019
HisAsn: 0.761 ± 0.027
HisPro: 1.222 ± 0.03
HisGln: 0.657 ± 0.022
HisArg: 1.214 ± 0.033
HisSer: 1.247 ± 0.041
HisThr: 0.976 ± 0.034
HisVal: 1.131 ± 0.033
HisTrp: 0.224 ± 0.012
HisTyr: 0.63 ± 0.021
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.417 ± 0.082
IleCys: 0.881 ± 0.025
IleAsp: 3.805 ± 0.057
IleGlu: 3.972 ± 0.051
IlePhe: 2.444 ± 0.047
IleGly: 4.804 ± 0.071
IleHis: 1.175 ± 0.031
IleIle: 4.043 ± 0.07
IleLys: 3.031 ± 0.057
IleLeu: 5.705 ± 0.079
IleMet: 1.499 ± 0.035
IleAsn: 2.498 ± 0.048
IlePro: 3.244 ± 0.048
IleGln: 1.668 ± 0.043
IleArg: 3.709 ± 0.057
IleSer: 4.446 ± 0.066
IleThr: 3.853 ± 0.051
IleVal: 4.431 ± 0.065
IleTrp: 0.586 ± 0.019
IleTyr: 1.62 ± 0.033
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.401 ± 0.07
LysCys: 0.504 ± 0.023
LysAsp: 2.454 ± 0.053
LysGlu: 3.612 ± 0.069
LysPhe: 1.571 ± 0.036
LysGly: 4.192 ± 0.061
LysHis: 0.895 ± 0.029
LysIle: 3.305 ± 0.054
LysLys: 3.146 ± 0.073
LysLeu: 4.499 ± 0.072
LysMet: 1.399 ± 0.033
LysAsn: 1.943 ± 0.044
LysPro: 2.362 ± 0.044
LysGln: 1.692 ± 0.04
LysArg: 2.828 ± 0.049
LysSer: 3.085 ± 0.056
LysThr: 2.68 ± 0.048
LysVal: 3.574 ± 0.063
LysTrp: 0.43 ± 0.017
LysTyr: 1.348 ± 0.037
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.453 ± 0.126
LeuCys: 1.213 ± 0.032
LeuAsp: 5.339 ± 0.068
LeuGlu: 6.014 ± 0.087
LeuPhe: 4.264 ± 0.071
LeuGly: 6.822 ± 0.085
LeuHis: 1.899 ± 0.039
LeuIle: 5.533 ± 0.08
LeuLys: 5.413 ± 0.075
LeuLeu: 11.11 ± 0.152
LeuMet: 2.26 ± 0.042
LeuAsn: 3.632 ± 0.052
LeuPro: 4.952 ± 0.067
LeuGln: 3.228 ± 0.049
LeuArg: 5.474 ± 0.074
LeuSer: 6.769 ± 0.077
LeuThr: 5.5 ± 0.063
LeuVal: 6.867 ± 0.081
LeuTrp: 0.898 ± 0.034
LeuTyr: 2.603 ± 0.043
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.702 ± 0.055
MetCys: 0.149 ± 0.009
MetAsp: 1.176 ± 0.03
MetGlu: 1.43 ± 0.036
MetPhe: 0.771 ± 0.028
MetGly: 1.767 ± 0.041
MetHis: 0.482 ± 0.02
MetIle: 1.498 ± 0.035
MetLys: 1.648 ± 0.038
MetLeu: 2.497 ± 0.043
MetMet: 0.605 ± 0.023
MetAsn: 1.072 ± 0.029
MetPro: 1.297 ± 0.031
MetGln: 0.835 ± 0.025
MetArg: 1.314 ± 0.036
MetSer: 1.462 ± 0.032
MetThr: 1.57 ± 0.038
MetVal: 1.708 ± 0.033
MetTrp: 0.139 ± 0.009
MetTyr: 0.449 ± 0.017
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.105 ± 0.052
AsnCys: 0.592 ± 0.028
AsnAsp: 1.897 ± 0.044
AsnGlu: 1.926 ± 0.036
AsnPhe: 1.323 ± 0.033
AsnGly: 3.215 ± 0.092
AsnHis: 0.708 ± 0.022
AsnIle: 2.473 ± 0.041
AsnLys: 1.607 ± 0.042
AsnLeu: 3.764 ± 0.052
AsnMet: 0.841 ± 0.024
AsnAsn: 1.635 ± 0.053
AsnPro: 2.233 ± 0.042
AsnGln: 1.126 ± 0.032
AsnArg: 2.495 ± 0.041
AsnSer: 2.541 ± 0.073
AsnThr: 1.809 ± 0.055
AsnVal: 2.32 ± 0.05
AsnTrp: 0.457 ± 0.02
AsnTyr: 1.159 ± 0.04
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.653 ± 0.068
ProCys: 0.451 ± 0.019
ProAsp: 2.956 ± 0.043
ProGlu: 3.483 ± 0.057
ProPhe: 2.027 ± 0.036
ProGly: 3.643 ± 0.048
ProHis: 0.871 ± 0.025
ProIle: 2.171 ± 0.042
ProLys: 1.915 ± 0.041
ProLeu: 4.636 ± 0.067
ProMet: 0.907 ± 0.024
ProAsn: 1.321 ± 0.037
ProPro: 2.09 ± 0.044
ProGln: 1.482 ± 0.035
ProArg: 1.913 ± 0.039
ProSer: 2.424 ± 0.046
ProThr: 2.105 ± 0.05
ProVal: 4.046 ± 0.073
ProTrp: 0.488 ± 0.021
ProTyr: 1.382 ± 0.033
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.943 ± 0.047
GlnCys: 0.333 ± 0.021
GlnAsp: 1.351 ± 0.03
GlnGlu: 2.181 ± 0.041
GlnPhe: 1.154 ± 0.031
GlnGly: 2.328 ± 0.04
GlnHis: 0.591 ± 0.019
GlnIle: 1.949 ± 0.039
GlnLys: 1.827 ± 0.044
GlnLeu: 3.34 ± 0.061
GlnMet: 0.885 ± 0.025
GlnAsn: 1.109 ± 0.03
GlnPro: 1.308 ± 0.033
GlnGln: 1.434 ± 0.04
GlnArg: 1.932 ± 0.041
GlnSer: 1.755 ± 0.035
GlnThr: 1.62 ± 0.042
GlnVal: 2.359 ± 0.044
GlnTrp: 0.361 ± 0.016
GlnTyr: 0.869 ± 0.029
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.898 ± 0.056
ArgCys: 0.753 ± 0.029
ArgAsp: 3.024 ± 0.059
ArgGlu: 4.378 ± 0.07
ArgPhe: 2.768 ± 0.046
ArgGly: 3.632 ± 0.059
ArgHis: 1.349 ± 0.033
ArgIle: 4.307 ± 0.049
ArgLys: 3.121 ± 0.053
ArgLeu: 6.434 ± 0.098
ArgMet: 1.572 ± 0.033
ArgAsn: 2.254 ± 0.039
ArgPro: 2.202 ± 0.049
ArgGln: 2.248 ± 0.046
ArgArg: 3.59 ± 0.066
ArgSer: 3.439 ± 0.057
ArgThr: 2.744 ± 0.047
ArgVal: 3.895 ± 0.059
ArgTrp: 0.642 ± 0.021
ArgTyr: 2.033 ± 0.037
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.77 ± 0.081
SerCys: 1.039 ± 0.04
SerAsp: 3.328 ± 0.06
SerGlu: 3.373 ± 0.058
SerPhe: 2.821 ± 0.048
SerGly: 6.582 ± 0.107
SerHis: 1.351 ± 0.038
SerIle: 3.943 ± 0.074
SerLys: 2.35 ± 0.054
SerLeu: 6.407 ± 0.083
SerMet: 1.4 ± 0.033
SerAsn: 2.101 ± 0.062
SerPro: 2.925 ± 0.055
SerGln: 1.906 ± 0.042
SerArg: 3.737 ± 0.056
SerSer: 4.406 ± 0.09
SerThr: 3.196 ± 0.076
SerVal: 4.272 ± 0.065
SerTrp: 0.804 ± 0.027
SerTyr: 1.983 ± 0.05
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.473 ± 0.096
ThrCys: 0.737 ± 0.033
ThrAsp: 2.765 ± 0.057
ThrGlu: 2.841 ± 0.049
ThrPhe: 2.217 ± 0.044
ThrGly: 5.545 ± 0.102
ThrHis: 0.876 ± 0.029
ThrIle: 3.643 ± 0.064
ThrLys: 2.079 ± 0.045
ThrLeu: 5.326 ± 0.065
ThrMet: 1.185 ± 0.031
ThrAsn: 1.881 ± 0.06
ThrPro: 3.013 ± 0.06
ThrGln: 1.313 ± 0.036
ThrArg: 2.597 ± 0.049
ThrSer: 3.266 ± 0.068
ThrThr: 3.271 ± 0.098
ThrVal: 4.803 ± 0.096
ThrTrp: 0.635 ± 0.031
ThrTyr: 1.467 ± 0.049
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.255 ± 0.081
ValCys: 0.94 ± 0.028
ValAsp: 3.742 ± 0.058
ValGlu: 4.241 ± 0.07
ValPhe: 2.593 ± 0.05
ValGly: 4.689 ± 0.064
ValHis: 1.213 ± 0.031
ValIle: 4.759 ± 0.064
ValLys: 3.602 ± 0.057
ValLeu: 6.279 ± 0.07
ValMet: 1.799 ± 0.04
ValAsn: 2.766 ± 0.046
ValPro: 3.039 ± 0.048
ValGln: 1.943 ± 0.041
ValArg: 4.058 ± 0.056
ValSer: 4.834 ± 0.076
ValThr: 4.639 ± 0.085
ValVal: 5.443 ± 0.075
ValTrp: 0.751 ± 0.027
ValTyr: 1.873 ± 0.041
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.718 ± 0.027
TrpCys: 0.141 ± 0.01
TrpAsp: 0.582 ± 0.021
TrpGlu: 0.655 ± 0.021
TrpPhe: 0.469 ± 0.018
TrpGly: 0.987 ± 0.038
TrpHis: 0.265 ± 0.014
TrpIle: 0.54 ± 0.02
TrpLys: 0.512 ± 0.023
TrpLeu: 1.229 ± 0.034
TrpMet: 0.24 ± 0.013
TrpAsn: 0.431 ± 0.022
TrpPro: 0.427 ± 0.02
TrpGln: 0.574 ± 0.019
TrpArg: 0.631 ± 0.022
TrpSer: 0.695 ± 0.029
TrpThr: 0.526 ± 0.032
TrpVal: 0.657 ± 0.022
TrpTrp: 0.16 ± 0.011
TrpTyr: 0.344 ± 0.017
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.386 ± 0.052
TyrCys: 0.531 ± 0.025
TyrAsp: 1.657 ± 0.044
TyrGlu: 1.452 ± 0.037
TyrPhe: 1.341 ± 0.036
TyrGly: 2.197 ± 0.041
TyrHis: 0.617 ± 0.02
TyrIle: 1.452 ± 0.03
TyrLys: 1.128 ± 0.032
TyrLeu: 3.32 ± 0.056
TyrMet: 0.51 ± 0.019
TyrAsn: 1.146 ± 0.038
TyrPro: 1.371 ± 0.035
TyrGln: 1.049 ± 0.027
TyrArg: 2.258 ± 0.044
TyrSer: 2.043 ± 0.06
TyrThr: 1.511 ± 0.044
TyrVal: 1.634 ± 0.037
TyrTrp: 0.36 ± 0.015
TyrTyr: 1.029 ± 0.033
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4153 proteins (1364415 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
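A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. Only the number of resamples and the protein-level resampling come from the note above; the function names, the use of the sample standard deviation as the reported error, the fixed seed, and the one-letter toy sequences are assumptions.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the sample size fixed
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not real Geobacter data
toy = ["MAALLK", "MLLAAG", "MKAAL", "MGLLA"]
print(pair_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.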

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski