Amino acid dipeptide frequency for Desulfovibrio sp. OH1186_COT-070

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
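As a minimal sketch of what a dipeptide frequency is, the snippet below counts adjacent residue pairs in a set of protein sequences and expresses each count in per mille of all pairs. The function name `dipeptide_frequencies`, the use of overlapping windows, and the normalization over the whole set of sequences are assumptions made for illustration; the page does not state how Proteome-pI computes the values.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of adjacent residue pairs.

    Assumption: pairs are counted with overlapping windows (positions
    i and i+1) and normalized by the total number of pairs in all
    sequences taken together.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical sequences, not from the proteome).
toy_sequences = ["MAAL", "AAG"]
toy_freqs = dipeptide_frequencies(toy_sequences)
for pair, value in sorted(toy_freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The toy input yields five pairs in total (MA, AA, AL, AA, AG), so AA accounts for two of the five.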

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
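The arithmetic above can be sketched as follows: a per-mille value is converted to a fraction by multiplying by 10⁻³, and it is compared with the uniform expectation of 1/400 (0.25%, i.e. 2.5‰). The helper names and the use of 2.5‰ as the exact threshold for red/blue marking are assumptions for illustration; the three example values are taken from the table on this page.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if value_per_mille > expected:
        return "overrepresented"
    if value_per_mille < expected:
        return "underrepresented"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 14.716, "AlaLeu": 13.876, "CysCys": 0.34}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```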

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.716 ± 0.222
AlaCys: 2.087 ± 0.055
AlaAsp: 5.746 ± 0.107
AlaGlu: 6.666 ± 0.108
AlaPhe: 3.86 ± 0.083
AlaGly: 9.519 ± 0.127
AlaHis: 2.73 ± 0.064
AlaIle: 3.879 ± 0.078
AlaLys: 3.335 ± 0.08
AlaLeu: 13.876 ± 0.187
AlaMet: 2.768 ± 0.065
AlaAsn: 2.353 ± 0.061
AlaPro: 5.424 ± 0.097
AlaGln: 4.339 ± 0.082
AlaArg: 8.65 ± 0.133
AlaSer: 5.75 ± 0.097
AlaThr: 4.175 ± 0.079
AlaVal: 8.31 ± 0.133
AlaTrp: 1.571 ± 0.054
AlaTyr: 2.283 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.834 ± 0.054
CysCys: 0.34 ± 0.021
CysAsp: 0.671 ± 0.032
CysGlu: 0.678 ± 0.027
CysPhe: 0.579 ± 0.031
CysGly: 1.751 ± 0.052
CysHis: 0.463 ± 0.03
CysIle: 0.742 ± 0.033
CysLys: 0.478 ± 0.03
CysLeu: 1.925 ± 0.056
CysMet: 0.447 ± 0.026
CysAsn: 0.461 ± 0.026
CysPro: 1.085 ± 0.041
CysGln: 0.388 ± 0.024
CysArg: 1.323 ± 0.037
CysSer: 0.915 ± 0.037
CysThr: 0.782 ± 0.029
CysVal: 1.155 ± 0.042
CysTrp: 0.202 ± 0.016
CysTyr: 0.349 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.198 ± 0.091
AspCys: 0.779 ± 0.034
AspAsp: 2.255 ± 0.064
AspGlu: 2.642 ± 0.065
AspPhe: 2.435 ± 0.068
AspGly: 4.057 ± 0.112
AspHis: 0.936 ± 0.033
AspIle: 3.081 ± 0.067
AspLys: 2.088 ± 0.062
AspLeu: 5.145 ± 0.081
AspMet: 1.95 ± 0.053
AspAsn: 1.55 ± 0.048
AspPro: 2.524 ± 0.056
AspGln: 1.222 ± 0.04
AspArg: 2.76 ± 0.06
AspSer: 2.734 ± 0.063
AspThr: 2.51 ± 0.068
AspVal: 3.778 ± 0.073
AspTrp: 0.783 ± 0.034
AspTyr: 1.284 ± 0.042
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.903 ± 0.117
GluCys: 0.664 ± 0.028
GluAsp: 3.284 ± 0.068
GluGlu: 4.371 ± 0.099
GluPhe: 1.716 ± 0.054
GluGly: 4.446 ± 0.086
GluHis: 1.418 ± 0.048
GluIle: 3.042 ± 0.066
GluLys: 3.891 ± 0.091
GluLeu: 5.395 ± 0.096
GluMet: 1.699 ± 0.042
GluAsn: 2.508 ± 0.058
GluPro: 2.225 ± 0.052
GluGln: 2.474 ± 0.063
GluArg: 4.3 ± 0.078
GluSer: 3.107 ± 0.071
GluThr: 2.465 ± 0.067
GluVal: 3.69 ± 0.074
GluTrp: 0.551 ± 0.029
GluTyr: 1.519 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.167 ± 0.072
PheCys: 0.944 ± 0.038
PheAsp: 1.986 ± 0.055
PheGlu: 1.725 ± 0.053
PhePhe: 2.075 ± 0.058
PheGly: 2.972 ± 0.074
PheHis: 0.807 ± 0.037
PheIle: 1.333 ± 0.046
PheLys: 1.078 ± 0.042
PheLeu: 4.59 ± 0.085
PheMet: 0.94 ± 0.041
PheAsn: 0.916 ± 0.047
PhePro: 1.805 ± 0.048
PheGln: 1.01 ± 0.034
PheArg: 2.499 ± 0.052
PheSer: 2.987 ± 0.063
PheThr: 1.919 ± 0.066
PheVal: 2.742 ± 0.055
PheTrp: 0.794 ± 0.032
PheTyr: 0.982 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.507 ± 0.124
GlyCys: 1.4 ± 0.048
GlyAsp: 3.732 ± 0.07
GlyGlu: 4.579 ± 0.081
GlyPhe: 3.226 ± 0.078
GlyGly: 6.932 ± 0.141
GlyHis: 2.008 ± 0.048
GlyIle: 4.565 ± 0.086
GlyLys: 4.139 ± 0.083
GlyLeu: 8.747 ± 0.129
GlyMet: 2.81 ± 0.057
GlyAsn: 2.452 ± 0.071
GlyPro: 2.929 ± 0.061
GlyGln: 3.306 ± 0.065
GlyArg: 5.68 ± 0.086
GlySer: 4.438 ± 0.09
GlyThr: 4.189 ± 0.121
GlyVal: 5.68 ± 0.098
GlyTrp: 0.999 ± 0.046
GlyTyr: 2.203 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.602 ± 0.058
HisCys: 0.475 ± 0.028
HisAsp: 1.182 ± 0.04
HisGlu: 1.275 ± 0.04
HisPhe: 0.994 ± 0.037
HisGly: 1.91 ± 0.053
HisHis: 0.523 ± 0.032
HisIle: 1.317 ± 0.051
HisLys: 0.859 ± 0.034
HisLeu: 2.49 ± 0.059
HisMet: 0.71 ± 0.032
HisAsn: 0.679 ± 0.029
HisPro: 1.338 ± 0.048
HisGln: 0.581 ± 0.029
HisArg: 1.24 ± 0.046
HisSer: 1.304 ± 0.038
HisThr: 1.232 ± 0.045
HisVal: 1.737 ± 0.051
HisTrp: 0.361 ± 0.022
HisTyr: 0.558 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.403 ± 0.094
IleCys: 0.804 ± 0.03
IleAsp: 2.13 ± 0.063
IleGlu: 2.105 ± 0.06
IlePhe: 2.121 ± 0.055
IleGly: 3.095 ± 0.079
IleHis: 0.942 ± 0.033
IleIle: 2.172 ± 0.069
IleLys: 1.533 ± 0.055
IleLeu: 4.992 ± 0.084
IleMet: 1.219 ± 0.048
IleAsn: 1.457 ± 0.042
IlePro: 2.53 ± 0.063
IleGln: 1.223 ± 0.043
IleArg: 3.124 ± 0.069
IleSer: 3.012 ± 0.063
IleThr: 2.433 ± 0.064
IleVal: 3.394 ± 0.075
IleTrp: 0.506 ± 0.028
IleTyr: 1.109 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.197 ± 0.091
LysCys: 0.382 ± 0.024
LysAsp: 2.059 ± 0.06
LysGlu: 2.498 ± 0.075
LysPhe: 1.033 ± 0.041
LysGly: 2.97 ± 0.081
LysHis: 0.757 ± 0.033
LysIle: 2.204 ± 0.057
LysLys: 2.676 ± 0.076
LysLeu: 3.418 ± 0.086
LysMet: 1.069 ± 0.039
LysAsn: 1.901 ± 0.056
LysPro: 1.81 ± 0.048
LysGln: 1.169 ± 0.05
LysArg: 2.244 ± 0.059
LysSer: 2.251 ± 0.052
LysThr: 2.257 ± 0.063
LysVal: 2.357 ± 0.066
LysTrp: 0.316 ± 0.021
LysTyr: 1.049 ± 0.043
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.688 ± 0.189
LeuCys: 2.116 ± 0.06
LeuAsp: 6.014 ± 0.088
LeuGlu: 7.069 ± 0.108
LeuPhe: 4.063 ± 0.08
LeuGly: 8.601 ± 0.12
LeuHis: 2.787 ± 0.074
LeuIle: 3.309 ± 0.079
LeuLys: 3.742 ± 0.078
LeuLeu: 13.064 ± 0.19
LeuMet: 2.292 ± 0.057
LeuAsn: 2.643 ± 0.059
LeuPro: 7.098 ± 0.122
LeuGln: 3.24 ± 0.069
LeuArg: 8.51 ± 0.134
LeuSer: 6.408 ± 0.094
LeuThr: 5.538 ± 0.092
LeuVal: 7.427 ± 0.102
LeuTrp: 1.416 ± 0.05
LeuTyr: 2.443 ± 0.058
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.139 ± 0.078
MetCys: 0.27 ± 0.018
MetAsp: 1.462 ± 0.045
MetGlu: 1.728 ± 0.044
MetPhe: 0.729 ± 0.028
MetGly: 2.216 ± 0.056
MetHis: 0.608 ± 0.03
MetIle: 0.969 ± 0.036
MetLys: 1.066 ± 0.034
MetLeu: 2.97 ± 0.062
MetMet: 0.5 ± 0.026
MetAsn: 0.804 ± 0.039
MetPro: 1.711 ± 0.045
MetGln: 1.029 ± 0.036
MetArg: 1.992 ± 0.053
MetSer: 1.587 ± 0.042
MetThr: 1.417 ± 0.049
MetVal: 1.686 ± 0.047
MetTrp: 0.185 ± 0.015
MetTyr: 0.514 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.174 ± 0.073
AsnCys: 0.411 ± 0.024
AsnAsp: 1.294 ± 0.053
AsnGlu: 1.354 ± 0.048
AsnPhe: 1.218 ± 0.036
AsnGly: 2.215 ± 0.065
AsnHis: 0.552 ± 0.03
AsnIle: 1.794 ± 0.06
AsnLys: 0.973 ± 0.041
AsnLeu: 3.085 ± 0.067
AsnMet: 0.896 ± 0.035
AsnAsn: 0.863 ± 0.04
AsnPro: 1.87 ± 0.044
AsnGln: 0.734 ± 0.036
AsnArg: 1.778 ± 0.056
AsnSer: 1.561 ± 0.057
AsnThr: 1.59 ± 0.051
AsnVal: 2.034 ± 0.056
AsnTrp: 0.382 ± 0.022
AsnTyr: 0.751 ± 0.033
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.041 ± 0.11
ProCys: 0.813 ± 0.031
ProAsp: 3.368 ± 0.069
ProGlu: 4.187 ± 0.078
ProPhe: 1.931 ± 0.047
ProGly: 4.629 ± 0.087
ProHis: 1.359 ± 0.044
ProIle: 1.339 ± 0.05
ProLys: 1.42 ± 0.05
ProLeu: 5.859 ± 0.103
ProMet: 1.014 ± 0.03
ProAsn: 1.019 ± 0.044
ProPro: 3.112 ± 0.089
ProGln: 2.392 ± 0.058
ProArg: 3.059 ± 0.066
ProSer: 2.609 ± 0.062
ProThr: 1.873 ± 0.054
ProVal: 4.358 ± 0.08
ProTrp: 0.813 ± 0.033
ProTyr: 1.255 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.399 ± 0.087
GlnCys: 0.551 ± 0.026
GlnAsp: 1.724 ± 0.044
GlnGlu: 2.425 ± 0.069
GlnPhe: 0.923 ± 0.035
GlnGly: 3.11 ± 0.074
GlnHis: 0.713 ± 0.03
GlnIle: 1.612 ± 0.049
GlnLys: 1.91 ± 0.053
GlnLeu: 2.588 ± 0.065
GlnMet: 0.979 ± 0.033
GlnAsn: 1.143 ± 0.043
GlnPro: 1.566 ± 0.051
GlnGln: 1.326 ± 0.047
GlnArg: 2.551 ± 0.056
GlnSer: 1.838 ± 0.052
GlnThr: 1.781 ± 0.054
GlnVal: 2.083 ± 0.057
GlnTrp: 0.559 ± 0.027
GlnTyr: 0.854 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.027 ± 0.127
ArgCys: 0.999 ± 0.039
ArgAsp: 3.485 ± 0.079
ArgGlu: 5.173 ± 0.094
ArgPhe: 2.614 ± 0.067
ArgGly: 4.392 ± 0.08
ArgHis: 2.067 ± 0.065
ArgIle: 3.749 ± 0.073
ArgLys: 3.079 ± 0.084
ArgLeu: 8.134 ± 0.126
ArgMet: 2.174 ± 0.049
ArgAsn: 2.055 ± 0.049
ArgPro: 3.253 ± 0.081
ArgGln: 3.346 ± 0.075
ArgArg: 5.203 ± 0.093
ArgSer: 3.452 ± 0.079
ArgThr: 3.145 ± 0.073
ArgVal: 4.392 ± 0.085
ArgTrp: 0.894 ± 0.039
ArgTyr: 1.82 ± 0.048
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.899 ± 0.085
SerCys: 0.89 ± 0.036
SerAsp: 2.217 ± 0.052
SerGlu: 2.418 ± 0.057
SerPhe: 2.427 ± 0.059
SerGly: 5.77 ± 0.105
SerHis: 1.255 ± 0.041
SerIle: 2.483 ± 0.061
SerLys: 1.553 ± 0.053
SerLeu: 7.117 ± 0.111
SerMet: 1.497 ± 0.047
SerAsn: 1.267 ± 0.048
SerPro: 3.442 ± 0.07
SerGln: 1.636 ± 0.049
SerArg: 4.105 ± 0.075
SerSer: 3.191 ± 0.072
SerThr: 2.551 ± 0.065
SerVal: 4.316 ± 0.072
SerTrp: 0.725 ± 0.029
SerTyr: 1.16 ± 0.041
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.387 ± 0.089
ThrCys: 0.659 ± 0.027
ThrAsp: 2.487 ± 0.072
ThrGlu: 2.586 ± 0.068
ThrPhe: 1.835 ± 0.06
ThrGly: 4.672 ± 0.104
ThrHis: 1.007 ± 0.034
ThrIle: 2.055 ± 0.063
ThrLys: 1.279 ± 0.047
ThrLeu: 5.476 ± 0.103
ThrMet: 0.911 ± 0.033
ThrAsn: 1.19 ± 0.048
ThrPro: 3.344 ± 0.074
ThrGln: 1.354 ± 0.045
ThrArg: 2.954 ± 0.064
ThrSer: 2.565 ± 0.058
ThrThr: 2.182 ± 0.069
ThrVal: 3.926 ± 0.089
ThrTrp: 0.544 ± 0.027
ThrTyr: 1.105 ± 0.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.881 ± 0.111
ValCys: 1.363 ± 0.046
ValAsp: 3.765 ± 0.073
ValGlu: 4.238 ± 0.078
ValPhe: 3.003 ± 0.071
ValGly: 5.031 ± 0.085
ValHis: 1.528 ± 0.04
ValIle: 3.071 ± 0.067
ValLys: 2.079 ± 0.064
ValLeu: 8.423 ± 0.117
ValMet: 1.843 ± 0.052
ValAsn: 2.056 ± 0.061
ValPro: 3.55 ± 0.068
ValGln: 2.423 ± 0.05
ValArg: 5.651 ± 0.108
ValSer: 4.2 ± 0.083
ValThr: 3.555 ± 0.089
ValVal: 5.075 ± 0.094
ValTrp: 0.995 ± 0.035
ValTyr: 1.599 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.144 ± 0.04
TrpCys: 0.196 ± 0.017
TrpAsp: 0.591 ± 0.028
TrpGlu: 0.739 ± 0.027
TrpPhe: 0.473 ± 0.028
TrpGly: 1.015 ± 0.033
TrpHis: 0.357 ± 0.023
TrpIle: 0.554 ± 0.032
TrpLys: 0.61 ± 0.03
TrpLeu: 1.807 ± 0.056
TrpMet: 0.324 ± 0.024
TrpAsn: 0.442 ± 0.024
TrpPro: 0.687 ± 0.03
TrpGln: 0.733 ± 0.031
TrpArg: 1.109 ± 0.038
TrpSer: 0.639 ± 0.025
TrpThr: 0.569 ± 0.028
TrpVal: 0.639 ± 0.032
TrpTrp: 0.216 ± 0.017
TrpTyr: 0.322 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.731 ± 0.06
TyrCys: 0.422 ± 0.025
TyrAsp: 1.258 ± 0.041
TyrGlu: 1.214 ± 0.039
TyrPhe: 1.018 ± 0.04
TyrGly: 2.218 ± 0.061
TyrHis: 0.552 ± 0.028
TyrIle: 1.035 ± 0.039
TyrLys: 0.828 ± 0.036
TyrLeu: 2.365 ± 0.06
TyrMet: 0.536 ± 0.025
TyrAsn: 0.771 ± 0.033
TyrPro: 1.148 ± 0.038
TyrGln: 0.708 ± 0.033
TyrArg: 1.723 ± 0.049
TyrSer: 1.37 ± 0.044
TyrThr: 1.355 ± 0.057
TyrVal: 1.551 ± 0.042
TyrTrp: 0.355 ± 0.023
TyrTyr: 0.73 ± 0.031
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2373 proteins (758626 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
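A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function name `bootstrap_error`, the use of the standard deviation as the reported error, overlapping pair counting, and the toy sequences are all assumptions for illustration; only the 100 resamples and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide's frequency.

    Assumption: proteins are resampled with replacement and the error is
    the standard deviation over the bootstrap estimates.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MAAL", "AAG", "GLLA", "AAAA"]
mean_aa, err_aa = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.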

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski