Amino acid dipeptide frequency for Desulfovibrio desulfuricans (strain ATCC 27774 / DSM 6949 / MB)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
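As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. It assumes one-letter amino acid codes, counts pairs only within a protein (never across protein boundaries), and maps any non-standard letter to X (Xaa). The function name and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs inside one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MAAL", "MALA"])
print(len(freqs))             # number of possible dipeptides, including Xaa
print(round(freqs["AL"], 3))  # per mille frequency of Ala-Leu in the toy set
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where unobserved Xaa pairs are listed as 0.0.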

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
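The uniform baseline and the over/under labelling can be sketched as follows. This assumes the comparison is made against the plain 1/400 expectation described above; the `classify` helper is hypothetical, and the three example values are copied from the table for this proteome.

```python
N_STANDARD = 20
# 1/400 = 0.25% = 2.5 per mille
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide as over- or underrepresented relative to uniform."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table for this proteome.
for name, value in [("AlaAla", 15.559), ("CysCys", 0.368), ("AspAsp", 2.49)]:
    print(name, value, classify(value))
```

Under this baseline AlaAla would fall on the red (overrepresented) side, while CysCys and AspAsp would fall on the blue (underrepresented) side.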

All values are presented in per mille (‰); to convert them to fractions, multiply by 10⁻³.
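A short worked conversion, using the AlaAla entry from the table as the example value; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 15.559  # AlaAla entry from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # 0.015559
print(f"{per_mille_to_percent(ala_ala):.4f}")   # 1.5559
```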

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.559 ± 0.224
AlaCys: 1.922 ± 0.052
AlaAsp: 5.828 ± 0.104
AlaGlu: 6.658 ± 0.117
AlaPhe: 3.73 ± 0.077
AlaGly: 9.87 ± 0.153
AlaHis: 2.425 ± 0.056
AlaIle: 4.373 ± 0.084
AlaLys: 3.34 ± 0.08
AlaLeu: 13.325 ± 0.175
AlaMet: 3.225 ± 0.068
AlaAsn: 2.478 ± 0.068
AlaPro: 5.485 ± 0.108
AlaGln: 4.349 ± 0.1
AlaArg: 7.974 ± 0.127
AlaSer: 5.73 ± 0.102
AlaThr: 4.485 ± 0.097
AlaVal: 8.421 ± 0.128
AlaTrp: 1.448 ± 0.045
AlaTyr: 2.333 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.703 ± 0.054
CysCys: 0.368 ± 0.024
CysAsp: 0.661 ± 0.032
CysGlu: 0.688 ± 0.032
CysPhe: 0.549 ± 0.032
CysGly: 1.672 ± 0.06
CysHis: 0.467 ± 0.034
CysIle: 0.793 ± 0.035
CysLys: 0.472 ± 0.023
CysLeu: 1.752 ± 0.055
CysMet: 0.433 ± 0.024
CysAsn: 0.5 ± 0.025
CysPro: 1.046 ± 0.043
CysGln: 0.387 ± 0.02
CysArg: 1.253 ± 0.037
CysSer: 0.831 ± 0.039
CysThr: 0.803 ± 0.032
CysVal: 1.078 ± 0.039
CysTrp: 0.2 ± 0.015
CysTyr: 0.402 ± 0.026
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.906 ± 0.095
AspCys: 0.779 ± 0.035
AspAsp: 2.49 ± 0.068
AspGlu: 3.104 ± 0.077
AspPhe: 2.207 ± 0.062
AspGly: 4.258 ± 0.103
AspHis: 1.052 ± 0.038
AspIle: 2.94 ± 0.066
AspLys: 2.043 ± 0.062
AspLeu: 5.248 ± 0.074
AspMet: 1.979 ± 0.051
AspAsn: 1.499 ± 0.051
AspPro: 2.51 ± 0.06
AspGln: 1.406 ± 0.041
AspArg: 2.83 ± 0.055
AspSer: 2.677 ± 0.06
AspThr: 2.453 ± 0.057
AspVal: 4.025 ± 0.072
AspTrp: 0.756 ± 0.033
AspTyr: 1.393 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.112 ± 0.107
GluCys: 0.62 ± 0.029
GluAsp: 3.161 ± 0.075
GluGlu: 4.122 ± 0.09
GluPhe: 1.666 ± 0.051
GluGly: 4.383 ± 0.068
GluHis: 1.48 ± 0.045
GluIle: 2.848 ± 0.06
GluLys: 3.477 ± 0.088
GluLeu: 5.509 ± 0.083
GluMet: 1.715 ± 0.049
GluAsn: 2.384 ± 0.054
GluPro: 2.293 ± 0.054
GluGln: 2.528 ± 0.057
GluArg: 3.828 ± 0.079
GluSer: 3.07 ± 0.067
GluThr: 2.625 ± 0.062
GluVal: 3.818 ± 0.076
GluTrp: 0.56 ± 0.027
GluTyr: 1.4 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.616 ± 0.063
PheCys: 0.766 ± 0.035
PheAsp: 1.983 ± 0.051
PheGlu: 1.784 ± 0.047
PhePhe: 1.895 ± 0.06
PheGly: 2.995 ± 0.06
PheHis: 0.687 ± 0.029
PheIle: 1.82 ± 0.053
PheLys: 1.243 ± 0.044
PheLeu: 3.68 ± 0.075
PheMet: 1.146 ± 0.04
PheAsn: 1.128 ± 0.048
PhePro: 1.541 ± 0.049
PheGln: 0.905 ± 0.036
PheArg: 1.984 ± 0.058
PheSer: 2.749 ± 0.06
PheThr: 2.173 ± 0.06
PheVal: 2.574 ± 0.062
PheTrp: 0.643 ± 0.032
PheTyr: 1.09 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.835 ± 0.117
GlyCys: 1.418 ± 0.049
GlyAsp: 3.678 ± 0.069
GlyGlu: 4.268 ± 0.077
GlyPhe: 3.151 ± 0.067
GlyGly: 7.109 ± 0.146
GlyHis: 2.095 ± 0.054
GlyIle: 4.613 ± 0.092
GlyLys: 4.214 ± 0.093
GlyLeu: 9.196 ± 0.146
GlyMet: 2.922 ± 0.066
GlyAsn: 2.567 ± 0.076
GlyPro: 3.339 ± 0.077
GlyGln: 3.728 ± 0.081
GlyArg: 5.641 ± 0.101
GlySer: 4.612 ± 0.101
GlyThr: 4.283 ± 0.111
GlyVal: 5.848 ± 0.093
GlyTrp: 1.039 ± 0.04
GlyTyr: 2.254 ± 0.06
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.359 ± 0.057
HisCys: 0.456 ± 0.023
HisAsp: 1.14 ± 0.038
HisGlu: 1.378 ± 0.042
HisPhe: 0.862 ± 0.032
HisGly: 1.835 ± 0.049
HisHis: 0.463 ± 0.027
HisIle: 1.181 ± 0.039
HisLys: 0.915 ± 0.035
HisLeu: 2.345 ± 0.054
HisMet: 0.82 ± 0.036
HisAsn: 0.739 ± 0.029
HisPro: 1.254 ± 0.04
HisGln: 0.552 ± 0.026
HisArg: 1.083 ± 0.043
HisSer: 1.26 ± 0.037
HisThr: 1.169 ± 0.039
HisVal: 1.541 ± 0.045
HisTrp: 0.364 ± 0.021
HisTyr: 0.59 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.389 ± 0.085
IleCys: 0.884 ± 0.032
IleAsp: 2.133 ± 0.065
IleGlu: 2.464 ± 0.063
IlePhe: 2.156 ± 0.055
IleGly: 3.248 ± 0.071
IleHis: 0.886 ± 0.033
IleIle: 2.623 ± 0.071
IleLys: 1.854 ± 0.048
IleLeu: 4.806 ± 0.086
IleMet: 1.513 ± 0.047
IleAsn: 1.663 ± 0.058
IlePro: 2.476 ± 0.054
IleGln: 1.194 ± 0.039
IleArg: 2.697 ± 0.058
IleSer: 3.218 ± 0.067
IleThr: 2.824 ± 0.075
IleVal: 3.415 ± 0.073
IleTrp: 0.582 ± 0.029
IleTyr: 1.216 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.474 ± 0.09
LysCys: 0.382 ± 0.024
LysAsp: 2.22 ± 0.068
LysGlu: 2.463 ± 0.059
LysPhe: 0.998 ± 0.043
LysGly: 3.23 ± 0.08
LysHis: 0.713 ± 0.031
LysIle: 1.963 ± 0.06
LysLys: 2.533 ± 0.078
LysLeu: 3.472 ± 0.073
LysMet: 1.132 ± 0.041
LysAsn: 1.772 ± 0.049
LysPro: 1.907 ± 0.049
LysGln: 1.28 ± 0.048
LysArg: 2.037 ± 0.055
LysSer: 2.169 ± 0.067
LysThr: 2.208 ± 0.063
LysVal: 2.53 ± 0.069
LysTrp: 0.294 ± 0.02
LysTyr: 0.974 ± 0.044
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.439 ± 0.182
LeuCys: 1.875 ± 0.053
LeuAsp: 5.912 ± 0.083
LeuGlu: 6.524 ± 0.106
LeuPhe: 3.757 ± 0.081
LeuGly: 8.456 ± 0.132
LeuHis: 2.482 ± 0.063
LeuIle: 3.734 ± 0.076
LeuLys: 3.71 ± 0.075
LeuLeu: 11.993 ± 0.173
LeuMet: 2.699 ± 0.064
LeuAsn: 2.789 ± 0.058
LeuPro: 6.759 ± 0.12
LeuGln: 3.316 ± 0.061
LeuArg: 7.814 ± 0.119
LeuSer: 6.188 ± 0.092
LeuThr: 5.493 ± 0.091
LeuVal: 7.255 ± 0.108
LeuTrp: 1.391 ± 0.047
LeuTyr: 2.379 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.536 ± 0.065
MetCys: 0.316 ± 0.02
MetAsp: 1.672 ± 0.044
MetGlu: 1.676 ± 0.047
MetPhe: 0.759 ± 0.034
MetGly: 2.61 ± 0.063
MetHis: 0.638 ± 0.029
MetIle: 1.005 ± 0.035
MetLys: 1.1 ± 0.04
MetLeu: 3.199 ± 0.077
MetMet: 0.545 ± 0.028
MetAsn: 0.949 ± 0.037
MetPro: 1.958 ± 0.048
MetGln: 1.176 ± 0.036
MetArg: 2.038 ± 0.051
MetSer: 1.72 ± 0.044
MetThr: 1.55 ± 0.043
MetVal: 1.9 ± 0.053
MetTrp: 0.222 ± 0.015
MetTyr: 0.54 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.336 ± 0.074
AsnCys: 0.496 ± 0.028
AsnAsp: 1.506 ± 0.063
AsnGlu: 1.452 ± 0.051
AsnPhe: 1.186 ± 0.041
AsnGly: 2.448 ± 0.076
AsnHis: 0.516 ± 0.024
AsnIle: 1.833 ± 0.064
AsnLys: 1.143 ± 0.043
AsnLeu: 3.066 ± 0.065
AsnMet: 0.983 ± 0.038
AsnAsn: 0.984 ± 0.041
AsnPro: 1.915 ± 0.054
AsnGln: 0.754 ± 0.034
AsnArg: 1.708 ± 0.047
AsnSer: 1.643 ± 0.054
AsnThr: 1.662 ± 0.053
AsnVal: 2.204 ± 0.061
AsnTrp: 0.367 ± 0.018
AsnTyr: 0.833 ± 0.035
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.15 ± 0.113
ProCys: 0.743 ± 0.034
ProAsp: 3.277 ± 0.067
ProGlu: 4.142 ± 0.078
ProPhe: 1.809 ± 0.051
ProGly: 5.089 ± 0.093
ProHis: 1.313 ± 0.046
ProIle: 1.536 ± 0.054
ProLys: 1.427 ± 0.053
ProLeu: 5.473 ± 0.094
ProMet: 1.185 ± 0.045
ProAsn: 1.137 ± 0.043
ProPro: 2.684 ± 0.071
ProGln: 2.369 ± 0.062
ProArg: 2.962 ± 0.071
ProSer: 2.579 ± 0.065
ProThr: 1.969 ± 0.06
ProVal: 4.401 ± 0.078
ProTrp: 0.751 ± 0.032
ProTyr: 1.238 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.481 ± 0.1
GlnCys: 0.575 ± 0.027
GlnAsp: 1.908 ± 0.055
GlnGlu: 2.225 ± 0.061
GlnPhe: 0.941 ± 0.036
GlnGly: 3.202 ± 0.063
GlnHis: 0.742 ± 0.036
GlnIle: 1.505 ± 0.047
GlnLys: 1.668 ± 0.05
GlnLeu: 2.915 ± 0.065
GlnMet: 1.069 ± 0.04
GlnAsn: 1.169 ± 0.037
GlnPro: 1.821 ± 0.052
GlnGln: 1.436 ± 0.049
GlnArg: 2.372 ± 0.059
GlnSer: 1.989 ± 0.049
GlnThr: 1.741 ± 0.051
GlnVal: 2.353 ± 0.052
GlnTrp: 0.656 ± 0.028
GlnTyr: 0.905 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.258 ± 0.097
ArgCys: 0.94 ± 0.041
ArgAsp: 3.143 ± 0.062
ArgGlu: 4.295 ± 0.08
ArgPhe: 2.376 ± 0.052
ArgGly: 4.191 ± 0.085
ArgHis: 1.879 ± 0.051
ArgIle: 3.446 ± 0.078
ArgLys: 2.936 ± 0.061
ArgLeu: 7.648 ± 0.126
ArgMet: 2.11 ± 0.052
ArgAsn: 2.102 ± 0.052
ArgPro: 3.272 ± 0.081
ArgGln: 3.212 ± 0.073
ArgArg: 4.72 ± 0.099
ArgSer: 3.153 ± 0.077
ArgThr: 2.793 ± 0.066
ArgVal: 4.094 ± 0.077
ArgTrp: 0.79 ± 0.036
ArgTyr: 1.706 ± 0.048
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.82 ± 0.104
SerCys: 0.872 ± 0.04
SerAsp: 2.382 ± 0.056
SerGlu: 2.569 ± 0.067
SerPhe: 2.249 ± 0.062
SerGly: 5.903 ± 0.108
SerHis: 1.197 ± 0.039
SerIle: 2.779 ± 0.068
SerLys: 1.677 ± 0.051
SerLeu: 6.594 ± 0.098
SerMet: 1.651 ± 0.047
SerAsn: 1.314 ± 0.049
SerPro: 3.146 ± 0.064
SerGln: 1.743 ± 0.044
SerArg: 3.931 ± 0.077
SerSer: 3.383 ± 0.092
SerThr: 2.671 ± 0.065
SerVal: 3.949 ± 0.059
SerTrp: 0.722 ± 0.036
SerTyr: 1.347 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.928 ± 0.112
ThrCys: 0.689 ± 0.031
ThrAsp: 2.507 ± 0.07
ThrGlu: 2.527 ± 0.065
ThrPhe: 1.807 ± 0.047
ThrGly: 4.974 ± 0.092
ThrHis: 1.008 ± 0.035
ThrIle: 2.282 ± 0.078
ThrLys: 1.251 ± 0.041
ThrLeu: 5.406 ± 0.102
ThrMet: 1.088 ± 0.042
ThrAsn: 1.239 ± 0.056
ThrPro: 3.252 ± 0.066
ThrGln: 1.403 ± 0.039
ThrArg: 2.809 ± 0.061
ThrSer: 2.628 ± 0.074
ThrThr: 2.356 ± 0.063
ThrVal: 4.057 ± 0.095
ThrTrp: 0.536 ± 0.025
ThrTyr: 1.159 ± 0.051
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.349 ± 0.107
ValCys: 1.319 ± 0.047
ValAsp: 3.961 ± 0.075
ValGlu: 4.349 ± 0.08
ValPhe: 2.78 ± 0.061
ValGly: 5.141 ± 0.086
ValHis: 1.397 ± 0.04
ValIle: 3.373 ± 0.071
ValLys: 2.291 ± 0.062
ValLeu: 8.029 ± 0.113
ValMet: 1.964 ± 0.053
ValAsn: 2.286 ± 0.058
ValPro: 3.675 ± 0.074
ValGln: 2.475 ± 0.063
ValArg: 4.824 ± 0.087
ValSer: 4.234 ± 0.073
ValThr: 3.773 ± 0.082
ValVal: 5.165 ± 0.109
ValTrp: 0.882 ± 0.041
ValTyr: 1.746 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.09 ± 0.043
TrpCys: 0.179 ± 0.016
TrpAsp: 0.624 ± 0.028
TrpGlu: 0.566 ± 0.025
TrpPhe: 0.486 ± 0.025
TrpGly: 0.978 ± 0.041
TrpHis: 0.359 ± 0.021
TrpIle: 0.517 ± 0.027
TrpLys: 0.498 ± 0.026
TrpLeu: 1.89 ± 0.058
TrpMet: 0.294 ± 0.019
TrpAsn: 0.506 ± 0.026
TrpPro: 0.723 ± 0.032
TrpGln: 0.749 ± 0.031
TrpArg: 1.028 ± 0.043
TrpSer: 0.586 ± 0.027
TrpThr: 0.523 ± 0.026
TrpVal: 0.65 ± 0.029
TrpTrp: 0.222 ± 0.018
TrpTyr: 0.339 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.617 ± 0.052
TyrCys: 0.457 ± 0.025
TyrAsp: 1.369 ± 0.049
TyrGlu: 1.343 ± 0.041
TyrPhe: 1.037 ± 0.037
TyrGly: 2.217 ± 0.067
TyrHis: 0.508 ± 0.026
TyrIle: 1.079 ± 0.035
TyrLys: 0.958 ± 0.034
TyrLeu: 2.419 ± 0.056
TyrMet: 0.636 ± 0.032
TyrAsn: 0.832 ± 0.037
TyrPro: 1.176 ± 0.04
TyrGln: 0.713 ± 0.031
TyrArg: 1.529 ± 0.047
TyrSer: 1.48 ± 0.042
TyrThr: 1.354 ± 0.054
TyrVal: 1.733 ± 0.044
TyrTrp: 0.384 ± 0.023
TyrTyr: 0.766 ± 0.038
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2345 proteins (796692 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
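A minimal sketch of a protein-level bootstrap follows. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates; the source does not state which statistic is used, so that last point is an assumption. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(replicates)


toy_proteome = ["MAALAA", "MKKLAA", "MAAAAG", "MGLLKV"]  # made-up sequences
point = dipeptide_frequency(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps the residues of each protein together, so the spread reflects variation between proteins.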

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski