Amino acid dipepetide frequency for Desulfonauticus submarinus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.856AlaAla: 3.856 ± 0.112
0.932AlaCys: 0.932 ± 0.044
2.409AlaAsp: 2.409 ± 0.065
3.513AlaGlu: 3.513 ± 0.09
2.722AlaPhe: 2.722 ± 0.07
4.299AlaGly: 4.299 ± 0.109
1.168AlaHis: 1.168 ± 0.039
4.937AlaIle: 4.937 ± 0.1
5.921AlaLys: 5.921 ± 0.119
7.481AlaLeu: 7.481 ± 0.122
1.351AlaMet: 1.351 ± 0.054
2.444AlaAsn: 2.444 ± 0.068
1.812AlaPro: 1.812 ± 0.061
2.274AlaGln: 2.274 ± 0.055
2.739AlaArg: 2.739 ± 0.073
3.51AlaSer: 3.51 ± 0.073
2.695AlaThr: 2.695 ± 0.075
3.664AlaVal: 3.664 ± 0.08
0.628AlaTrp: 0.628 ± 0.03
2.2AlaTyr: 2.2 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.805CysAla: 0.805 ± 0.039
0.215CysCys: 0.215 ± 0.02
0.461CysAsp: 0.461 ± 0.026
0.6CysGlu: 0.6 ± 0.031
0.785CysPhe: 0.785 ± 0.035
1.145CysGly: 1.145 ± 0.05
0.336CysHis: 0.336 ± 0.03
0.877CysIle: 0.877 ± 0.04
0.941CysLys: 0.941 ± 0.039
1.526CysLeu: 1.526 ± 0.047
0.218CysMet: 0.218 ± 0.018
0.401CysAsn: 0.401 ± 0.027
0.881CysPro: 0.881 ± 0.039
0.375CysGln: 0.375 ± 0.026
0.482CysArg: 0.482 ± 0.027
0.872CysSer: 0.872 ± 0.039
0.528CysThr: 0.528 ± 0.03
0.766CysVal: 0.766 ± 0.043
0.15CysTrp: 0.15 ± 0.013
0.433CysTyr: 0.433 ± 0.028
0.0CysXaa: 0.0 ± 0.0
Asp
2.292AspAla: 2.292 ± 0.063
0.519AspCys: 0.519 ± 0.029
1.78AspAsp: 1.78 ± 0.056
2.933AspGlu: 2.933 ± 0.071
3.2AspPhe: 3.2 ± 0.08
2.34AspGly: 2.34 ± 0.07
0.633AspHis: 0.633 ± 0.035
4.852AspIle: 4.852 ± 0.101
4.341AspLys: 4.341 ± 0.085
5.618AspLeu: 5.618 ± 0.102
0.887AspMet: 0.887 ± 0.038
2.033AspAsn: 2.033 ± 0.059
2.019AspPro: 2.019 ± 0.057
0.972AspGln: 0.972 ± 0.037
1.601AspArg: 1.601 ± 0.058
2.12AspSer: 2.12 ± 0.065
1.79AspThr: 1.79 ± 0.054
2.847AspVal: 2.847 ± 0.083
0.507AspTrp: 0.507 ± 0.029
1.717AspTyr: 1.717 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
4.226GluAla: 4.226 ± 0.101
0.651GluCys: 0.651 ± 0.029
3.498GluAsp: 3.498 ± 0.09
6.097GluGlu: 6.097 ± 0.134
2.856GluPhe: 2.856 ± 0.065
3.409GluGly: 3.409 ± 0.083
1.165GluHis: 1.165 ± 0.044
6.711GluIle: 6.711 ± 0.117
7.304GluLys: 7.304 ± 0.14
7.005GluLeu: 7.005 ± 0.118
1.631GluMet: 1.631 ± 0.051
3.676GluAsn: 3.676 ± 0.074
1.846GluPro: 1.846 ± 0.058
2.848GluGln: 2.848 ± 0.078
2.475GluArg: 2.475 ± 0.066
2.662GluSer: 2.662 ± 0.068
2.521GluThr: 2.521 ± 0.072
4.972GluVal: 4.972 ± 0.118
0.623GluTrp: 0.623 ± 0.034
2.239GluTyr: 2.239 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
2.793PheAla: 2.793 ± 0.079
0.872PheCys: 0.872 ± 0.035
2.354PheAsp: 2.354 ± 0.06
2.92PheGlu: 2.92 ± 0.074
3.538PhePhe: 3.538 ± 0.093
3.043PheGly: 3.043 ± 0.069
0.846PheHis: 0.846 ± 0.038
4.55PheIle: 4.55 ± 0.097
4.278PheLys: 4.278 ± 0.087
6.862PheLeu: 6.862 ± 0.128
0.97PheMet: 0.97 ± 0.038
2.377PheAsn: 2.377 ± 0.06
2.128PhePro: 2.128 ± 0.065
1.598PheGln: 1.598 ± 0.05
1.79PheArg: 1.79 ± 0.052
3.969PheSer: 3.969 ± 0.088
2.259PheThr: 2.259 ± 0.061
2.954PheVal: 2.954 ± 0.082
0.871PheTrp: 0.871 ± 0.037
2.177PheTyr: 2.177 ± 0.062
0.0PheXaa: 0.0 ± 0.0
Gly
3.965GlyAla: 3.965 ± 0.087
1.009GlyCys: 1.009 ± 0.05
2.73GlyAsp: 2.73 ± 0.069
4.229GlyGlu: 4.229 ± 0.103
3.456GlyPhe: 3.456 ± 0.076
4.267GlyGly: 4.267 ± 0.119
1.187GlyHis: 1.187 ± 0.044
5.555GlyIle: 5.555 ± 0.099
5.635GlyLys: 5.635 ± 0.091
6.974GlyLeu: 6.974 ± 0.104
1.367GlyMet: 1.367 ± 0.045
2.243GlyAsn: 2.243 ± 0.059
1.976GlyPro: 1.976 ± 0.066
2.101GlyGln: 2.101 ± 0.058
2.607GlyArg: 2.607 ± 0.065
3.148GlySer: 3.148 ± 0.076
2.93GlyThr: 2.93 ± 0.086
4.342GlyVal: 4.342 ± 0.103
0.691GlyTrp: 0.691 ± 0.027
2.492GlyTyr: 2.492 ± 0.067
0.0GlyXaa: 0.0 ± 0.0
His
0.921HisAla: 0.921 ± 0.039
0.243HisCys: 0.243 ± 0.02
0.63HisAsp: 0.63 ± 0.032
0.723HisGlu: 0.723 ± 0.037
1.029HisPhe: 1.029 ± 0.039
1.026HisGly: 1.026 ± 0.043
0.393HisHis: 0.393 ± 0.025
1.483HisIle: 1.483 ± 0.044
1.362HisLys: 1.362 ± 0.045
2.339HisLeu: 2.339 ± 0.057
0.29HisMet: 0.29 ± 0.021
0.705HisAsn: 0.705 ± 0.036
1.118HisPro: 1.118 ± 0.039
0.596HisGln: 0.596 ± 0.029
0.694HisArg: 0.694 ± 0.033
0.989HisSer: 0.989 ± 0.04
0.837HisThr: 0.837 ± 0.034
0.771HisVal: 0.771 ± 0.032
0.178HisTrp: 0.178 ± 0.018
0.732HisTyr: 0.732 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.128IleAla: 5.128 ± 0.111
1.155IleCys: 1.155 ± 0.041
4.018IleAsp: 4.018 ± 0.088
5.517IleGlu: 5.517 ± 0.093
5.056IlePhe: 5.056 ± 0.111
5.273IleGly: 5.273 ± 0.103
1.314IleHis: 1.314 ± 0.043
6.926IleIle: 6.926 ± 0.137
7.59IleLys: 7.59 ± 0.129
10.593IleLeu: 10.593 ± 0.151
1.383IleMet: 1.383 ± 0.046
3.96IleAsn: 3.96 ± 0.103
3.614IlePro: 3.614 ± 0.08
2.752IleGln: 2.752 ± 0.064
2.913IleArg: 2.913 ± 0.071
5.792IleSer: 5.792 ± 0.112
3.809IleThr: 3.809 ± 0.089
5.158IleVal: 5.158 ± 0.103
0.869IleTrp: 0.869 ± 0.04
3.303IleTyr: 3.303 ± 0.079
0.0IleXaa: 0.0 ± 0.0
Lys
5.27LysAla: 5.27 ± 0.109
0.946LysCys: 0.946 ± 0.045
4.758LysAsp: 4.758 ± 0.095
8.06LysGlu: 8.06 ± 0.141
3.653LysPhe: 3.653 ± 0.086
5.442LysGly: 5.442 ± 0.103
1.502LysHis: 1.502 ± 0.053
9.438LysIle: 9.438 ± 0.142
10.039LysLys: 10.039 ± 0.164
8.315LysLeu: 8.315 ± 0.136
1.964LysMet: 1.964 ± 0.048
5.647LysAsn: 5.647 ± 0.121
2.816LysPro: 2.816 ± 0.071
3.86LysGln: 3.86 ± 0.093
3.785LysArg: 3.785 ± 0.08
4.269LysSer: 4.269 ± 0.095
4.212LysThr: 4.212 ± 0.08
5.885LysVal: 5.885 ± 0.098
0.955LysTrp: 0.955 ± 0.042
3.117LysTyr: 3.117 ± 0.073
0.0LysXaa: 0.0 ± 0.0
Leu
8.143LeuAla: 8.143 ± 0.126
1.314LeuCys: 1.314 ± 0.048
5.439LeuAsp: 5.439 ± 0.109
8.488LeuGlu: 8.488 ± 0.151
5.434LeuPhe: 5.434 ± 0.119
7.679LeuGly: 7.679 ± 0.126
1.689LeuHis: 1.689 ± 0.06
8.282LeuIle: 8.282 ± 0.136
11.634LeuLys: 11.634 ± 0.158
10.865LeuLeu: 10.865 ± 0.16
1.721LeuMet: 1.721 ± 0.053
6.025LeuAsn: 6.025 ± 0.125
4.92LeuPro: 4.92 ± 0.093
3.613LeuGln: 3.613 ± 0.073
4.35LeuArg: 4.35 ± 0.083
7.395LeuSer: 7.395 ± 0.117
5.376LeuThr: 5.376 ± 0.086
6.262LeuVal: 6.262 ± 0.111
1.096LeuTrp: 1.096 ± 0.043
3.099LeuTyr: 3.099 ± 0.079
0.0LeuXaa: 0.0 ± 0.0
Met
1.738MetAla: 1.738 ± 0.052
0.207MetCys: 0.207 ± 0.018
0.958MetAsp: 0.958 ± 0.038
1.259MetGlu: 1.259 ± 0.048
0.829MetPhe: 0.829 ± 0.036
1.465MetGly: 1.465 ± 0.049
0.352MetHis: 0.352 ± 0.024
1.127MetIle: 1.127 ± 0.039
1.42MetLys: 1.42 ± 0.045
1.972MetLeu: 1.972 ± 0.059
0.361MetMet: 0.361 ± 0.025
0.742MetAsn: 0.742 ± 0.031
0.852MetPro: 0.852 ± 0.037
0.723MetGln: 0.723 ± 0.035
0.864MetArg: 0.864 ± 0.04
1.219MetSer: 1.219 ± 0.041
0.868MetThr: 0.868 ± 0.037
1.291MetVal: 1.291 ± 0.042
0.197MetTrp: 0.197 ± 0.017
0.484MetTyr: 0.484 ± 0.03
0.0MetXaa: 0.0 ± 0.0
Asn
2.315AsnAla: 2.315 ± 0.069
0.64AsnCys: 0.64 ± 0.034
1.488AsnAsp: 1.488 ± 0.052
2.211AsnGlu: 2.211 ± 0.067
3.132AsnPhe: 3.132 ± 0.079
2.185AsnGly: 2.185 ± 0.059
0.706AsnHis: 0.706 ± 0.032
5.125AsnIle: 5.125 ± 0.106
4.714AsnLys: 4.714 ± 0.106
6.384AsnLeu: 6.384 ± 0.128
0.995AsnMet: 0.995 ± 0.039
2.474AsnAsn: 2.474 ± 0.086
2.423AsnPro: 2.423 ± 0.068
1.531AsnGln: 1.531 ± 0.055
1.671AsnArg: 1.671 ± 0.056
2.707AsnSer: 2.707 ± 0.08
2.134AsnThr: 2.134 ± 0.059
2.36AsnVal: 2.36 ± 0.059
0.571AsnTrp: 0.571 ± 0.032
1.889AsnTyr: 1.889 ± 0.063
0.0AsnXaa: 0.0 ± 0.0
Pro
2.048ProAla: 2.048 ± 0.057
0.544ProCys: 0.544 ± 0.03
1.948ProAsp: 1.948 ± 0.056
3.165ProGlu: 3.165 ± 0.072
2.234ProPhe: 2.234 ± 0.065
2.555ProGly: 2.555 ± 0.077
0.785ProHis: 0.785 ± 0.035
2.954ProIle: 2.954 ± 0.077
3.674ProLys: 3.674 ± 0.087
4.299ProLeu: 4.299 ± 0.087
0.643ProMet: 0.643 ± 0.034
1.964ProAsn: 1.964 ± 0.053
1.466ProPro: 1.466 ± 0.051
1.474ProGln: 1.474 ± 0.051
1.342ProArg: 1.342 ± 0.041
2.317ProSer: 2.317 ± 0.065
1.82ProThr: 1.82 ± 0.061
2.523ProVal: 2.523 ± 0.069
0.521ProTrp: 0.521 ± 0.026
1.557ProTyr: 1.557 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
2.518GlnAla: 2.518 ± 0.066
0.341GlnCys: 0.341 ± 0.027
1.77GlnAsp: 1.77 ± 0.055
3.123GlnGlu: 3.123 ± 0.077
1.149GlnPhe: 1.149 ± 0.043
2.2GlnGly: 2.2 ± 0.059
0.534GlnHis: 0.534 ± 0.031
3.054GlnIle: 3.054 ± 0.073
4.284GlnLys: 4.284 ± 0.08
2.813GlnLeu: 2.813 ± 0.075
0.692GlnMet: 0.692 ± 0.034
2.077GlnAsn: 2.077 ± 0.06
0.977GlnPro: 0.977 ± 0.04
1.337GlnGln: 1.337 ± 0.057
1.333GlnArg: 1.333 ± 0.046
1.526GlnSer: 1.526 ± 0.05
1.572GlnThr: 1.572 ± 0.055
2.245GlnVal: 2.245 ± 0.06
0.224GlnTrp: 0.224 ± 0.021
0.866GlnTyr: 0.866 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
2.383ArgAla: 2.383 ± 0.062
0.482ArgCys: 0.482 ± 0.031
1.792ArgAsp: 1.792 ± 0.054
3.049ArgGlu: 3.049 ± 0.087
1.895ArgPhe: 1.895 ± 0.051
2.312ArgGly: 2.312 ± 0.063
0.688ArgHis: 0.688 ± 0.029
3.217ArgIle: 3.217 ± 0.08
3.423ArgLys: 3.423 ± 0.081
4.152ArgLeu: 4.152 ± 0.089
0.754ArgMet: 0.754 ± 0.035
1.578ArgAsn: 1.578 ± 0.054
1.552ArgPro: 1.552 ± 0.054
1.416ArgGln: 1.416 ± 0.052
1.743ArgArg: 1.743 ± 0.054
1.767ArgSer: 1.767 ± 0.051
1.545ArgThr: 1.545 ± 0.047
2.58ArgVal: 2.58 ± 0.065
0.402ArgTrp: 0.402 ± 0.029
1.468ArgTyr: 1.468 ± 0.045
0.0ArgXaa: 0.0 ± 0.0
Ser
3.005SerAla: 3.005 ± 0.07
0.771SerCys: 0.771 ± 0.036
2.01SerAsp: 2.01 ± 0.061
3.094SerGlu: 3.094 ± 0.074
3.601SerPhe: 3.601 ± 0.083
3.857SerGly: 3.857 ± 0.08
1.021SerHis: 1.021 ± 0.038
4.726SerIle: 4.726 ± 0.104
4.955SerLys: 4.955 ± 0.094
7.625SerLeu: 7.625 ± 0.131
1.064SerMet: 1.064 ± 0.045
2.492SerAsn: 2.492 ± 0.066
2.581SerPro: 2.581 ± 0.067
2.064SerGln: 2.064 ± 0.064
2.073SerArg: 2.073 ± 0.052
3.914SerSer: 3.914 ± 0.095
2.552SerThr: 2.552 ± 0.078
2.99SerVal: 2.99 ± 0.081
0.729SerTrp: 0.729 ± 0.031
2.051SerTyr: 2.051 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
2.607ThrAla: 2.607 ± 0.067
0.482ThrCys: 0.482 ± 0.029
1.801ThrAsp: 1.801 ± 0.047
2.434ThrGlu: 2.434 ± 0.069
2.176ThrPhe: 2.176 ± 0.063
3.366ThrGly: 3.366 ± 0.065
0.852ThrHis: 0.852 ± 0.038
3.899ThrIle: 3.899 ± 0.074
3.51ThrLys: 3.51 ± 0.078
5.155ThrLeu: 5.155 ± 0.098
0.749ThrMet: 0.749 ± 0.032
1.979ThrAsn: 1.979 ± 0.053
2.253ThrPro: 2.253 ± 0.056
1.459ThrGln: 1.459 ± 0.053
1.529ThrArg: 1.529 ± 0.05
2.758ThrSer: 2.758 ± 0.072
2.366ThrThr: 2.366 ± 0.066
2.408ThrVal: 2.408 ± 0.069
0.427ThrTrp: 0.427 ± 0.03
1.612ThrTyr: 1.612 ± 0.055
0.0ThrXaa: 0.0 ± 0.0
Val
3.946ValAla: 3.946 ± 0.085
0.946ValCys: 0.946 ± 0.041
3.143ValAsp: 3.143 ± 0.076
4.238ValGlu: 4.238 ± 0.094
3.267ValPhe: 3.267 ± 0.077
4.08ValGly: 4.08 ± 0.085
1.009ValHis: 1.009 ± 0.038
4.791ValIle: 4.791 ± 0.097
5.219ValLys: 5.219 ± 0.1
6.98ValLeu: 6.98 ± 0.11
1.159ValMet: 1.159 ± 0.046
2.673ValAsn: 2.673 ± 0.074
2.28ValPro: 2.28 ± 0.058
1.953ValGln: 1.953 ± 0.062
2.414ValArg: 2.414 ± 0.065
3.63ValSer: 3.63 ± 0.085
1.975ValThr: 1.975 ± 0.066
4.433ValVal: 4.433 ± 0.1
0.614ValTrp: 0.614 ± 0.034
2.14ValTyr: 2.14 ± 0.064
0.0ValXaa: 0.0 ± 0.0
Trp
0.649TrpAla: 0.649 ± 0.035
0.103TrpCys: 0.103 ± 0.012
0.56TrpAsp: 0.56 ± 0.031
0.849TrpGlu: 0.849 ± 0.035
0.553TrpPhe: 0.553 ± 0.031
0.706TrpGly: 0.706 ± 0.035
0.229TrpHis: 0.229 ± 0.019
0.917TrpIle: 0.917 ± 0.042
0.84TrpLys: 0.84 ± 0.037
1.265TrpLeu: 1.265 ± 0.051
0.203TrpMet: 0.203 ± 0.015
0.451TrpAsn: 0.451 ± 0.025
0.567TrpPro: 0.567 ± 0.028
0.547TrpGln: 0.547 ± 0.026
0.451TrpArg: 0.451 ± 0.03
0.504TrpSer: 0.504 ± 0.029
0.427TrpThr: 0.427 ± 0.026
0.579TrpVal: 0.579 ± 0.03
0.184TrpTrp: 0.184 ± 0.02
0.275TrpTyr: 0.275 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.947TyrAla: 1.947 ± 0.061
0.427TyrCys: 0.427 ± 0.022
1.454TyrAsp: 1.454 ± 0.052
1.913TyrGlu: 1.913 ± 0.058
2.586TyrPhe: 2.586 ± 0.068
2.254TyrGly: 2.254 ± 0.061
0.648TyrHis: 0.648 ± 0.03
2.702TyrIle: 2.702 ± 0.071
2.85TyrLys: 2.85 ± 0.081
4.6TyrLeu: 4.6 ± 0.097
0.522TyrMet: 0.522 ± 0.029
1.714TyrAsn: 1.714 ± 0.054
1.752TyrPro: 1.752 ± 0.045
1.145TyrGln: 1.145 ± 0.041
1.34TyrArg: 1.34 ± 0.048
2.107TyrSer: 2.107 ± 0.06
1.606TyrThr: 1.606 ± 0.055
1.863TyrVal: 1.863 ± 0.056
0.421TyrTrp: 0.421 ± 0.027
1.396TyrTyr: 1.396 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2044 proteins (651272 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski