Amino acid dipeptide frequency for Selenomonas sputigena (strain ATCC 35185 / DSM 20758 / VPI D19B-28)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
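The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's actual pipeline: the function names (`dipeptide_permille`, `label`) are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that "expected" means the uniform 1/400 baseline.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs never span two proteins, and non-standard
    residues are collapsed into 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def label(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency relative to the (assumed uniform) expectation."""
    if freq_permille > expected:
        return "over"   # shown in red on the original page
    if freq_permille < expected:
        return "under"  # shown in blue on the original page
    return "expected"


freqs = dipeptide_permille(["AAA", "AC"])  # pairs: AA, AA, AC
for dp, value in sorted(freqs.items()):
    print(dp, round(value, 3), label(value))
```

With the table values, `label(14.796)` classifies AlaAla as overrepresented and `label(0.214)` classifies CysCys as underrepresented under this uniform baseline. A baseline built from the observed single amino acid frequencies would give different labels.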

All values are presented in per mille (‰); multiply by 10⁻³ to obtain fractions.
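As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction, a percentage, and a fold change relative to the uniform 2.5‰ baseline. The helper names are hypothetical, and the uniform baseline is an assumption about what "expected" means here.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille under the uniform assumption


def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


def fold_vs_uniform(value_permille):
    """Ratio of an observed frequency to the uniform 1/400 baseline."""
    return value_permille / UNIFORM_PERMILLE


ala_ala = 14.796  # AlaAla value taken from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
print(fold_vs_uniform(ala_ala))
```

For AlaAla this gives a fraction of roughly 0.0148, about 1.48%, and close to six times the uniform baseline.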

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.796AlaAla: 14.796 ± 0.24
1.146AlaCys: 1.146 ± 0.037
5.963AlaAsp: 5.963 ± 0.107
8.182AlaGlu: 8.182 ± 0.161
3.937AlaPhe: 3.937 ± 0.089
8.725AlaGly: 8.725 ± 0.158
2.289AlaHis: 2.289 ± 0.056
5.518AlaIle: 5.518 ± 0.107
5.873AlaLys: 5.873 ± 0.1
11.027AlaLeu: 11.027 ± 0.181
3.049AlaMet: 3.049 ± 0.071
2.891AlaAsn: 2.891 ± 0.091
3.641AlaPro: 3.641 ± 0.079
3.357AlaGln: 3.357 ± 0.075
5.751AlaArg: 5.751 ± 0.116
5.424AlaSer: 5.424 ± 0.112
4.154AlaThr: 4.154 ± 0.12
8.097AlaVal: 8.097 ± 0.107
0.996AlaTrp: 0.996 ± 0.041
2.952AlaTyr: 2.952 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
1.034CysAla: 1.034 ± 0.039
0.214CysCys: 0.214 ± 0.018
0.554CysAsp: 0.554 ± 0.031
0.55CysGlu: 0.55 ± 0.032
0.452CysPhe: 0.452 ± 0.022
1.161CysGly: 1.161 ± 0.049
0.306CysHis: 0.306 ± 0.021
0.667CysIle: 0.667 ± 0.029
0.445CysLys: 0.445 ± 0.027
1.084CysLeu: 1.084 ± 0.049
0.333CysMet: 0.333 ± 0.023
0.319CysAsn: 0.319 ± 0.022
0.541CysPro: 0.541 ± 0.028
0.284CysGln: 0.284 ± 0.02
0.947CysArg: 0.947 ± 0.038
0.619CysSer: 0.619 ± 0.033
0.563CysThr: 0.563 ± 0.027
0.773CysVal: 0.773 ± 0.035
0.108CysTrp: 0.108 ± 0.013
0.373CysTyr: 0.373 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
6.265AspAla: 6.265 ± 0.11
0.591AspCys: 0.591 ± 0.029
2.884AspAsp: 2.884 ± 0.063
4.402AspGlu: 4.402 ± 0.088
2.859AspPhe: 2.859 ± 0.059
4.586AspGly: 4.586 ± 0.105
0.937AspHis: 0.937 ± 0.038
3.778AspIle: 3.778 ± 0.075
2.653AspLys: 2.653 ± 0.073
4.824AspLeu: 4.824 ± 0.086
1.696AspMet: 1.696 ± 0.041
1.343AspAsn: 1.343 ± 0.049
1.841AspPro: 1.841 ± 0.051
0.826AspGln: 0.826 ± 0.034
2.567AspArg: 2.567 ± 0.068
2.418AspSer: 2.418 ± 0.063
2.657AspThr: 2.657 ± 0.069
4.563AspVal: 4.563 ± 0.077
0.681AspTrp: 0.681 ± 0.036
2.156AspTyr: 2.156 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
8.219GluAla: 8.219 ± 0.152
0.594GluCys: 0.594 ± 0.032
3.847GluAsp: 3.847 ± 0.076
7.094GluGlu: 7.094 ± 0.161
2.083GluPhe: 2.083 ± 0.049
5.077GluGly: 5.077 ± 0.093
1.489GluHis: 1.489 ± 0.045
4.754GluIle: 4.754 ± 0.083
6.16GluLys: 6.16 ± 0.116
6.386GluLeu: 6.386 ± 0.106
2.292GluMet: 2.292 ± 0.058
3.0GluAsn: 3.0 ± 0.07
2.169GluPro: 2.169 ± 0.068
2.492GluGln: 2.492 ± 0.076
5.342GluArg: 5.342 ± 0.1
3.297GluSer: 3.297 ± 0.068
3.591GluThr: 3.591 ± 0.081
4.708GluVal: 4.708 ± 0.099
0.527GluTrp: 0.527 ± 0.026
2.009GluTyr: 2.009 ± 0.061
0.0GluXaa: 0.0 ± 0.0
Phe
4.057PheAla: 4.057 ± 0.082
0.66PheCys: 0.66 ± 0.031
2.428PheAsp: 2.428 ± 0.059
2.068PheGlu: 2.068 ± 0.062
2.229PhePhe: 2.229 ± 0.068
3.283PheGly: 3.283 ± 0.065
0.826PheHis: 0.826 ± 0.031
2.257PheIle: 2.257 ± 0.068
1.432PheLys: 1.432 ± 0.041
4.422PheLeu: 4.422 ± 0.1
0.971PheMet: 0.971 ± 0.04
1.113PheAsn: 1.113 ± 0.039
1.383PhePro: 1.383 ± 0.046
1.115PheGln: 1.115 ± 0.042
2.127PheArg: 2.127 ± 0.056
3.018PheSer: 3.018 ± 0.073
1.929PheThr: 1.929 ± 0.047
2.851PheVal: 2.851 ± 0.079
0.434PheTrp: 0.434 ± 0.024
1.32PheTyr: 1.32 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
7.933GlyAla: 7.933 ± 0.156
1.039GlyCys: 1.039 ± 0.044
3.88GlyAsp: 3.88 ± 0.088
5.296GlyGlu: 5.296 ± 0.095
3.204GlyPhe: 3.204 ± 0.069
6.119GlyGly: 6.119 ± 0.191
1.479GlyHis: 1.479 ± 0.047
5.138GlyIle: 5.138 ± 0.093
5.079GlyLys: 5.079 ± 0.1
6.432GlyLeu: 6.432 ± 0.102
2.524GlyMet: 2.524 ± 0.067
2.519GlyAsn: 2.519 ± 0.163
1.388GlyPro: 1.388 ± 0.046
1.902GlyGln: 1.902 ± 0.054
4.681GlyArg: 4.681 ± 0.083
4.247GlySer: 4.247 ± 0.132
4.782GlyThr: 4.782 ± 0.165
5.862GlyVal: 5.862 ± 0.099
0.871GlyTrp: 0.871 ± 0.042
2.521GlyTyr: 2.521 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
2.068HisAla: 2.068 ± 0.056
0.275HisCys: 0.275 ± 0.019
1.166HisAsp: 1.166 ± 0.042
1.498HisGlu: 1.498 ± 0.044
0.915HisPhe: 0.915 ± 0.038
1.707HisGly: 1.707 ± 0.049
0.499HisHis: 0.499 ± 0.031
1.51HisIle: 1.51 ± 0.047
0.837HisLys: 0.837 ± 0.036
1.941HisLeu: 1.941 ± 0.05
0.482HisMet: 0.482 ± 0.027
0.579HisAsn: 0.579 ± 0.027
1.027HisPro: 1.027 ± 0.037
0.428HisGln: 0.428 ± 0.024
1.144HisArg: 1.144 ± 0.033
0.909HisSer: 0.909 ± 0.034
1.178HisThr: 1.178 ± 0.045
1.5HisVal: 1.5 ± 0.041
0.202HisTrp: 0.202 ± 0.019
0.695HisTyr: 0.695 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
6.334IleAla: 6.334 ± 0.113
0.764IleCys: 0.764 ± 0.035
3.446IleAsp: 3.446 ± 0.069
4.572IleGlu: 4.572 ± 0.079
2.541IlePhe: 2.541 ± 0.08
4.612IleGly: 4.612 ± 0.094
1.192IleHis: 1.192 ± 0.044
2.972IleIle: 2.972 ± 0.076
2.654IleLys: 2.654 ± 0.068
5.566IleLeu: 5.566 ± 0.1
1.367IleMet: 1.367 ± 0.049
1.752IleAsn: 1.752 ± 0.058
2.361IlePro: 2.361 ± 0.059
1.401IleGln: 1.401 ± 0.046
3.139IleArg: 3.139 ± 0.064
3.257IleSer: 3.257 ± 0.071
2.945IleThr: 2.945 ± 0.078
4.82IleVal: 4.82 ± 0.089
0.438IleTrp: 0.438 ± 0.023
1.974IleTyr: 1.974 ± 0.052
0.0IleXaa: 0.0 ± 0.0
Lys
5.463LysAla: 5.463 ± 0.097
0.405LysCys: 0.405 ± 0.023
3.39LysAsp: 3.39 ± 0.08
5.168LysGlu: 5.168 ± 0.092
1.601LysPhe: 1.601 ± 0.043
3.961LysGly: 3.961 ± 0.101
0.895LysHis: 0.895 ± 0.034
3.539LysIle: 3.539 ± 0.078
4.749LysLys: 4.749 ± 0.112
4.579LysLeu: 4.579 ± 0.084
1.684LysMet: 1.684 ± 0.047
2.472LysAsn: 2.472 ± 0.08
1.936LysPro: 1.936 ± 0.057
1.427LysGln: 1.427 ± 0.042
3.38LysArg: 3.38 ± 0.07
2.96LysSer: 2.96 ± 0.062
3.006LysThr: 3.006 ± 0.072
3.406LysVal: 3.406 ± 0.083
0.416LysTrp: 0.416 ± 0.026
1.607LysTyr: 1.607 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
11.715LeuAla: 11.715 ± 0.197
1.218LeuCys: 1.218 ± 0.044
5.379LeuAsp: 5.379 ± 0.094
6.043LeuGlu: 6.043 ± 0.108
3.859LeuPhe: 3.859 ± 0.086
6.98LeuGly: 6.98 ± 0.126
2.145LeuHis: 2.145 ± 0.058
4.373LeuIle: 4.373 ± 0.095
5.073LeuLys: 5.073 ± 0.096
10.396LeuLeu: 10.396 ± 0.194
2.47LeuMet: 2.47 ± 0.059
2.625LeuAsn: 2.625 ± 0.07
4.575LeuPro: 4.575 ± 0.079
3.26LeuGln: 3.26 ± 0.068
5.61LeuArg: 5.61 ± 0.114
6.23LeuSer: 6.23 ± 0.098
5.105LeuThr: 5.105 ± 0.09
6.464LeuVal: 6.464 ± 0.115
0.883LeuTrp: 0.883 ± 0.038
2.808LeuTyr: 2.808 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
2.802MetAla: 2.802 ± 0.071
0.227MetCys: 0.227 ± 0.018
1.525MetAsp: 1.525 ± 0.046
2.378MetGlu: 2.378 ± 0.066
0.845MetPhe: 0.845 ± 0.031
2.033MetGly: 2.033 ± 0.062
0.543MetHis: 0.543 ± 0.029
1.376MetIle: 1.376 ± 0.047
2.129MetLys: 2.129 ± 0.054
2.69MetLeu: 2.69 ± 0.066
0.833MetMet: 0.833 ± 0.036
1.097MetAsn: 1.097 ± 0.044
1.326MetPro: 1.326 ± 0.044
1.263MetGln: 1.263 ± 0.045
1.797MetArg: 1.797 ± 0.054
1.447MetSer: 1.447 ± 0.047
1.712MetThr: 1.712 ± 0.05
1.496MetVal: 1.496 ± 0.052
0.159MetTrp: 0.159 ± 0.014
0.579MetTyr: 0.579 ± 0.03
0.0MetXaa: 0.0 ± 0.0
Asn
3.15AsnAla: 3.15 ± 0.105
0.371AsnCys: 0.371 ± 0.025
1.58AsnAsp: 1.58 ± 0.054
1.933AsnGlu: 1.933 ± 0.058
1.375AsnPhe: 1.375 ± 0.045
2.476AsnGly: 2.476 ± 0.086
0.624AsnHis: 0.624 ± 0.034
2.436AsnIle: 2.436 ± 0.066
1.395AsnLys: 1.395 ± 0.066
3.058AsnLeu: 3.058 ± 0.083
0.935AsnMet: 0.935 ± 0.034
0.952AsnAsn: 0.952 ± 0.058
1.626AsnPro: 1.626 ± 0.046
0.756AsnGln: 0.756 ± 0.036
1.583AsnArg: 1.583 ± 0.05
1.399AsnSer: 1.399 ± 0.056
1.655AsnThr: 1.655 ± 0.122
2.455AsnVal: 2.455 ± 0.083
0.325AsnTrp: 0.325 ± 0.023
1.194AsnTyr: 1.194 ± 0.062
0.0AsnXaa: 0.0 ± 0.0
Pro
3.863ProAla: 3.863 ± 0.082
0.418ProCys: 0.418 ± 0.022
2.151ProAsp: 2.151 ± 0.057
3.419ProGlu: 3.419 ± 0.071
1.617ProPhe: 1.617 ± 0.047
2.456ProGly: 2.456 ± 0.061
0.806ProHis: 0.806 ± 0.033
1.928ProIle: 1.928 ± 0.06
1.892ProLys: 1.892 ± 0.051
3.817ProLeu: 3.817 ± 0.087
0.951ProMet: 0.951 ± 0.034
1.093ProAsn: 1.093 ± 0.041
1.441ProPro: 1.441 ± 0.076
1.391ProGln: 1.391 ± 0.057
1.71ProArg: 1.71 ± 0.05
1.847ProSer: 1.847 ± 0.053
1.91ProThr: 1.91 ± 0.059
2.94ProVal: 2.94 ± 0.075
0.413ProTrp: 0.413 ± 0.024
1.243ProTyr: 1.243 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
3.117GlnAla: 3.117 ± 0.067
0.258GlnCys: 0.258 ± 0.019
1.473GlnAsp: 1.473 ± 0.051
2.802GlnGlu: 2.802 ± 0.068
0.846GlnPhe: 0.846 ± 0.034
2.226GlnGly: 2.226 ± 0.059
0.596GlnHis: 0.596 ± 0.031
1.695GlnIle: 1.695 ± 0.048
2.329GlnLys: 2.329 ± 0.067
2.272GlnLeu: 2.272 ± 0.054
0.994GlnMet: 0.994 ± 0.037
1.109GlnAsn: 1.109 ± 0.042
1.112GlnPro: 1.112 ± 0.05
1.322GlnGln: 1.322 ± 0.047
1.799GlnArg: 1.799 ± 0.06
1.53GlnSer: 1.53 ± 0.05
1.491GlnThr: 1.491 ± 0.048
1.657GlnVal: 1.657 ± 0.044
0.239GlnTrp: 0.239 ± 0.02
0.906GlnTyr: 0.906 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
5.848ArgAla: 5.848 ± 0.122
0.642ArgCys: 0.642 ± 0.027
2.838ArgAsp: 2.838 ± 0.07
5.182ArgGlu: 5.182 ± 0.096
2.414ArgPhe: 2.414 ± 0.06
3.774ArgGly: 3.774 ± 0.082
1.21ArgHis: 1.21 ± 0.035
3.955ArgIle: 3.955 ± 0.077
2.911ArgLys: 2.911 ± 0.068
6.184ArgLeu: 6.184 ± 0.137
1.835ArgMet: 1.835 ± 0.052
1.65ArgAsn: 1.65 ± 0.048
1.945ArgPro: 1.945 ± 0.055
1.897ArgGln: 1.897 ± 0.052
4.351ArgArg: 4.351 ± 0.116
2.831ArgSer: 2.831 ± 0.065
3.223ArgThr: 3.223 ± 0.078
3.749ArgVal: 3.749 ± 0.073
0.586ArgTrp: 0.586 ± 0.029
2.033ArgTyr: 2.033 ± 0.052
0.0ArgXaa: 0.0 ± 0.0
Ser
5.383SerAla: 5.383 ± 0.096
0.655SerCys: 0.655 ± 0.034
2.822SerAsp: 2.822 ± 0.059
3.196SerGlu: 3.196 ± 0.064
2.617SerPhe: 2.617 ± 0.066
4.531SerGly: 4.531 ± 0.109
1.133SerHis: 1.133 ± 0.037
3.11SerIle: 3.11 ± 0.067
2.196SerLys: 2.196 ± 0.062
5.841SerLeu: 5.841 ± 0.113
1.58SerMet: 1.58 ± 0.039
1.432SerAsn: 1.432 ± 0.048
2.06SerPro: 2.06 ± 0.05
1.294SerGln: 1.294 ± 0.044
3.183SerArg: 3.183 ± 0.069
2.98SerSer: 2.98 ± 0.071
2.532SerThr: 2.532 ± 0.088
4.09SerVal: 4.09 ± 0.069
0.566SerTrp: 0.566 ± 0.027
1.908SerTyr: 1.908 ± 0.055
0.0SerXaa: 0.0 ± 0.0
Thr
5.623ThrAla: 5.623 ± 0.13
0.449ThrCys: 0.449 ± 0.026
2.687ThrAsp: 2.687 ± 0.063
3.304ThrGlu: 3.304 ± 0.063
1.872ThrPhe: 1.872 ± 0.051
4.936ThrGly: 4.936 ± 0.197
1.002ThrHis: 1.002 ± 0.038
3.037ThrIle: 3.037 ± 0.075
2.537ThrLys: 2.537 ± 0.072
5.36ThrLeu: 5.36 ± 0.132
1.315ThrMet: 1.315 ± 0.042
1.626ThrAsn: 1.626 ± 0.087
2.385ThrPro: 2.385 ± 0.068
1.513ThrGln: 1.513 ± 0.05
2.474ThrArg: 2.474 ± 0.063
2.622ThrSer: 2.622 ± 0.074
2.617ThrThr: 2.617 ± 0.104
4.037ThrVal: 4.037 ± 0.122
0.428ThrTrp: 0.428 ± 0.024
1.545ThrTyr: 1.545 ± 0.055
0.0ThrXaa: 0.0 ± 0.0
Val
6.507ValAla: 6.507 ± 0.107
0.886ValCys: 0.886 ± 0.034
3.907ValAsp: 3.907 ± 0.089
5.072ValGlu: 5.072 ± 0.084
3.009ValPhe: 3.009 ± 0.068
4.914ValGly: 4.914 ± 0.108
1.548ValHis: 1.548 ± 0.054
3.896ValIle: 3.896 ± 0.09
3.713ValLys: 3.713 ± 0.084
7.356ValLeu: 7.356 ± 0.124
1.888ValMet: 1.888 ± 0.062
2.268ValAsn: 2.268 ± 0.071
3.126ValPro: 3.126 ± 0.071
2.264ValGln: 2.264 ± 0.05
4.664ValArg: 4.664 ± 0.082
4.008ValSer: 4.008 ± 0.082
4.101ValThr: 4.101 ± 0.102
5.332ValVal: 5.332 ± 0.105
0.607ValTrp: 0.607 ± 0.027
2.335ValTyr: 2.335 ± 0.068
0.0ValXaa: 0.0 ± 0.0
Trp
0.67TrpAla: 0.67 ± 0.036
0.113TrpCys: 0.113 ± 0.012
0.469TrpAsp: 0.469 ± 0.027
0.634TrpGlu: 0.634 ± 0.028
0.345TrpPhe: 0.345 ± 0.023
0.626TrpGly: 0.626 ± 0.033
0.263TrpHis: 0.263 ± 0.019
0.408TrpIle: 0.408 ± 0.025
0.602TrpLys: 0.602 ± 0.036
1.16TrpLeu: 1.16 ± 0.039
0.26TrpMet: 0.26 ± 0.018
0.393TrpAsn: 0.393 ± 0.023
0.228TrpPro: 0.228 ± 0.017
0.691TrpGln: 0.691 ± 0.036
0.692TrpArg: 0.692 ± 0.031
0.492TrpSer: 0.492 ± 0.028
0.441TrpThr: 0.441 ± 0.023
0.44TrpVal: 0.44 ± 0.023
0.105TrpTrp: 0.105 ± 0.013
0.321TrpTyr: 0.321 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.924TyrAla: 2.924 ± 0.058
0.401TyrCys: 0.401 ± 0.028
2.137TyrAsp: 2.137 ± 0.069
2.335TyrGlu: 2.335 ± 0.055
1.305TyrPhe: 1.305 ± 0.044
2.715TyrGly: 2.715 ± 0.116
0.778TyrHis: 0.778 ± 0.035
1.76TyrIle: 1.76 ± 0.051
1.386TyrLys: 1.386 ± 0.044
2.819TyrLeu: 2.819 ± 0.066
0.798TyrMet: 0.798 ± 0.036
1.045TyrAsn: 1.045 ± 0.042
1.196TyrPro: 1.196 ± 0.037
0.889TyrGln: 0.889 ± 0.035
2.116TyrArg: 2.116 ± 0.056
1.574TyrSer: 1.574 ± 0.051
1.776TyrThr: 1.776 ± 0.064
2.16TyrVal: 2.16 ± 0.051
0.365TyrTrp: 0.365 ± 0.021
1.116TyrTyr: 1.116 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2243 proteins (752766 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
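A protein-level bootstrap can be sketched as follows. The page does not describe the exact procedure, so this is an assumption-laden illustration: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the reported "±" is taken to be the standard deviation across replicates. The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    spread reflects protein-to-protein variation (assumed procedure).
    Requires n_boot >= 2 and at least one protein of length >= 2.
    """
    rng = random.Random(seed)
    # Precompute per-protein (hits, total pairs) so each replicate is cheap.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteome = ["AAAA", "ACAC", "AAC", "CCA"]  # toy data, not real proteins
mean, sd = bootstrap_permille(toy_proteome, "AA")
print(f"AA: {mean:.3f} ± {sd:.3f}")
```

Resampling at the protein level, rather than resampling individual dipeptides, keeps the pairs from one protein together, which is why the error tends to be larger for dipeptides concentrated in a few proteins.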

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski