Amino acid dipeptide frequency for Vagococcus penaei

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides occurred randomly in proteins, with every amino acid equally likely, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
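As an illustration of the calculation described above, here is a minimal Python sketch of how dipeptide frequencies could be counted and compared with the uniform expectation. It is not the database's own code. The function name `dipeptide_permille`, the toy sequences, and the choices to count overlapping pairs within each protein and to fold any non-standard letter into X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted inside each protein only (never across
    protein boundaries) and any non-standard letter is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for the 20 x 20 standard dipeptides: 1/400.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2  # 2.5 per mille = 0.25 %

# Made-up toy sequences, not taken from the proteome.
toy_proteins = ["MKLLA", "MLLK"]
freqs = dipeptide_permille(toy_proteins)

for pair, value in sorted(freqs.items()):
    status = "over" if value > EXPECTED_PERMILLE else "under"
    print(f"{pair}: {value:.3f} per mille ({status}represented)")
```

With only seven pairs in the toy input every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only on a full proteome.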

All values are presented in per mille (‰); to obtain fractions, they therefore need to be multiplied by 10^-3.

For more information, see the sequence space article on Wikipedia.
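The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical; the two sample values (LeuLeu 10.094 and CysCys 0.073) are taken from the table, and the 0.25% uniform expectation is the one stated in the text.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0

# Values copied from the table, in per mille.
leu_leu = 10.094
cys_cys = 0.073

print(permille_to_fraction(leu_leu))  # fraction of all dipeptides
print(permille_to_percent(leu_leu))   # same value in percent
print(permille_to_percent(cys_cys))

# The 0.25 % uniform expectation corresponds to 2.5 per mille,
# so table values can be compared with 2.5 directly.
expected_permille = 0.25 * 10.0
```

On this scale LeuLeu is roughly four times the uniform expectation, while CysCys is far below it.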

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.722 ± 0.123
AlaCys: 0.634 ± 0.034
AlaAsp: 3.704 ± 0.073
AlaGlu: 4.102 ± 0.094
AlaPhe: 2.882 ± 0.07
AlaGly: 4.395 ± 0.107
AlaHis: 1.239 ± 0.042
AlaIle: 5.991 ± 0.113
AlaLys: 4.904 ± 0.093
AlaLeu: 6.736 ± 0.113
AlaMet: 1.981 ± 0.058
AlaAsn: 3.269 ± 0.085
AlaPro: 1.921 ± 0.065
AlaGln: 2.509 ± 0.062
AlaArg: 2.431 ± 0.055
AlaSer: 3.969 ± 0.086
AlaThr: 4.478 ± 0.091
AlaVal: 4.66 ± 0.093
AlaTrp: 0.506 ± 0.032
AlaTyr: 2.405 ± 0.064
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.381 ± 0.026
CysCys: 0.073 ± 0.01
CysAsp: 0.429 ± 0.03
CysGlu: 0.413 ± 0.025
CysPhe: 0.315 ± 0.024
CysGly: 0.639 ± 0.032
CysHis: 0.238 ± 0.02
CysIle: 0.407 ± 0.027
CysLys: 0.206 ± 0.019
CysLeu: 0.782 ± 0.037
CysMet: 0.153 ± 0.015
CysAsn: 0.177 ± 0.017
CysPro: 0.319 ± 0.024
CysGln: 0.432 ± 0.031
CysArg: 0.289 ± 0.022
CysSer: 0.397 ± 0.024
CysThr: 0.335 ± 0.024
CysVal: 0.466 ± 0.028
CysTrp: 0.052 ± 0.009
CysTyr: 0.267 ± 0.021
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.607 ± 0.08
AspCys: 0.359 ± 0.022
AspAsp: 3.212 ± 0.082
AspGlu: 4.057 ± 0.095
AspPhe: 2.624 ± 0.064
AspGly: 3.613 ± 0.082
AspHis: 0.943 ± 0.036
AspIle: 4.663 ± 0.091
AspLys: 3.83 ± 0.083
AspLeu: 5.535 ± 0.092
AspMet: 1.559 ± 0.051
AspAsn: 2.581 ± 0.074
AspPro: 1.605 ± 0.056
AspGln: 1.921 ± 0.061
AspArg: 1.835 ± 0.054
AspSer: 3.125 ± 0.072
AspThr: 3.203 ± 0.076
AspVal: 4.391 ± 0.09
AspTrp: 0.63 ± 0.027
AspTyr: 2.85 ± 0.08
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.919 ± 0.114
GluCys: 0.362 ± 0.027
GluAsp: 3.196 ± 0.087
GluGlu: 4.864 ± 0.105
GluPhe: 2.578 ± 0.062
GluGly: 3.429 ± 0.087
GluHis: 1.27 ± 0.043
GluIle: 4.543 ± 0.086
GluLys: 5.687 ± 0.097
GluLeu: 7.432 ± 0.129
GluMet: 2.095 ± 0.057
GluAsn: 3.18 ± 0.066
GluPro: 1.797 ± 0.057
GluGln: 3.151 ± 0.086
GluArg: 2.878 ± 0.085
GluSer: 3.741 ± 0.085
GluThr: 4.134 ± 0.086
GluVal: 4.435 ± 0.091
GluTrp: 0.61 ± 0.036
GluTyr: 1.91 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.71 ± 0.065
PheCys: 0.327 ± 0.022
PheAsp: 2.732 ± 0.072
PheGlu: 2.424 ± 0.067
PhePhe: 2.205 ± 0.052
PheGly: 3.194 ± 0.073
PheHis: 0.79 ± 0.036
PheIle: 3.755 ± 0.098
PheLys: 2.748 ± 0.067
PheLeu: 4.204 ± 0.091
PheMet: 1.171 ± 0.046
PheAsn: 2.358 ± 0.063
PhePro: 1.496 ± 0.051
PheGln: 1.43 ± 0.056
PheArg: 1.339 ± 0.053
PheSer: 3.275 ± 0.07
PheThr: 2.558 ± 0.066
PheVal: 2.994 ± 0.069
PheTrp: 0.385 ± 0.023
PheTyr: 1.816 ± 0.063
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.4 ± 0.102
GlyCys: 0.52 ± 0.027
GlyAsp: 3.196 ± 0.084
GlyGlu: 3.72 ± 0.066
GlyPhe: 3.13 ± 0.085
GlyGly: 4.363 ± 0.094
GlyHis: 1.209 ± 0.044
GlyIle: 5.705 ± 0.107
GlyLys: 4.436 ± 0.099
GlyLeu: 6.674 ± 0.124
GlyMet: 2.02 ± 0.062
GlyAsn: 2.549 ± 0.072
GlyPro: 1.519 ± 0.047
GlyGln: 2.708 ± 0.066
GlyArg: 2.425 ± 0.071
GlySer: 3.692 ± 0.083
GlyThr: 4.123 ± 0.072
GlyVal: 4.768 ± 0.087
GlyTrp: 0.573 ± 0.031
GlyTyr: 2.731 ± 0.062
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.169 ± 0.042
HisCys: 0.153 ± 0.014
HisAsp: 1.051 ± 0.038
HisGlu: 1.168 ± 0.038
HisPhe: 0.993 ± 0.038
HisGly: 1.262 ± 0.047
HisHis: 0.601 ± 0.028
HisIle: 1.32 ± 0.051
HisLys: 0.925 ± 0.039
HisLeu: 2.08 ± 0.055
HisMet: 0.448 ± 0.024
HisAsn: 0.688 ± 0.036
HisPro: 1.032 ± 0.041
HisGln: 1.015 ± 0.042
HisArg: 0.833 ± 0.04
HisSer: 1.087 ± 0.041
HisThr: 0.949 ± 0.039
HisVal: 1.288 ± 0.048
HisTrp: 0.173 ± 0.016
HisTyr: 0.94 ± 0.039
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.508 ± 0.11
IleCys: 0.683 ± 0.037
IleAsp: 5.075 ± 0.107
IleGlu: 5.121 ± 0.097
IlePhe: 3.443 ± 0.091
IleGly: 5.809 ± 0.116
IleHis: 1.336 ± 0.049
IleIle: 6.322 ± 0.112
IleLys: 4.97 ± 0.086
IleLeu: 7.386 ± 0.116
IleMet: 1.991 ± 0.055
IleAsn: 3.749 ± 0.083
IlePro: 3.231 ± 0.078
IleGln: 2.994 ± 0.065
IleArg: 2.71 ± 0.064
IleSer: 5.146 ± 0.104
IleThr: 4.631 ± 0.102
IleVal: 5.434 ± 0.118
IleTrp: 0.546 ± 0.031
IleTyr: 2.616 ± 0.072
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.259 ± 0.079
LysCys: 0.26 ± 0.019
LysAsp: 3.715 ± 0.091
LysGlu: 6.154 ± 0.106
LysPhe: 1.99 ± 0.054
LysGly: 3.753 ± 0.077
LysHis: 1.18 ± 0.043
LysIle: 4.97 ± 0.094
LysLys: 6.507 ± 0.125
LysLeu: 5.974 ± 0.1
LysMet: 2.097 ± 0.056
LysAsn: 3.793 ± 0.086
LysPro: 2.034 ± 0.061
LysGln: 3.457 ± 0.072
LysArg: 3.015 ± 0.073
LysSer: 3.544 ± 0.077
LysThr: 3.97 ± 0.09
LysVal: 4.498 ± 0.081
LysTrp: 0.607 ± 0.032
LysTyr: 2.184 ± 0.058
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.124 ± 0.119
LeuCys: 0.599 ± 0.034
LeuAsp: 5.69 ± 0.098
LeuGlu: 6.197 ± 0.097
LeuPhe: 4.722 ± 0.104
LeuGly: 6.426 ± 0.12
LeuHis: 1.493 ± 0.045
LeuIle: 7.967 ± 0.132
LeuLys: 6.437 ± 0.107
LeuLeu: 10.094 ± 0.177
LeuMet: 2.736 ± 0.066
LeuAsn: 4.569 ± 0.086
LeuPro: 4.221 ± 0.077
LeuGln: 3.148 ± 0.077
LeuArg: 3.264 ± 0.076
LeuSer: 7.096 ± 0.12
LeuThr: 7.432 ± 0.12
LeuVal: 7.277 ± 0.117
LeuTrp: 0.715 ± 0.034
LeuTyr: 3.09 ± 0.074
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.0 ± 0.055
MetCys: 0.144 ± 0.016
MetAsp: 1.336 ± 0.051
MetGlu: 1.527 ± 0.042
MetPhe: 0.993 ± 0.04
MetGly: 1.712 ± 0.051
MetHis: 0.382 ± 0.025
MetIle: 2.37 ± 0.059
MetLys: 2.092 ± 0.055
MetLeu: 2.529 ± 0.061
MetMet: 0.906 ± 0.043
MetAsn: 1.441 ± 0.047
MetPro: 0.981 ± 0.037
MetGln: 0.929 ± 0.036
MetArg: 0.989 ± 0.04
MetSer: 1.932 ± 0.056
MetThr: 2.465 ± 0.069
MetVal: 1.814 ± 0.06
MetTrp: 0.151 ± 0.016
MetTyr: 0.758 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.625 ± 0.06
AsnCys: 0.327 ± 0.025
AsnAsp: 2.691 ± 0.061
AsnGlu: 2.998 ± 0.073
AsnPhe: 1.965 ± 0.056
AsnGly: 3.128 ± 0.084
AsnHis: 1.232 ± 0.044
AsnIle: 3.141 ± 0.076
AsnLys: 3.009 ± 0.072
AsnLeu: 4.242 ± 0.078
AsnMet: 1.291 ± 0.047
AsnAsn: 2.309 ± 0.071
AsnPro: 2.052 ± 0.067
AsnGln: 3.287 ± 0.082
AsnArg: 2.124 ± 0.063
AsnSer: 2.375 ± 0.071
AsnThr: 2.366 ± 0.061
AsnVal: 2.891 ± 0.069
AsnTrp: 0.555 ± 0.037
AsnTyr: 2.097 ± 0.059
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.997 ± 0.055
ProCys: 0.18 ± 0.019
ProAsp: 2.083 ± 0.06
ProGlu: 2.697 ± 0.058
ProPhe: 1.753 ± 0.05
ProGly: 1.978 ± 0.065
ProHis: 0.625 ± 0.03
ProIle: 2.933 ± 0.063
ProLys: 2.369 ± 0.06
ProLeu: 3.356 ± 0.073
ProMet: 0.891 ± 0.036
ProAsn: 1.941 ± 0.054
ProPro: 0.617 ± 0.035
ProGln: 1.181 ± 0.044
ProArg: 0.877 ± 0.037
ProSer: 2.12 ± 0.053
ProThr: 2.482 ± 0.06
ProVal: 2.82 ± 0.07
ProTrp: 0.278 ± 0.024
ProTyr: 1.33 ± 0.046
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.35 ± 0.084
GlnCys: 0.225 ± 0.021
GlnAsp: 1.903 ± 0.05
GlnGlu: 3.029 ± 0.08
GlnPhe: 1.657 ± 0.052
GlnGly: 2.24 ± 0.064
GlnHis: 0.86 ± 0.039
GlnIle: 2.91 ± 0.057
GlnLys: 3.009 ± 0.069
GlnLeu: 4.985 ± 0.112
GlnMet: 1.134 ± 0.041
GlnAsn: 1.699 ± 0.05
GlnPro: 1.488 ± 0.047
GlnGln: 2.421 ± 0.078
GlnArg: 1.698 ± 0.052
GlnSer: 2.61 ± 0.069
GlnThr: 2.81 ± 0.077
GlnVal: 3.148 ± 0.076
GlnTrp: 0.374 ± 0.022
GlnTyr: 1.146 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.12 ± 0.055
ArgCys: 0.27 ± 0.019
ArgAsp: 2.069 ± 0.057
ArgGlu: 2.835 ± 0.069
ArgPhe: 1.748 ± 0.052
ArgGly: 1.994 ± 0.058
ArgHis: 0.865 ± 0.032
ArgIle: 2.668 ± 0.064
ArgLys: 2.506 ± 0.067
ArgLeu: 4.212 ± 0.08
ArgMet: 1.096 ± 0.044
ArgAsn: 1.487 ± 0.053
ArgPro: 1.311 ± 0.049
ArgGln: 2.107 ± 0.066
ArgArg: 1.797 ± 0.051
ArgSer: 1.777 ± 0.055
ArgThr: 1.809 ± 0.051
ArgVal: 2.613 ± 0.064
ArgTrp: 0.29 ± 0.021
ArgTyr: 1.522 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.547 ± 0.081
SerCys: 0.35 ± 0.024
SerAsp: 3.521 ± 0.076
SerGlu: 3.917 ± 0.087
SerPhe: 2.843 ± 0.066
SerGly: 4.572 ± 0.106
SerHis: 1.378 ± 0.048
SerIle: 4.739 ± 0.087
SerLys: 3.788 ± 0.075
SerLeu: 6.651 ± 0.119
SerMet: 1.573 ± 0.059
SerAsn: 2.627 ± 0.069
SerPro: 1.933 ± 0.05
SerGln: 2.928 ± 0.076
SerArg: 2.302 ± 0.069
SerSer: 3.779 ± 0.092
SerThr: 3.417 ± 0.097
SerVal: 4.212 ± 0.094
SerTrp: 0.567 ± 0.031
SerTyr: 2.302 ± 0.06
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.064 ± 0.08
ThrCys: 0.377 ± 0.027
ThrAsp: 3.851 ± 0.08
ThrGlu: 4.021 ± 0.087
ThrPhe: 2.74 ± 0.067
ThrGly: 4.287 ± 0.1
ThrHis: 1.351 ± 0.048
ThrIle: 5.459 ± 0.104
ThrLys: 3.9 ± 0.089
ThrLeu: 6.293 ± 0.091
ThrMet: 1.456 ± 0.045
ThrAsn: 3.192 ± 0.085
ThrPro: 2.621 ± 0.061
ThrGln: 2.114 ± 0.055
ThrArg: 1.967 ± 0.063
ThrSer: 3.75 ± 0.088
ThrThr: 4.218 ± 0.111
ThrVal: 4.429 ± 0.087
ThrTrp: 0.451 ± 0.028
ThrTyr: 2.46 ± 0.067
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.35 ± 0.104
ValCys: 0.564 ± 0.031
ValAsp: 4.044 ± 0.091
ValGlu: 4.273 ± 0.101
ValPhe: 2.94 ± 0.068
ValGly: 4.722 ± 0.103
ValHis: 1.038 ± 0.043
ValIle: 5.959 ± 0.102
ValLys: 4.368 ± 0.086
ValLeu: 6.808 ± 0.108
ValMet: 1.84 ± 0.053
ValAsn: 3.154 ± 0.078
ValPro: 2.713 ± 0.069
ValGln: 2.097 ± 0.059
ValArg: 2.23 ± 0.064
ValSer: 4.875 ± 0.085
ValThr: 5.191 ± 0.099
ValVal: 5.39 ± 0.094
ValTrp: 0.524 ± 0.028
ValTyr: 2.309 ± 0.06
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.388 ± 0.025
TrpCys: 0.05 ± 0.009
TrpAsp: 0.422 ± 0.027
TrpGlu: 0.539 ± 0.034
TrpPhe: 0.48 ± 0.032
TrpGly: 0.5 ± 0.028
TrpHis: 0.222 ± 0.018
TrpIle: 0.576 ± 0.034
TrpLys: 0.486 ± 0.026
TrpLeu: 1.132 ± 0.042
TrpMet: 0.24 ± 0.019
TrpAsn: 0.428 ± 0.028
TrpPro: 0.205 ± 0.019
TrpGln: 0.498 ± 0.026
TrpArg: 0.324 ± 0.023
TrpSer: 0.507 ± 0.032
TrpThr: 0.457 ± 0.03
TrpVal: 0.539 ± 0.028
TrpTrp: 0.119 ± 0.013
TrpTyr: 0.329 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.199 ± 0.065
TyrCys: 0.313 ± 0.024
TyrAsp: 2.222 ± 0.065
TyrGlu: 2.156 ± 0.065
TyrPhe: 1.909 ± 0.062
TyrGly: 2.444 ± 0.059
TyrHis: 0.922 ± 0.038
TyrIle: 2.303 ± 0.065
TyrLys: 1.783 ± 0.057
TyrLeu: 4.19 ± 0.089
TyrMet: 0.773 ± 0.033
TyrAsn: 1.514 ± 0.058
TyrPro: 1.411 ± 0.049
TyrGln: 2.369 ± 0.074
TyrArg: 1.774 ± 0.06
TyrSer: 2.149 ± 0.058
TyrThr: 1.973 ± 0.061
TyrVal: 2.321 ± 0.06
TyrTrp: 0.336 ± 0.022
TyrTyr: 1.632 ± 0.061
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2060 proteins (654354 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
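A protein-level bootstrap of this kind could look roughly like the Python sketch below. This is an illustration under stated assumptions, not the procedure actually used for the table: the function names, the fixed seed, the toy sequences, and the use of the standard deviation across resamples as the reported error are all choices made for this example.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides inside a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together (bootstrap at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(proteins))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(indices, k=len(proteins))
        hits = sum(per_protein[i][pair] for i in sample)
        total = sum(totals[i] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Made-up toy sequences, not taken from the proteome.
toy_proteins = ["MKLLA", "MLLK", "AAAA", "MKKLLLA"]
value, error = bootstrap_dipeptide(toy_proteins, "LL")
print(f"LeuLeu: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps within-protein correlations intact, which is presumably why the error is estimated at the protein level.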
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski