Amino acid dipeptide frequency for Mageeibacillus indolicus (strain UPII9-5) (Clostridiales genomosp. BVAB3 (strain UPII9-5))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and all underrepresented ones are marked in blue.
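As a minimal sketch of the calculation described above, the following Python snippet counts overlapping dipeptides within each protein and compares each frequency with the uniform expectation of 2.5‰. The function names (`dipeptide_frequencies`, `classify`) and the use of one-letter codes are assumptions made for illustration; they are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)
UNIFORM_EXPECTATION = 1000.0 / 400    # 2.5 per mille for each of the 400 dipeptides


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Overlapping pairs are counted within each protein only, never across
    protein boundaries. Residues outside the 20 standard ones become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["AAAC", "AC"])  # toy input, not real data
    print(freqs["AA"], classify(freqs["AA"]))
```

With values from the table, `classify(11.544)` labels AlaAla as overrepresented and `classify(0.294)` labels CysCys as underrepresented.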

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
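To make the unit conversion concrete, here is a small sketch that turns a per mille value into a fraction and into an approximate dipeptide count. It uses the protein and residue totals reported on this page and assumes that every adjacent residue pair within a protein is counted once, so that the number of dipeptides equals the number of residues minus the number of proteins; that assumption and the helper name `per_mille_to_fraction` are illustrative, not part of the source.

```python
N_PROTEINS = 1566    # from the statistics line on this page
N_RESIDUES = 537427  # from the statistics line on this page


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# Assumption: one dipeptide per adjacent pair inside each protein,
# so a protein of length n contributes n - 1 dipeptides.
n_dipeptides = N_RESIDUES - N_PROTEINS

ala_ala_fraction = per_mille_to_fraction(11.544)  # AlaAla value from the table
approx_ala_ala_count = ala_ala_fraction * n_dipeptides

if __name__ == "__main__":
    print(f"fraction: {ala_ala_fraction:.6f}")
    print(f"approximate AlaAla count: {approx_ala_ala_count:.0f}")
```

Because the table values are rounded to three decimals, counts recovered this way are only approximate.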

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.544 ± 0.259
AlaCys: 1.202 ± 0.05
AlaAsp: 5.541 ± 0.13
AlaGlu: 6.531 ± 0.129
AlaPhe: 3.199 ± 0.088
AlaGly: 7.32 ± 0.137
AlaHis: 1.28 ± 0.05
AlaIle: 5.471 ± 0.108
AlaLys: 5.993 ± 0.133
AlaLeu: 8.314 ± 0.157
AlaMet: 2.426 ± 0.07
AlaAsn: 3.558 ± 0.078
AlaPro: 2.689 ± 0.083
AlaGln: 2.64 ± 0.081
AlaArg: 3.398 ± 0.084
AlaSer: 4.094 ± 0.095
AlaThr: 4.579 ± 0.113
AlaVal: 7.127 ± 0.126
AlaTrp: 0.875 ± 0.046
AlaTyr: 2.888 ± 0.068
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.988 ± 0.044
CysCys: 0.294 ± 0.031
CysAsp: 0.726 ± 0.039
CysGlu: 0.711 ± 0.038
CysPhe: 0.488 ± 0.028
CysGly: 1.299 ± 0.057
CysHis: 0.337 ± 0.027
CysIle: 0.821 ± 0.041
CysLys: 0.621 ± 0.04
CysLeu: 1.381 ± 0.056
CysMet: 0.272 ± 0.022
CysAsn: 0.554 ± 0.03
CysPro: 0.679 ± 0.039
CysGln: 0.419 ± 0.032
CysArg: 0.904 ± 0.05
CysSer: 0.834 ± 0.048
CysThr: 0.586 ± 0.038
CysVal: 0.778 ± 0.043
CysTrp: 0.108 ± 0.014
CysTyr: 0.435 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.995 ± 0.098
AspCys: 0.808 ± 0.044
AspAsp: 2.705 ± 0.088
AspGlu: 3.865 ± 0.096
AspPhe: 2.819 ± 0.08
AspGly: 4.114 ± 0.117
AspHis: 0.826 ± 0.041
AspIle: 3.721 ± 0.091
AspLys: 3.591 ± 0.107
AspLeu: 5.744 ± 0.13
AspMet: 1.427 ± 0.05
AspAsn: 2.324 ± 0.079
AspPro: 2.324 ± 0.081
AspGln: 1.386 ± 0.053
AspArg: 2.462 ± 0.07
AspSer: 2.959 ± 0.082
AspThr: 2.512 ± 0.072
AspVal: 3.342 ± 0.077
AspTrp: 0.56 ± 0.036
AspTyr: 2.318 ± 0.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.435 ± 0.119
GluCys: 0.534 ± 0.035
GluAsp: 2.73 ± 0.083
GluGlu: 3.893 ± 0.102
GluPhe: 2.56 ± 0.076
GluGly: 3.076 ± 0.077
GluHis: 1.232 ± 0.051
GluIle: 4.963 ± 0.098
GluLys: 4.799 ± 0.112
GluLeu: 6.702 ± 0.132
GluMet: 1.727 ± 0.056
GluAsn: 3.461 ± 0.091
GluPro: 2.131 ± 0.072
GluGln: 2.305 ± 0.072
GluArg: 3.195 ± 0.084
GluSer: 2.932 ± 0.08
GluThr: 3.286 ± 0.081
GluVal: 3.973 ± 0.095
GluTrp: 0.543 ± 0.031
GluTyr: 2.218 ± 0.064
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.788 ± 0.1
PheCys: 0.722 ± 0.039
PheAsp: 2.393 ± 0.064
PheGlu: 1.92 ± 0.066
PhePhe: 2.116 ± 0.082
PheGly: 2.912 ± 0.091
PheHis: 0.742 ± 0.039
PheIle: 2.919 ± 0.089
PheLys: 2.583 ± 0.072
PheLeu: 3.952 ± 0.094
PheMet: 1.102 ± 0.044
PheAsn: 2.064 ± 0.061
PhePro: 1.623 ± 0.053
PheGln: 0.964 ± 0.041
PheArg: 1.706 ± 0.058
PheSer: 3.154 ± 0.08
PheThr: 2.572 ± 0.073
PheVal: 2.719 ± 0.077
PheTrp: 0.454 ± 0.029
PheTyr: 1.734 ± 0.057
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.184 ± 0.125
GlyCys: 1.066 ± 0.047
GlyAsp: 3.424 ± 0.078
GlyGlu: 4.592 ± 0.101
GlyPhe: 3.05 ± 0.074
GlyGly: 5.082 ± 0.133
GlyHis: 1.325 ± 0.054
GlyIle: 5.137 ± 0.107
GlyLys: 5.623 ± 0.123
GlyLeu: 6.533 ± 0.123
GlyMet: 1.969 ± 0.071
GlyAsn: 2.879 ± 0.083
GlyPro: 1.338 ± 0.051
GlyGln: 2.385 ± 0.078
GlyArg: 3.725 ± 0.097
GlySer: 4.125 ± 0.094
GlyThr: 3.935 ± 0.078
GlyVal: 4.542 ± 0.105
GlyTrp: 0.698 ± 0.032
GlyTyr: 2.661 ± 0.068
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.232 ± 0.051
HisCys: 0.311 ± 0.028
HisAsp: 1.042 ± 0.047
HisGlu: 1.048 ± 0.053
HisPhe: 0.809 ± 0.037
HisGly: 1.384 ± 0.058
HisHis: 0.428 ± 0.025
HisIle: 1.135 ± 0.045
HisLys: 1.122 ± 0.045
HisLeu: 1.831 ± 0.051
HisMet: 0.365 ± 0.028
HisAsn: 0.78 ± 0.043
HisPro: 0.917 ± 0.043
HisGln: 0.618 ± 0.033
HisArg: 0.925 ± 0.044
HisSer: 1.04 ± 0.052
HisThr: 0.945 ± 0.044
HisVal: 0.973 ± 0.045
HisTrp: 0.207 ± 0.02
HisTyr: 0.735 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.833 ± 0.114
IleCys: 1.15 ± 0.047
IleAsp: 3.863 ± 0.097
IleGlu: 3.764 ± 0.091
IlePhe: 3.18 ± 0.088
IleGly: 4.853 ± 0.114
IleHis: 1.092 ± 0.052
IleIle: 5.07 ± 0.124
IleLys: 4.531 ± 0.11
IleLeu: 6.942 ± 0.148
IleMet: 1.701 ± 0.066
IleAsn: 3.422 ± 0.087
IlePro: 2.879 ± 0.071
IleGln: 1.807 ± 0.06
IleArg: 2.97 ± 0.082
IleSer: 4.399 ± 0.094
IleThr: 3.881 ± 0.108
IleVal: 4.369 ± 0.104
IleTrp: 0.597 ± 0.031
IleTyr: 2.473 ± 0.073
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.335 ± 0.109
LysCys: 0.517 ± 0.033
LysAsp: 3.608 ± 0.127
LysGlu: 3.984 ± 0.099
LysPhe: 2.627 ± 0.085
LysGly: 3.228 ± 0.086
LysHis: 1.09 ± 0.048
LysIle: 5.037 ± 0.106
LysLys: 4.631 ± 0.136
LysLeu: 6.754 ± 0.124
LysMet: 1.777 ± 0.064
LysAsn: 3.534 ± 0.104
LysPro: 2.646 ± 0.1
LysGln: 2.363 ± 0.073
LysArg: 2.955 ± 0.084
LysSer: 3.485 ± 0.087
LysThr: 3.666 ± 0.085
LysVal: 4.421 ± 0.104
LysTrp: 0.569 ± 0.031
LysTyr: 2.531 ± 0.07
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.288 ± 0.198
LeuCys: 1.364 ± 0.056
LeuAsp: 4.987 ± 0.098
LeuGlu: 4.769 ± 0.093
LeuPhe: 3.878 ± 0.11
LeuGly: 6.364 ± 0.121
LeuHis: 1.998 ± 0.063
LeuIle: 6.626 ± 0.142
LeuLys: 5.947 ± 0.109
LeuLeu: 10.245 ± 0.219
LeuMet: 2.268 ± 0.07
LeuAsn: 4.821 ± 0.102
LeuPro: 5.578 ± 0.107
LeuGln: 3.729 ± 0.084
LeuArg: 5.318 ± 0.122
LeuSer: 6.862 ± 0.118
LeuThr: 6.099 ± 0.115
LeuVal: 5.908 ± 0.115
LeuTrp: 0.817 ± 0.039
LeuTyr: 3.013 ± 0.075
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.506 ± 0.075
MetCys: 0.247 ± 0.021
MetAsp: 1.195 ± 0.046
MetGlu: 1.379 ± 0.047
MetPhe: 0.86 ± 0.043
MetGly: 1.621 ± 0.061
MetHis: 0.439 ± 0.029
MetIle: 1.68 ± 0.069
MetLys: 1.608 ± 0.055
MetLeu: 2.594 ± 0.078
MetMet: 0.605 ± 0.034
MetAsn: 1.303 ± 0.042
MetPro: 1.258 ± 0.053
MetGln: 0.981 ± 0.042
MetArg: 1.423 ± 0.048
MetSer: 1.589 ± 0.062
MetThr: 1.325 ± 0.051
MetVal: 1.595 ± 0.054
MetTrp: 0.149 ± 0.019
MetTyr: 0.661 ± 0.037
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.234 ± 0.088
AsnCys: 0.661 ± 0.034
AsnAsp: 2.294 ± 0.078
AsnGlu: 2.765 ± 0.078
AsnPhe: 2.121 ± 0.063
AsnGly: 3.6 ± 0.113
AsnHis: 0.815 ± 0.038
AsnIle: 3.504 ± 0.09
AsnLys: 3.115 ± 0.102
AsnLeu: 4.821 ± 0.092
AsnMet: 1.295 ± 0.056
AsnAsn: 2.309 ± 0.077
AsnPro: 2.322 ± 0.072
AsnGln: 1.379 ± 0.051
AsnArg: 2.296 ± 0.071
AsnSer: 2.81 ± 0.087
AsnThr: 2.196 ± 0.071
AsnVal: 2.577 ± 0.066
AsnTrp: 0.497 ± 0.028
AsnTyr: 1.803 ± 0.061
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.872 ± 0.116
ProCys: 0.445 ± 0.031
ProAsp: 2.653 ± 0.086
ProGlu: 3.478 ± 0.087
ProPhe: 1.608 ± 0.054
ProGly: 2.942 ± 0.089
ProHis: 0.733 ± 0.04
ProIle: 2.345 ± 0.063
ProLys: 2.17 ± 0.081
ProLeu: 3.943 ± 0.08
ProMet: 0.763 ± 0.04
ProAsn: 1.848 ± 0.072
ProPro: 1.446 ± 0.074
ProGln: 1.49 ± 0.058
ProArg: 1.572 ± 0.053
ProSer: 2.242 ± 0.066
ProThr: 2.38 ± 0.084
ProVal: 2.806 ± 0.084
ProTrp: 0.447 ± 0.026
ProTyr: 1.606 ± 0.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.539 ± 0.09
GlnCys: 0.242 ± 0.024
GlnAsp: 1.463 ± 0.059
GlnGlu: 1.946 ± 0.061
GlnPhe: 1.122 ± 0.045
GlnGly: 2.045 ± 0.063
GlnHis: 0.562 ± 0.032
GlnIle: 2.358 ± 0.066
GlnLys: 2.328 ± 0.07
GlnLeu: 3.096 ± 0.077
GlnMet: 0.802 ± 0.042
GlnAsn: 1.729 ± 0.051
GlnPro: 1.531 ± 0.074
GlnGln: 1.295 ± 0.054
GlnArg: 1.595 ± 0.058
GlnSer: 1.77 ± 0.058
GlnThr: 1.872 ± 0.061
GlnVal: 2.276 ± 0.068
GlnTrp: 0.246 ± 0.024
GlnTyr: 1.031 ± 0.049
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.334 ± 0.085
ArgCys: 0.7 ± 0.039
ArgAsp: 2.343 ± 0.069
ArgGlu: 3.215 ± 0.087
ArgPhe: 1.872 ± 0.062
ArgGly: 2.685 ± 0.082
ArgHis: 1.02 ± 0.05
ArgIle: 3.221 ± 0.089
ArgLys: 3.089 ± 0.08
ArgLeu: 5.305 ± 0.123
ArgMet: 1.193 ± 0.045
ArgAsn: 2.278 ± 0.066
ArgPro: 2.064 ± 0.069
ArgGln: 2.41 ± 0.074
ArgArg: 3.104 ± 0.092
ArgSer: 2.828 ± 0.079
ArgThr: 2.222 ± 0.072
ArgVal: 2.733 ± 0.066
ArgTrp: 0.359 ± 0.025
ArgTyr: 1.84 ± 0.057
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.011 ± 0.109
SerCys: 0.765 ± 0.037
SerAsp: 3.467 ± 0.075
SerGlu: 3.798 ± 0.084
SerPhe: 2.702 ± 0.068
SerGly: 5.063 ± 0.117
SerHis: 1.02 ± 0.046
SerIle: 3.822 ± 0.086
SerLys: 3.202 ± 0.077
SerLeu: 5.956 ± 0.112
SerMet: 1.409 ± 0.061
SerAsn: 2.376 ± 0.087
SerPro: 2.274 ± 0.075
SerGln: 1.641 ± 0.057
SerArg: 2.605 ± 0.073
SerSer: 3.535 ± 0.079
SerThr: 3.135 ± 0.074
SerVal: 3.926 ± 0.083
SerTrp: 0.556 ± 0.034
SerTyr: 2.151 ± 0.063
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.2 ± 0.127
ThrCys: 0.694 ± 0.04
ThrAsp: 3.141 ± 0.082
ThrGlu: 3.476 ± 0.083
ThrPhe: 2.151 ± 0.07
ThrGly: 4.339 ± 0.101
ThrHis: 0.848 ± 0.041
ThrIle: 3.266 ± 0.071
ThrLys: 2.946 ± 0.081
ThrLeu: 5.193 ± 0.112
ThrMet: 1.278 ± 0.052
ThrAsn: 2.078 ± 0.062
ThrPro: 2.369 ± 0.077
ThrGln: 1.319 ± 0.042
ThrArg: 1.924 ± 0.053
ThrSer: 3.02 ± 0.068
ThrThr: 3.061 ± 0.072
ThrVal: 4.523 ± 0.097
ThrTrp: 0.551 ± 0.028
ThrTyr: 1.881 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.068 ± 0.132
ValCys: 0.899 ± 0.042
ValAsp: 3.744 ± 0.077
ValGlu: 3.928 ± 0.094
ValPhe: 2.88 ± 0.077
ValGly: 4.252 ± 0.084
ValHis: 1.055 ± 0.042
ValIle: 4.661 ± 0.113
ValLys: 4.265 ± 0.092
ValLeu: 6.645 ± 0.126
ValMet: 1.643 ± 0.059
ValAsn: 2.938 ± 0.081
ValPro: 2.86 ± 0.07
ValGln: 1.751 ± 0.064
ValArg: 2.992 ± 0.089
ValSer: 4.107 ± 0.103
ValThr: 3.669 ± 0.077
ValVal: 4.615 ± 0.108
ValTrp: 0.642 ± 0.037
ValTyr: 2.3 ± 0.068
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.614 ± 0.039
TrpCys: 0.104 ± 0.015
TrpAsp: 0.422 ± 0.028
TrpGlu: 0.545 ± 0.033
TrpPhe: 0.359 ± 0.028
TrpGly: 0.579 ± 0.036
TrpHis: 0.283 ± 0.025
TrpIle: 0.541 ± 0.036
TrpLys: 0.452 ± 0.03
TrpLeu: 1.191 ± 0.045
TrpMet: 0.227 ± 0.019
TrpAsn: 0.394 ± 0.026
TrpPro: 0.437 ± 0.032
TrpGln: 0.778 ± 0.044
TrpArg: 0.627 ± 0.034
TrpSer: 0.491 ± 0.031
TrpThr: 0.411 ± 0.027
TrpVal: 0.456 ± 0.033
TrpTrp: 0.127 ± 0.017
TrpTyr: 0.309 ± 0.026
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.666 ± 0.07
TyrCys: 0.51 ± 0.032
TyrAsp: 2.064 ± 0.067
TyrGlu: 2.088 ± 0.07
TyrPhe: 1.738 ± 0.062
TyrGly: 2.516 ± 0.071
TyrHis: 0.72 ± 0.038
TyrIle: 2.478 ± 0.073
TyrLys: 2.144 ± 0.075
TyrLeu: 3.736 ± 0.089
TyrMet: 0.767 ± 0.039
TyrAsn: 1.766 ± 0.055
TyrPro: 1.423 ± 0.056
TyrGln: 1.185 ± 0.052
TyrArg: 2.118 ± 0.064
TyrSer: 2.138 ± 0.062
TyrThr: 1.928 ± 0.068
TyrVal: 2.255 ± 0.059
TyrTrp: 0.348 ± 0.026
TyrTyr: 1.537 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
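
For readers who want to work with the listing above programmatically, the following sketch parses entries of the form "AlaCys: 1.202 ± 0.05" into a nested dictionary keyed by first and second residue. The regular expression and the names `parse_entry` and `build_matrix` are assumptions for illustration; the pattern also tolerates a leading copy of the value in front of the dipeptide name and skips row-label lines such as "Ala".

```python
import re

# Two three-letter residue codes, a colon, the value and its error.
ENTRY_RE = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None if the line is not an entry."""
    match = ENTRY_RE.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def build_matrix(lines):
    """Collect entries into {first_residue: {second_residue: (value, error)}}."""
    matrix = {}
    for line in lines:
        parsed = parse_entry(line)
        if parsed is None:
            continue  # row labels, headers and blank lines are ignored
        first, second, value, error = parsed
        matrix.setdefault(first, {})[second] = (value, error)
    return matrix


if __name__ == "__main__":
    sample = ["Ala", "AlaAla: 11.544 ± 0.259", "AlaCys: 1.202 ± 0.05"]
    print(build_matrix(sample))
```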
Statistics based on 1566 proteins (537427 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
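
A minimal sketch of what a protein-level bootstrap can look like is given below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Treating the reported ± value as the standard deviation over 100 replicates is an assumption, as are the function names; the source states only that bootstrapping (x100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each replicate draws len(proteins) whole proteins with replacement,
    so correlations between residues inside a protein are preserved.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAC", "ACCC", "CCAA", "AAAA"]  # toy sequences, not real data
    print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps each sequence intact, which is the usual reason to bootstrap at this level.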

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski