Amino acid dipepetide frequency for Bombiscardovia coagulans

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.43AlaAla: 9.43 ± 0.176
1.093AlaCys: 1.093 ± 0.053
5.486AlaAsp: 5.486 ± 0.128
4.575AlaGlu: 4.575 ± 0.117
3.267AlaPhe: 3.267 ± 0.079
7.553AlaGly: 7.553 ± 0.134
2.237AlaHis: 2.237 ± 0.067
5.322AlaIle: 5.322 ± 0.131
3.896AlaLys: 3.896 ± 0.113
9.656AlaLeu: 9.656 ± 0.176
2.421AlaMet: 2.421 ± 0.065
2.984AlaAsn: 2.984 ± 0.079
3.512AlaPro: 3.512 ± 0.089
5.393AlaGln: 5.393 ± 0.12
4.985AlaArg: 4.985 ± 0.108
6.575AlaSer: 6.575 ± 0.13
4.801AlaThr: 4.801 ± 0.106
7.236AlaVal: 7.236 ± 0.128
1.507AlaTrp: 1.507 ± 0.07
2.461AlaTyr: 2.461 ± 0.066
0.0AlaXaa: 0.0 ± 0.0
Cys
0.895CysAla: 0.895 ± 0.044
0.147CysCys: 0.147 ± 0.019
0.461CysAsp: 0.461 ± 0.032
0.453CysGlu: 0.453 ± 0.031
0.358CysPhe: 0.358 ± 0.026
0.861CysGly: 0.861 ± 0.044
0.202CysHis: 0.202 ± 0.023
0.562CysIle: 0.562 ± 0.031
0.412CysLys: 0.412 ± 0.031
0.982CysLeu: 0.982 ± 0.046
0.206CysMet: 0.206 ± 0.02
0.323CysAsn: 0.323 ± 0.027
0.459CysPro: 0.459 ± 0.03
0.416CysGln: 0.416 ± 0.029
0.42CysArg: 0.42 ± 0.028
0.75CysSer: 0.75 ± 0.043
0.485CysThr: 0.485 ± 0.03
0.752CysVal: 0.752 ± 0.041
0.113CysTrp: 0.113 ± 0.015
0.246CysTyr: 0.246 ± 0.025
0.0CysXaa: 0.0 ± 0.0
Asp
5.274AspAla: 5.274 ± 0.112
0.541AspCys: 0.541 ± 0.032
3.536AspAsp: 3.536 ± 0.11
3.865AspGlu: 3.865 ± 0.102
2.19AspPhe: 2.19 ± 0.066
4.558AspGly: 4.558 ± 0.125
1.378AspHis: 1.378 ± 0.051
3.433AspIle: 3.433 ± 0.082
2.514AspLys: 2.514 ± 0.085
5.195AspLeu: 5.195 ± 0.12
1.574AspMet: 1.574 ± 0.057
2.045AspAsn: 2.045 ± 0.069
3.211AspPro: 3.211 ± 0.091
2.649AspGln: 2.649 ± 0.087
3.146AspArg: 3.146 ± 0.096
3.885AspSer: 3.885 ± 0.094
3.116AspThr: 3.116 ± 0.083
4.195AspVal: 4.195 ± 0.102
0.82AspTrp: 0.82 ± 0.041
1.802AspTyr: 1.802 ± 0.066
0.0AspXaa: 0.0 ± 0.0
Glu
5.29GluAla: 5.29 ± 0.111
0.398GluCys: 0.398 ± 0.03
3.251GluAsp: 3.251 ± 0.091
3.605GluGlu: 3.605 ± 0.107
1.584GluPhe: 1.584 ± 0.059
3.873GluGly: 3.873 ± 0.102
1.847GluHis: 1.847 ± 0.072
2.758GluIle: 2.758 ± 0.086
2.037GluLys: 2.037 ± 0.077
5.134GluLeu: 5.134 ± 0.107
1.271GluMet: 1.271 ± 0.05
1.899GluAsn: 1.899 ± 0.064
2.398GluPro: 2.398 ± 0.072
3.558GluGln: 3.558 ± 0.091
3.548GluArg: 3.548 ± 0.112
3.568GluSer: 3.568 ± 0.088
2.825GluThr: 2.825 ± 0.071
3.748GluVal: 3.748 ± 0.088
0.578GluTrp: 0.578 ± 0.032
1.485GluTyr: 1.485 ± 0.058
0.0GluXaa: 0.0 ± 0.0
Phe
3.298PheAla: 3.298 ± 0.089
0.368PheCys: 0.368 ± 0.029
2.372PheAsp: 2.372 ± 0.072
1.798PheGlu: 1.798 ± 0.06
1.346PhePhe: 1.346 ± 0.054
2.964PheGly: 2.964 ± 0.085
0.719PheHis: 0.719 ± 0.042
2.053PheIle: 2.053 ± 0.081
1.152PheLys: 1.152 ± 0.048
3.065PheLeu: 3.065 ± 0.08
0.802PheMet: 0.802 ± 0.042
1.295PheAsn: 1.295 ± 0.056
1.354PhePro: 1.354 ± 0.054
1.164PheGln: 1.164 ± 0.046
1.392PheArg: 1.392 ± 0.053
2.548PheSer: 2.548 ± 0.075
2.281PheThr: 2.281 ± 0.082
2.509PheVal: 2.509 ± 0.077
0.475PheTrp: 0.475 ± 0.034
0.938PheTyr: 0.938 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
5.957GlyAla: 5.957 ± 0.141
0.661GlyCys: 0.661 ± 0.035
3.847GlyAsp: 3.847 ± 0.109
3.679GlyGlu: 3.679 ± 0.089
2.938GlyPhe: 2.938 ± 0.078
5.187GlyGly: 5.187 ± 0.126
1.865GlyHis: 1.865 ± 0.061
4.783GlyIle: 4.783 ± 0.108
4.037GlyLys: 4.037 ± 0.097
7.337GlyLeu: 7.337 ± 0.147
1.873GlyMet: 1.873 ± 0.066
2.809GlyAsn: 2.809 ± 0.11
2.657GlyPro: 2.657 ± 0.076
3.679GlyGln: 3.679 ± 0.098
4.071GlyArg: 4.071 ± 0.107
6.003GlySer: 6.003 ± 0.115
4.18GlyThr: 4.18 ± 0.112
5.591GlyVal: 5.591 ± 0.126
1.216GlyTrp: 1.216 ± 0.061
2.402GlyTyr: 2.402 ± 0.076
0.002GlyXaa: 0.002 ± 0.002
His
1.97HisAla: 1.97 ± 0.063
0.192HisCys: 0.192 ± 0.02
1.39HisAsp: 1.39 ± 0.057
1.226HisGlu: 1.226 ± 0.049
0.754HisPhe: 0.754 ± 0.043
1.758HisGly: 1.758 ± 0.07
0.618HisHis: 0.618 ± 0.038
1.509HisIle: 1.509 ± 0.056
0.935HisLys: 0.935 ± 0.053
1.97HisLeu: 1.97 ± 0.063
0.602HisMet: 0.602 ± 0.035
0.869HisAsn: 0.869 ± 0.044
1.321HisPro: 1.321 ± 0.06
1.111HisGln: 1.111 ± 0.053
1.499HisArg: 1.499 ± 0.06
1.772HisSer: 1.772 ± 0.083
1.459HisThr: 1.459 ± 0.053
1.637HisVal: 1.637 ± 0.054
0.402HisTrp: 0.402 ± 0.026
0.683HisTyr: 0.683 ± 0.042
0.0HisXaa: 0.0 ± 0.0
Ile
6.023IleAla: 6.023 ± 0.121
0.689IleCys: 0.689 ± 0.039
4.011IleAsp: 4.011 ± 0.099
3.366IleGlu: 3.366 ± 0.084
1.93IlePhe: 1.93 ± 0.082
4.358IleGly: 4.358 ± 0.111
1.233IleHis: 1.233 ± 0.046
3.203IleIle: 3.203 ± 0.092
1.98IleLys: 1.98 ± 0.067
4.692IleLeu: 4.692 ± 0.113
1.241IleMet: 1.241 ± 0.054
2.097IleAsn: 2.097 ± 0.06
2.885IlePro: 2.885 ± 0.081
2.033IleGln: 2.033 ± 0.058
3.013IleArg: 3.013 ± 0.08
4.122IleSer: 4.122 ± 0.1
3.399IleThr: 3.399 ± 0.094
4.874IleVal: 4.874 ± 0.115
0.671IleTrp: 0.671 ± 0.042
1.346IleTyr: 1.346 ± 0.052
0.0IleXaa: 0.0 ± 0.0
Lys
4.378LysAla: 4.378 ± 0.111
0.188LysCys: 0.188 ± 0.019
2.594LysAsp: 2.594 ± 0.083
2.453LysGlu: 2.453 ± 0.083
0.948LysPhe: 0.948 ± 0.046
2.96LysGly: 2.96 ± 0.087
1.099LysHis: 1.099 ± 0.043
2.122LysIle: 2.122 ± 0.073
1.907LysLys: 1.907 ± 0.081
3.611LysLeu: 3.611 ± 0.101
1.018LysMet: 1.018 ± 0.043
1.622LysAsn: 1.622 ± 0.066
2.372LysPro: 2.372 ± 0.074
2.204LysGln: 2.204 ± 0.069
2.374LysArg: 2.374 ± 0.076
2.734LysSer: 2.734 ± 0.079
2.722LysThr: 2.722 ± 0.078
3.081LysVal: 3.081 ± 0.088
0.424LysTrp: 0.424 ± 0.027
1.057LysTyr: 1.057 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
9.593LeuAla: 9.593 ± 0.188
1.028LeuCys: 1.028 ± 0.045
5.476LeuAsp: 5.476 ± 0.122
4.746LeuGlu: 4.746 ± 0.119
3.144LeuPhe: 3.144 ± 0.082
7.189LeuGly: 7.189 ± 0.132
2.021LeuHis: 2.021 ± 0.059
5.201LeuIle: 5.201 ± 0.117
3.677LeuLys: 3.677 ± 0.085
9.07LeuLeu: 9.07 ± 0.201
2.2LeuMet: 2.2 ± 0.063
3.239LeuAsn: 3.239 ± 0.091
4.968LeuPro: 4.968 ± 0.128
3.433LeuGln: 3.433 ± 0.091
4.819LeuArg: 4.819 ± 0.113
7.294LeuSer: 7.294 ± 0.145
5.754LeuThr: 5.754 ± 0.121
7.128LeuVal: 7.128 ± 0.136
1.142LeuTrp: 1.142 ± 0.049
2.166LeuTyr: 2.166 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
2.437MetAla: 2.437 ± 0.074
0.234MetCys: 0.234 ± 0.019
1.332MetAsp: 1.332 ± 0.054
1.188MetGlu: 1.188 ± 0.045
0.762MetPhe: 0.762 ± 0.041
1.778MetGly: 1.778 ± 0.058
0.539MetHis: 0.539 ± 0.032
1.198MetIle: 1.198 ± 0.051
1.105MetLys: 1.105 ± 0.046
2.394MetLeu: 2.394 ± 0.07
0.612MetMet: 0.612 ± 0.038
0.966MetAsn: 0.966 ± 0.035
1.261MetPro: 1.261 ± 0.051
0.909MetGln: 0.909 ± 0.046
1.418MetArg: 1.418 ± 0.053
2.045MetSer: 2.045 ± 0.062
1.724MetThr: 1.724 ± 0.063
1.877MetVal: 1.877 ± 0.053
0.246MetTrp: 0.246 ± 0.02
0.525MetTyr: 0.525 ± 0.03
0.0MetXaa: 0.0 ± 0.0
Asn
3.128AsnAla: 3.128 ± 0.083
0.313AsnCys: 0.313 ± 0.022
1.92AsnAsp: 1.92 ± 0.066
1.954AsnGlu: 1.954 ± 0.069
1.055AsnPhe: 1.055 ± 0.044
2.865AsnGly: 2.865 ± 0.106
0.792AsnHis: 0.792 ± 0.04
1.932AsnIle: 1.932 ± 0.067
1.768AsnLys: 1.768 ± 0.062
3.051AsnLeu: 3.051 ± 0.077
0.954AsnMet: 0.954 ± 0.042
1.402AsnAsn: 1.402 ± 0.067
2.322AsnPro: 2.322 ± 0.08
1.633AsnGln: 1.633 ± 0.066
1.992AsnArg: 1.992 ± 0.062
2.271AsnSer: 2.271 ± 0.075
2.269AsnThr: 2.269 ± 0.074
2.328AsnVal: 2.328 ± 0.07
0.564AsnTrp: 0.564 ± 0.037
0.992AsnTyr: 0.992 ± 0.054
0.0AsnXaa: 0.0 ± 0.0
Pro
4.277ProAla: 4.277 ± 0.123
0.291ProCys: 0.291 ± 0.022
3.194ProAsp: 3.194 ± 0.085
3.152ProGlu: 3.152 ± 0.078
1.515ProPhe: 1.515 ± 0.063
3.336ProGly: 3.336 ± 0.091
1.057ProHis: 1.057 ± 0.046
2.546ProIle: 2.546 ± 0.073
1.934ProLys: 1.934 ± 0.068
3.993ProLeu: 3.993 ± 0.099
0.972ProMet: 0.972 ± 0.049
1.596ProAsn: 1.596 ± 0.054
1.382ProPro: 1.382 ± 0.059
2.19ProGln: 2.19 ± 0.066
2.014ProArg: 2.014 ± 0.068
3.445ProSer: 3.445 ± 0.087
2.901ProThr: 2.901 ± 0.083
4.239ProVal: 4.239 ± 0.12
0.683ProTrp: 0.683 ± 0.043
1.372ProTyr: 1.372 ± 0.055
0.0ProXaa: 0.0 ± 0.0
Gln
5.508GlnAla: 5.508 ± 0.113
0.327GlnCys: 0.327 ± 0.027
2.295GlnAsp: 2.295 ± 0.072
3.021GlnGlu: 3.021 ± 0.085
1.257GlnPhe: 1.257 ± 0.06
3.33GlnGly: 3.33 ± 0.077
1.172GlnHis: 1.172 ± 0.052
2.479GlnIle: 2.479 ± 0.067
1.572GlnLys: 1.572 ± 0.06
4.597GlnLeu: 4.597 ± 0.113
1.224GlnMet: 1.224 ± 0.045
1.297GlnAsn: 1.297 ± 0.049
2.301GlnPro: 2.301 ± 0.084
2.869GlnGln: 2.869 ± 0.098
2.598GlnArg: 2.598 ± 0.072
3.136GlnSer: 3.136 ± 0.091
2.788GlnThr: 2.788 ± 0.08
4.172GlnVal: 4.172 ± 0.105
0.685GlnTrp: 0.685 ± 0.041
1.348GlnTyr: 1.348 ± 0.055
0.0GlnXaa: 0.0 ± 0.0
Arg
4.086ArgAla: 4.086 ± 0.098
0.416ArgCys: 0.416 ± 0.029
2.839ArgAsp: 2.839 ± 0.081
3.037ArgGlu: 3.037 ± 0.098
1.903ArgPhe: 1.903 ± 0.055
3.1ArgGly: 3.1 ± 0.086
1.257ArgHis: 1.257 ± 0.049
3.514ArgIle: 3.514 ± 0.09
2.782ArgLys: 2.782 ± 0.077
5.189ArgLeu: 5.189 ± 0.117
1.588ArgMet: 1.588 ± 0.06
1.895ArgAsn: 1.895 ± 0.071
2.182ArgPro: 2.182 ± 0.067
2.861ArgGln: 2.861 ± 0.093
3.372ArgArg: 3.372 ± 0.13
3.764ArgSer: 3.764 ± 0.084
3.102ArgThr: 3.102 ± 0.073
3.98ArgVal: 3.98 ± 0.098
0.828ArgTrp: 0.828 ± 0.042
1.756ArgTyr: 1.756 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
6.324SerAla: 6.324 ± 0.116
0.665SerCys: 0.665 ± 0.039
4.269SerAsp: 4.269 ± 0.1
3.51SerGlu: 3.51 ± 0.083
2.602SerPhe: 2.602 ± 0.075
5.651SerGly: 5.651 ± 0.109
1.717SerHis: 1.717 ± 0.058
4.227SerIle: 4.227 ± 0.096
3.199SerLys: 3.199 ± 0.087
6.791SerLeu: 6.791 ± 0.137
1.885SerMet: 1.885 ± 0.05
2.804SerAsn: 2.804 ± 0.087
2.97SerPro: 2.97 ± 0.077
4.037SerGln: 4.037 ± 0.103
3.744SerArg: 3.744 ± 0.094
6.058SerSer: 6.058 ± 0.137
4.249SerThr: 4.249 ± 0.1
5.197SerVal: 5.197 ± 0.114
1.186SerTrp: 1.186 ± 0.051
2.055SerTyr: 2.055 ± 0.067
0.002SerXaa: 0.002 ± 0.002
Thr
5.478ThrAla: 5.478 ± 0.109
0.525ThrCys: 0.525 ± 0.029
3.423ThrAsp: 3.423 ± 0.081
2.522ThrGlu: 2.522 ± 0.064
2.043ThrPhe: 2.043 ± 0.067
4.777ThrGly: 4.777 ± 0.108
1.378ThrHis: 1.378 ± 0.05
3.582ThrIle: 3.582 ± 0.085
2.261ThrLys: 2.261 ± 0.081
5.542ThrLeu: 5.542 ± 0.122
1.299ThrMet: 1.299 ± 0.051
2.168ThrAsn: 2.168 ± 0.067
3.12ThrPro: 3.12 ± 0.099
2.705ThrGln: 2.705 ± 0.074
2.782ThrArg: 2.782 ± 0.073
4.158ThrSer: 4.158 ± 0.113
3.584ThrThr: 3.584 ± 0.107
4.954ThrVal: 4.954 ± 0.105
0.727ThrTrp: 0.727 ± 0.043
1.73ThrTyr: 1.73 ± 0.058
0.002ThrXaa: 0.002 ± 0.002
Val
7.28ValAla: 7.28 ± 0.142
0.96ValCys: 0.96 ± 0.049
4.962ValAsp: 4.962 ± 0.106
4.243ValGlu: 4.243 ± 0.106
2.758ValPhe: 2.758 ± 0.087
5.324ValGly: 5.324 ± 0.117
1.592ValHis: 1.592 ± 0.069
4.469ValIle: 4.469 ± 0.115
2.95ValLys: 2.95 ± 0.088
7.133ValLeu: 7.133 ± 0.121
1.748ValMet: 1.748 ± 0.057
2.665ValAsn: 2.665 ± 0.075
3.572ValPro: 3.572 ± 0.103
3.0ValGln: 3.0 ± 0.087
3.94ValArg: 3.94 ± 0.09
6.021ValSer: 6.021 ± 0.111
4.768ValThr: 4.768 ± 0.094
6.393ValVal: 6.393 ± 0.15
1.026ValTrp: 1.026 ± 0.049
1.899ValTyr: 1.899 ± 0.07
0.0ValXaa: 0.0 ± 0.0
Trp
1.162TrpAla: 1.162 ± 0.049
0.16TrpCys: 0.16 ± 0.018
0.699TrpAsp: 0.699 ± 0.042
0.626TrpGlu: 0.626 ± 0.038
0.554TrpPhe: 0.554 ± 0.037
1.089TrpGly: 1.089 ± 0.096
0.327TrpHis: 0.327 ± 0.025
0.847TrpIle: 0.847 ± 0.047
0.665TrpLys: 0.665 ± 0.037
1.487TrpLeu: 1.487 ± 0.064
0.436TrpMet: 0.436 ± 0.03
0.548TrpAsn: 0.548 ± 0.036
0.535TrpPro: 0.535 ± 0.036
0.756TrpGln: 0.756 ± 0.036
0.752TrpArg: 0.752 ± 0.04
1.053TrpSer: 1.053 ± 0.05
0.786TrpThr: 0.786 ± 0.047
0.879TrpVal: 0.879 ± 0.041
0.321TrpTrp: 0.321 ± 0.028
0.424TrpTyr: 0.424 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.604TyrAla: 2.604 ± 0.071
0.309TyrCys: 0.309 ± 0.023
1.57TyrAsp: 1.57 ± 0.063
1.635TyrGlu: 1.635 ± 0.072
0.984TyrPhe: 0.984 ± 0.043
2.261TyrGly: 2.261 ± 0.073
0.626TyrHis: 0.626 ± 0.039
1.427TyrIle: 1.427 ± 0.053
1.083TyrLys: 1.083 ± 0.057
2.396TyrLeu: 2.396 ± 0.064
0.622TyrMet: 0.622 ± 0.033
0.986TyrAsn: 0.986 ± 0.054
1.307TyrPro: 1.307 ± 0.047
1.376TyrGln: 1.376 ± 0.06
1.574TyrArg: 1.574 ± 0.058
1.938TyrSer: 1.938 ± 0.06
1.564TyrThr: 1.564 ± 0.063
1.93TyrVal: 1.93 ± 0.066
0.495TyrTrp: 0.495 ± 0.037
0.887TyrTyr: 0.887 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.002XaaGlu: 0.002 ± 0.002
0.0XaaPhe: 0.0 ± 0.0
0.002XaaGly: 0.002 ± 0.002
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.002XaaThr: 0.002 ± 0.002
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.004XaaXaa: 0.004 ± 0.004
Statistics based on 1438 proteins (505083 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski