Amino acid dipepetide frequency for Boudabousia tangfeifanii

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.984AlaAla: 12.984 ± 0.229
0.94AlaCys: 0.94 ± 0.038
5.863AlaAsp: 5.863 ± 0.111
7.223AlaGlu: 7.223 ± 0.158
3.629AlaPhe: 3.629 ± 0.087
8.853AlaGly: 8.853 ± 0.158
1.865AlaHis: 1.865 ± 0.061
5.4AlaIle: 5.4 ± 0.117
6.371AlaLys: 6.371 ± 0.154
11.341AlaLeu: 11.341 ± 0.191
2.658AlaMet: 2.658 ± 0.065
3.892AlaAsn: 3.892 ± 0.073
5.373AlaPro: 5.373 ± 0.145
5.14AlaGln: 5.14 ± 0.118
5.545AlaArg: 5.545 ± 0.115
5.901AlaSer: 5.901 ± 0.115
6.28AlaThr: 6.28 ± 0.115
6.978AlaVal: 6.978 ± 0.129
1.547AlaTrp: 1.547 ± 0.056
2.216AlaTyr: 2.216 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.799CysAla: 0.799 ± 0.042
0.066CysCys: 0.066 ± 0.012
0.468CysAsp: 0.468 ± 0.026
0.551CysGlu: 0.551 ± 0.032
0.309CysPhe: 0.309 ± 0.02
0.772CysGly: 0.772 ± 0.038
0.186CysHis: 0.186 ± 0.017
0.241CysIle: 0.241 ± 0.021
0.232CysLys: 0.232 ± 0.018
0.689CysLeu: 0.689 ± 0.035
0.133CysMet: 0.133 ± 0.014
0.243CysAsn: 0.243 ± 0.022
0.475CysPro: 0.475 ± 0.032
0.387CysGln: 0.387 ± 0.03
0.371CysArg: 0.371 ± 0.023
0.487CysSer: 0.487 ± 0.03
0.428CysThr: 0.428 ± 0.03
0.536CysVal: 0.536 ± 0.029
0.099CysTrp: 0.099 ± 0.013
0.188CysTyr: 0.188 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.983AspAla: 4.983 ± 0.091
0.387AspCys: 0.387 ± 0.027
2.828AspAsp: 2.828 ± 0.068
4.094AspGlu: 4.094 ± 0.093
2.274AspPhe: 2.274 ± 0.064
4.444AspGly: 4.444 ± 0.122
1.11AspHis: 1.11 ± 0.046
2.324AspIle: 2.324 ± 0.06
2.144AspLys: 2.144 ± 0.067
6.38AspLeu: 6.38 ± 0.112
0.979AspMet: 0.979 ± 0.036
1.673AspAsn: 1.673 ± 0.061
3.552AspPro: 3.552 ± 0.08
2.368AspGln: 2.368 ± 0.07
2.88AspArg: 2.88 ± 0.076
2.814AspSer: 2.814 ± 0.084
2.496AspThr: 2.496 ± 0.066
3.458AspVal: 3.458 ± 0.076
0.915AspTrp: 0.915 ± 0.041
1.638AspTyr: 1.638 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
7.144GluAla: 7.144 ± 0.159
0.453GluCys: 0.453 ± 0.033
3.29GluAsp: 3.29 ± 0.077
3.987GluGlu: 3.987 ± 0.101
1.873GluPhe: 1.873 ± 0.051
4.178GluGly: 4.178 ± 0.105
1.187GluHis: 1.187 ± 0.042
3.686GluIle: 3.686 ± 0.083
3.553GluLys: 3.553 ± 0.099
6.847GluLeu: 6.847 ± 0.13
1.312GluMet: 1.312 ± 0.053
2.75GluAsn: 2.75 ± 0.08
3.387GluPro: 3.387 ± 0.19
2.7GluGln: 2.7 ± 0.074
3.378GluArg: 3.378 ± 0.099
3.049GluSer: 3.049 ± 0.075
3.542GluThr: 3.542 ± 0.07
5.235GluVal: 5.235 ± 0.099
0.736GluTrp: 0.736 ± 0.036
1.322GluTyr: 1.322 ± 0.051
0.0GluXaa: 0.0 ± 0.0
Phe
3.843PheAla: 3.843 ± 0.09
0.293PheCys: 0.293 ± 0.022
2.092PheAsp: 2.092 ± 0.058
2.119PheGlu: 2.119 ± 0.059
1.439PhePhe: 1.439 ± 0.057
3.005PheGly: 3.005 ± 0.078
0.604PheHis: 0.604 ± 0.031
1.577PheIle: 1.577 ± 0.065
1.416PheLys: 1.416 ± 0.064
3.461PheLeu: 3.461 ± 0.094
0.702PheMet: 0.702 ± 0.034
1.303PheAsn: 1.303 ± 0.043
1.654PhePro: 1.654 ± 0.064
1.04PheGln: 1.04 ± 0.038
1.638PheArg: 1.638 ± 0.057
2.576PheSer: 2.576 ± 0.079
2.271PheThr: 2.271 ± 0.065
2.695PheVal: 2.695 ± 0.068
0.564PheTrp: 0.564 ± 0.034
0.94PheTyr: 0.94 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
7.641GlyAla: 7.641 ± 0.146
0.576GlyCys: 0.576 ± 0.033
3.892GlyAsp: 3.892 ± 0.097
4.698GlyGlu: 4.698 ± 0.105
2.885GlyPhe: 2.885 ± 0.07
5.873GlyGly: 5.873 ± 0.112
1.5GlyHis: 1.5 ± 0.058
4.065GlyIle: 4.065 ± 0.08
4.341GlyLys: 4.341 ± 0.095
7.226GlyLeu: 7.226 ± 0.116
1.828GlyMet: 1.828 ± 0.054
2.626GlyAsn: 2.626 ± 0.086
2.933GlyPro: 2.933 ± 0.083
3.387GlyGln: 3.387 ± 0.083
4.349GlyArg: 4.349 ± 0.093
4.509GlySer: 4.509 ± 0.093
4.648GlyThr: 4.648 ± 0.108
5.824GlyVal: 5.824 ± 0.1
1.464GlyTrp: 1.464 ± 0.054
2.21GlyTyr: 2.21 ± 0.075
0.0GlyXaa: 0.0 ± 0.0
His
1.685HisAla: 1.685 ± 0.05
0.143HisCys: 0.143 ± 0.013
1.076HisAsp: 1.076 ± 0.042
1.181HisGlu: 1.181 ± 0.052
0.695HisPhe: 0.695 ± 0.031
1.447HisGly: 1.447 ± 0.044
0.478HisHis: 0.478 ± 0.032
0.697HisIle: 0.697 ± 0.032
0.55HisLys: 0.55 ± 0.031
2.213HisLeu: 2.213 ± 0.066
0.351HisMet: 0.351 ± 0.023
0.589HisAsn: 0.589 ± 0.033
1.356HisPro: 1.356 ± 0.041
0.816HisGln: 0.816 ± 0.038
1.077HisArg: 1.077 ± 0.045
0.957HisSer: 0.957 ± 0.043
0.987HisThr: 0.987 ± 0.041
1.212HisVal: 1.212 ± 0.047
0.305HisTrp: 0.305 ± 0.022
0.518HisTyr: 0.518 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
5.539IleAla: 5.539 ± 0.101
0.484IleCys: 0.484 ± 0.03
3.073IleAsp: 3.073 ± 0.069
2.955IleGlu: 2.955 ± 0.079
1.831IlePhe: 1.831 ± 0.071
3.752IleGly: 3.752 ± 0.094
0.811IleHis: 0.811 ± 0.038
2.42IleIle: 2.42 ± 0.065
1.953IleLys: 1.953 ± 0.065
4.208IleLeu: 4.208 ± 0.114
1.004IleMet: 1.004 ± 0.044
1.741IleAsn: 1.741 ± 0.054
2.643IlePro: 2.643 ± 0.078
1.33IleGln: 1.33 ± 0.044
2.402IleArg: 2.402 ± 0.066
3.492IleSer: 3.492 ± 0.08
3.162IleThr: 3.162 ± 0.08
3.848IleVal: 3.848 ± 0.091
0.703IleTrp: 0.703 ± 0.032
1.198IleTyr: 1.198 ± 0.049
0.0IleXaa: 0.0 ± 0.0
Lys
5.027LysAla: 5.027 ± 0.126
0.258LysCys: 0.258 ± 0.021
2.598LysAsp: 2.598 ± 0.079
2.911LysGlu: 2.911 ± 0.086
1.384LysPhe: 1.384 ± 0.054
2.679LysGly: 2.679 ± 0.072
0.858LysHis: 0.858 ± 0.039
2.293LysIle: 2.293 ± 0.06
2.153LysLys: 2.153 ± 0.072
4.498LysLeu: 4.498 ± 0.108
1.106LysMet: 1.106 ± 0.041
1.809LysAsn: 1.809 ± 0.07
3.624LysPro: 3.624 ± 0.264
1.995LysGln: 1.995 ± 0.062
2.506LysArg: 2.506 ± 0.064
2.543LysSer: 2.543 ± 0.068
2.952LysThr: 2.952 ± 0.077
3.965LysVal: 3.965 ± 0.114
0.662LysTrp: 0.662 ± 0.036
1.292LysTyr: 1.292 ± 0.045
0.0LysXaa: 0.0 ± 0.0
Leu
12.986LeuAla: 12.986 ± 0.221
0.752LeuCys: 0.752 ± 0.035
5.556LeuAsp: 5.556 ± 0.116
5.918LeuGlu: 5.918 ± 0.117
3.17LeuPhe: 3.17 ± 0.095
8.283LeuGly: 8.283 ± 0.145
1.777LeuHis: 1.777 ± 0.05
4.614LeuIle: 4.614 ± 0.111
3.898LeuLys: 3.898 ± 0.075
9.816LeuLeu: 9.816 ± 0.196
1.837LeuMet: 1.837 ± 0.058
3.131LeuAsn: 3.131 ± 0.072
5.989LeuPro: 5.989 ± 0.104
3.14LeuGln: 3.14 ± 0.078
5.129LeuArg: 5.129 ± 0.105
6.698LeuSer: 6.698 ± 0.117
6.469LeuThr: 6.469 ± 0.099
8.251LeuVal: 8.251 ± 0.145
1.259LeuTrp: 1.259 ± 0.051
2.028LeuTyr: 2.028 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
2.55MetAla: 2.55 ± 0.078
0.157MetCys: 0.157 ± 0.017
1.013MetAsp: 1.013 ± 0.038
1.048MetGlu: 1.048 ± 0.038
0.636MetPhe: 0.636 ± 0.035
1.605MetGly: 1.605 ± 0.054
0.348MetHis: 0.348 ± 0.027
1.073MetIle: 1.073 ± 0.043
0.926MetLys: 0.926 ± 0.039
2.103MetLeu: 2.103 ± 0.067
0.476MetMet: 0.476 ± 0.027
0.739MetAsn: 0.739 ± 0.029
1.276MetPro: 1.276 ± 0.05
0.634MetGln: 0.634 ± 0.032
1.273MetArg: 1.273 ± 0.049
1.56MetSer: 1.56 ± 0.056
1.323MetThr: 1.323 ± 0.041
1.629MetVal: 1.629 ± 0.059
0.247MetTrp: 0.247 ± 0.017
0.381MetTyr: 0.381 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.558AsnAla: 3.558 ± 0.082
0.296AsnCys: 0.296 ± 0.025
1.895AsnAsp: 1.895 ± 0.069
2.216AsnGlu: 2.216 ± 0.061
1.192AsnPhe: 1.192 ± 0.05
2.944AsnGly: 2.944 ± 0.101
0.722AsnHis: 0.722 ± 0.035
1.458AsnIle: 1.458 ± 0.052
1.278AsnLys: 1.278 ± 0.057
3.713AsnLeu: 3.713 ± 0.079
0.719AsnMet: 0.719 ± 0.03
1.295AsnAsn: 1.295 ± 0.065
2.449AsnPro: 2.449 ± 0.072
1.662AsnGln: 1.662 ± 0.052
1.834AsnArg: 1.834 ± 0.053
1.829AsnSer: 1.829 ± 0.056
1.881AsnThr: 1.881 ± 0.059
2.208AsnVal: 2.208 ± 0.069
0.634AsnTrp: 0.634 ± 0.032
0.968AsnTyr: 0.968 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
6.477ProAla: 6.477 ± 0.181
0.266ProCys: 0.266 ± 0.02
2.836ProAsp: 2.836 ± 0.073
4.595ProGlu: 4.595 ± 0.164
1.763ProPhe: 1.763 ± 0.053
3.89ProGly: 3.89 ± 0.085
0.915ProHis: 0.915 ± 0.034
2.321ProIle: 2.321 ± 0.063
3.416ProLys: 3.416 ± 0.168
4.587ProLeu: 4.587 ± 0.088
0.982ProMet: 0.982 ± 0.039
1.97ProAsn: 1.97 ± 0.064
2.073ProPro: 2.073 ± 0.084
2.43ProGln: 2.43 ± 0.064
2.435ProArg: 2.435 ± 0.075
3.132ProSer: 3.132 ± 0.097
4.148ProThr: 4.148 ± 0.201
4.712ProVal: 4.712 ± 0.157
0.792ProTrp: 0.792 ± 0.035
1.2ProTyr: 1.2 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
5.003GlnAla: 5.003 ± 0.112
0.282GlnCys: 0.282 ± 0.02
1.597GlnAsp: 1.597 ± 0.051
2.178GlnGlu: 2.178 ± 0.067
1.142GlnPhe: 1.142 ± 0.043
2.56GlnGly: 2.56 ± 0.068
0.667GlnHis: 0.667 ± 0.036
2.672GlnIle: 2.672 ± 0.069
1.807GlnLys: 1.807 ± 0.056
4.35GlnLeu: 4.35 ± 0.113
0.963GlnMet: 0.963 ± 0.039
1.469GlnAsn: 1.469 ± 0.047
2.058GlnPro: 2.058 ± 0.082
1.705GlnGln: 1.705 ± 0.057
2.155GlnArg: 2.155 ± 0.068
2.059GlnSer: 2.059 ± 0.057
2.313GlnThr: 2.313 ± 0.061
3.868GlnVal: 3.868 ± 0.084
0.594GlnTrp: 0.594 ± 0.034
0.874GlnTyr: 0.874 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
5.054ArgAla: 5.054 ± 0.105
0.371ArgCys: 0.371 ± 0.026
2.871ArgAsp: 2.871 ± 0.078
3.522ArgGlu: 3.522 ± 0.107
2.086ArgPhe: 2.086 ± 0.055
3.544ArgGly: 3.544 ± 0.092
1.051ArgHis: 1.051 ± 0.046
2.764ArgIle: 2.764 ± 0.067
2.368ArgLys: 2.368 ± 0.064
5.519ArgLeu: 5.519 ± 0.104
1.264ArgMet: 1.264 ± 0.045
1.724ArgAsn: 1.724 ± 0.049
2.82ArgPro: 2.82 ± 0.084
2.452ArgGln: 2.452 ± 0.07
3.633ArgArg: 3.633 ± 0.11
2.77ArgSer: 2.77 ± 0.078
3.007ArgThr: 3.007 ± 0.068
3.953ArgVal: 3.953 ± 0.08
0.868ArgTrp: 0.868 ± 0.042
1.511ArgTyr: 1.511 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
6.476SerAla: 6.476 ± 0.114
0.362SerCys: 0.362 ± 0.026
3.094SerAsp: 3.094 ± 0.077
3.683SerGlu: 3.683 ± 0.102
2.293SerPhe: 2.293 ± 0.064
5.08SerGly: 5.08 ± 0.094
1.112SerHis: 1.112 ± 0.04
2.39SerIle: 2.39 ± 0.073
2.587SerLys: 2.587 ± 0.064
6.039SerLeu: 6.039 ± 0.117
1.162SerMet: 1.162 ± 0.047
1.828SerAsn: 1.828 ± 0.05
2.961SerPro: 2.961 ± 0.067
2.623SerGln: 2.623 ± 0.08
3.239SerArg: 3.239 ± 0.077
3.699SerSer: 3.699 ± 0.095
3.599SerThr: 3.599 ± 0.074
4.137SerVal: 4.137 ± 0.095
0.946SerTrp: 0.946 ± 0.041
1.489SerTyr: 1.489 ± 0.053
0.0SerXaa: 0.0 ± 0.0
Thr
5.893ThrAla: 5.893 ± 0.101
0.536ThrCys: 0.536 ± 0.036
3.11ThrAsp: 3.11 ± 0.084
3.58ThrGlu: 3.58 ± 0.08
2.304ThrPhe: 2.304 ± 0.073
5.016ThrGly: 5.016 ± 0.094
1.102ThrHis: 1.102 ± 0.039
3.226ThrIle: 3.226 ± 0.082
2.852ThrLys: 2.852 ± 0.086
5.857ThrLeu: 5.857 ± 0.101
1.143ThrMet: 1.143 ± 0.042
2.111ThrAsn: 2.111 ± 0.068
4.056ThrPro: 4.056 ± 0.161
2.044ThrGln: 2.044 ± 0.052
2.664ThrArg: 2.664 ± 0.067
3.45ThrSer: 3.45 ± 0.075
3.578ThrThr: 3.578 ± 0.08
4.701ThrVal: 4.701 ± 0.125
1.029ThrTrp: 1.029 ± 0.045
1.611ThrTyr: 1.611 ± 0.049
0.0ThrXaa: 0.0 ± 0.0
Val
8.533ValAla: 8.533 ± 0.135
0.655ValCys: 0.655 ± 0.035
4.396ValAsp: 4.396 ± 0.104
4.809ValGlu: 4.809 ± 0.092
2.697ValPhe: 2.697 ± 0.075
5.584ValGly: 5.584 ± 0.094
1.226ValHis: 1.226 ± 0.045
3.758ValIle: 3.758 ± 0.088
3.959ValLys: 3.959 ± 0.15
7.364ValLeu: 7.364 ± 0.132
1.585ValMet: 1.585 ± 0.051
2.607ValAsn: 2.607 ± 0.072
4.274ValPro: 4.274 ± 0.098
2.128ValGln: 2.128 ± 0.047
3.867ValArg: 3.867 ± 0.083
5.077ValSer: 5.077 ± 0.087
4.717ValThr: 4.717 ± 0.11
6.577ValVal: 6.577 ± 0.128
1.013ValTrp: 1.013 ± 0.046
1.583ValTyr: 1.583 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
1.519TrpAla: 1.519 ± 0.057
0.133TrpCys: 0.133 ± 0.015
0.819TrpAsp: 0.819 ± 0.038
0.971TrpGlu: 0.971 ± 0.042
0.578TrpPhe: 0.578 ± 0.032
0.96TrpGly: 0.96 ± 0.043
0.332TrpHis: 0.332 ± 0.019
0.694TrpIle: 0.694 ± 0.033
0.623TrpLys: 0.623 ± 0.035
1.749TrpLeu: 1.749 ± 0.057
0.421TrpMet: 0.421 ± 0.027
0.553TrpAsn: 0.553 ± 0.03
0.644TrpPro: 0.644 ± 0.034
0.913TrpGln: 0.913 ± 0.042
0.99TrpArg: 0.99 ± 0.04
0.797TrpSer: 0.797 ± 0.037
0.745TrpThr: 0.745 ± 0.039
0.957TrpVal: 0.957 ± 0.042
0.368TrpTrp: 0.368 ± 0.027
0.354TrpTyr: 0.354 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.304TyrAla: 2.304 ± 0.063
0.251TyrCys: 0.251 ± 0.02
1.373TyrAsp: 1.373 ± 0.049
1.372TyrGlu: 1.372 ± 0.05
1.054TyrPhe: 1.054 ± 0.047
2.003TyrGly: 2.003 ± 0.064
0.479TyrHis: 0.479 ± 0.03
0.877TyrIle: 0.877 ± 0.041
0.689TyrLys: 0.689 ± 0.034
2.741TyrLeu: 2.741 ± 0.071
0.371TyrMet: 0.371 ± 0.028
0.698TyrAsn: 0.698 ± 0.035
1.344TyrPro: 1.344 ± 0.042
1.424TyrGln: 1.424 ± 0.052
1.787TyrArg: 1.787 ± 0.059
1.398TyrSer: 1.398 ± 0.054
1.317TyrThr: 1.317 ± 0.058
1.665TyrVal: 1.665 ± 0.048
0.374TyrTrp: 0.374 ± 0.026
0.727TyrTyr: 0.727 ± 0.038
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1712 proteins (638556 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski