Amino acid dipeptide frequency for Schaalia turicensis ACS-279-V-Col4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and those which are under-represented are marked in blue in the table.
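The reasoning above can be illustrated with a short Python sketch that computes the uniform expectation and labels a few values copied from the table. The helper name `classify` and the plain above/below comparison are assumptions made for illustration; the exact rule used to colour the table is not stated on this page.

```python
# Uniform expectation: 20 x 20 = 400 dipeptides, each equally likely.
N_STANDARD = 20
expected_permille = 1000 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25 %

def classify(observed_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# A few values copied from the table (per mille)
examples = {"AlaAla": 10.779, "AlaLeu": 12.474, "CysCys": 0.087, "TrpTrp": 0.319}
labels = {dipeptide: classify(value) for dipeptide, value in examples.items()}

print(expected_permille)
print(labels)
```

With this simple rule, AlaAla and AlaLeu fall above the uniform expectation, while CysCys and TrpTrp fall below it.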

All values are presented in per mille (‰), and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.
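As a minimal sketch of how such per mille values could be obtained, the Python example below counts overlapping residue pairs within each protein and scales the result to ‰. It uses one-letter amino acid codes and toy sequences; the function name `dipeptide_permille` and the choice to count overlapping pairs are assumptions, not a description of how the database computes its values.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dipeptide: 1000 * n / total for dipeptide, n in counts.items()}

# Toy input (hypothetical sequences, one-letter codes)
toy_proteins = ["MAAL", "ALA"]
freqs = dipeptide_permille(toy_proteins)

# Converting a per mille value back to a plain fraction
fraction_AL = freqs["AL"] * 1e-3

print(freqs)
print(fraction_AL)
```

Here the pair "AL" occurs in 2 of the 5 counted pairs, so it is reported as 400‰, which corresponds to a fraction of 0.4.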

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.779AlaAla: 10.779 ± 0.219
1.016AlaCys: 1.016 ± 0.044
7.089AlaAsp: 7.089 ± 0.112
6.291AlaGlu: 6.291 ± 0.141
3.415AlaPhe: 3.415 ± 0.081
9.592AlaGly: 9.592 ± 0.162
2.57AlaHis: 2.57 ± 0.075
5.787AlaIle: 5.787 ± 0.115
4.005AlaLys: 4.005 ± 0.11
12.474AlaLeu: 12.474 ± 0.188
2.836AlaMet: 2.836 ± 0.07
2.962AlaAsn: 2.962 ± 0.083
4.974AlaPro: 4.974 ± 0.115
4.458AlaGln: 4.458 ± 0.108
7.211AlaArg: 7.211 ± 0.153
7.691AlaSer: 7.691 ± 0.15
7.258AlaThr: 7.258 ± 0.128
8.101AlaVal: 8.101 ± 0.141
1.655AlaTrp: 1.655 ± 0.061
2.204AlaTyr: 2.204 ± 0.07
0.0AlaXaa: 0.0 ± 0.0
Cys
0.965CysAla: 0.965 ± 0.042
0.087CysCys: 0.087 ± 0.013
0.488CysAsp: 0.488 ± 0.029
0.533CysGlu: 0.533 ± 0.032
0.258CysPhe: 0.258 ± 0.024
0.826CysGly: 0.826 ± 0.042
0.209CysHis: 0.209 ± 0.018
0.282CysIle: 0.282 ± 0.022
0.169CysLys: 0.169 ± 0.02
0.681CysLeu: 0.681 ± 0.038
0.16CysMet: 0.16 ± 0.015
0.16CysAsn: 0.16 ± 0.017
0.399CysPro: 0.399 ± 0.03
0.23CysGln: 0.23 ± 0.019
0.369CysArg: 0.369 ± 0.027
0.535CysSer: 0.535 ± 0.031
0.455CysThr: 0.455 ± 0.03
0.683CysVal: 0.683 ± 0.039
0.082CysTrp: 0.082 ± 0.013
0.162CysTyr: 0.162 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
7.141AspAla: 7.141 ± 0.118
0.415AspCys: 0.415 ± 0.03
3.859AspAsp: 3.859 ± 0.1
4.794AspGlu: 4.794 ± 0.093
1.944AspPhe: 1.944 ± 0.06
5.221AspGly: 5.221 ± 0.118
1.273AspHis: 1.273 ± 0.045
3.014AspIle: 3.014 ± 0.075
1.704AspLys: 1.704 ± 0.064
6.005AspLeu: 6.005 ± 0.112
1.28AspMet: 1.28 ± 0.045
1.441AspAsn: 1.441 ± 0.058
3.695AspPro: 3.695 ± 0.083
2.301AspGln: 2.301 ± 0.068
3.5AspArg: 3.5 ± 0.089
3.646AspSer: 3.646 ± 0.082
3.256AspThr: 3.256 ± 0.089
5.275AspVal: 5.275 ± 0.116
0.88AspTrp: 0.88 ± 0.042
1.53AspTyr: 1.53 ± 0.057
0.0AspXaa: 0.0 ± 0.0
Glu
7.514GluAla: 7.514 ± 0.162
0.484GluCys: 0.484 ± 0.033
3.496GluAsp: 3.496 ± 0.089
4.371GluGlu: 4.371 ± 0.117
1.753GluPhe: 1.753 ± 0.053
4.439GluGly: 4.439 ± 0.093
1.453GluHis: 1.453 ± 0.056
3.425GluIle: 3.425 ± 0.102
2.401GluLys: 2.401 ± 0.081
6.023GluLeu: 6.023 ± 0.128
1.446GluMet: 1.446 ± 0.054
1.911GluAsn: 1.911 ± 0.07
2.55GluPro: 2.55 ± 0.082
2.246GluGln: 2.246 ± 0.067
4.287GluArg: 4.287 ± 0.104
3.458GluSer: 3.458 ± 0.079
3.322GluThr: 3.322 ± 0.084
4.773GluVal: 4.773 ± 0.109
0.791GluTrp: 0.791 ± 0.036
1.456GluTyr: 1.456 ± 0.06
0.0GluXaa: 0.0 ± 0.0
Phe
3.458PheAla: 3.458 ± 0.096
0.233PheCys: 0.233 ± 0.019
2.392PheAsp: 2.392 ± 0.065
1.918PheGlu: 1.918 ± 0.057
1.155PhePhe: 1.155 ± 0.057
2.932PheGly: 2.932 ± 0.076
0.625PheHis: 0.625 ± 0.032
1.676PheIle: 1.676 ± 0.062
0.936PheLys: 0.936 ± 0.048
2.739PheLeu: 2.739 ± 0.081
0.636PheMet: 0.636 ± 0.034
0.963PheAsn: 0.963 ± 0.045
1.402PhePro: 1.402 ± 0.054
0.791PheGln: 0.791 ± 0.044
1.375PheArg: 1.375 ± 0.046
2.143PheSer: 2.143 ± 0.066
2.219PheThr: 2.219 ± 0.061
2.683PheVal: 2.683 ± 0.073
0.427PheTrp: 0.427 ± 0.03
0.679PheTyr: 0.679 ± 0.041
0.0PheXaa: 0.0 ± 0.0
Gly
8.73GlyAla: 8.73 ± 0.143
0.613GlyCys: 0.613 ± 0.032
4.531GlyAsp: 4.531 ± 0.094
5.002GlyGlu: 5.002 ± 0.098
3.005GlyPhe: 3.005 ± 0.079
6.434GlyGly: 6.434 ± 0.126
1.89GlyHis: 1.89 ± 0.066
5.002GlyIle: 5.002 ± 0.107
3.329GlyLys: 3.329 ± 0.085
7.653GlyLeu: 7.653 ± 0.151
2.078GlyMet: 2.078 ± 0.06
2.15GlyAsn: 2.15 ± 0.074
2.922GlyPro: 2.922 ± 0.071
2.824GlyGln: 2.824 ± 0.076
4.967GlyArg: 4.967 ± 0.108
5.108GlySer: 5.108 ± 0.101
5.286GlyThr: 5.286 ± 0.111
7.228GlyVal: 7.228 ± 0.134
1.376GlyTrp: 1.376 ± 0.072
2.017GlyTyr: 2.017 ± 0.063
0.0GlyXaa: 0.0 ± 0.0
His
2.232HisAla: 2.232 ± 0.062
0.22HisCys: 0.22 ± 0.022
1.253HisAsp: 1.253 ± 0.048
1.221HisGlu: 1.221 ± 0.047
0.65HisPhe: 0.65 ± 0.037
1.753HisGly: 1.753 ± 0.058
0.605HisHis: 0.605 ± 0.032
1.077HisIle: 1.077 ± 0.043
0.484HisLys: 0.484 ± 0.027
2.094HisLeu: 2.094 ± 0.061
0.542HisMet: 0.542 ± 0.029
0.601HisAsn: 0.601 ± 0.032
1.334HisPro: 1.334 ± 0.048
0.672HisGln: 0.672 ± 0.039
1.477HisArg: 1.477 ± 0.045
1.277HisSer: 1.277 ± 0.056
1.263HisThr: 1.263 ± 0.052
1.834HisVal: 1.834 ± 0.06
0.315HisTrp: 0.315 ± 0.024
0.476HisTyr: 0.476 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
6.845IleAla: 6.845 ± 0.125
0.436IleCys: 0.436 ± 0.026
4.183IleAsp: 4.183 ± 0.096
3.709IleGlu: 3.709 ± 0.086
1.491IlePhe: 1.491 ± 0.056
4.824IleGly: 4.824 ± 0.114
1.075IleHis: 1.075 ± 0.042
2.871IleIle: 2.871 ± 0.08
1.364IleLys: 1.364 ± 0.054
4.451IleLeu: 4.451 ± 0.1
1.101IleMet: 1.101 ± 0.047
1.474IleAsn: 1.474 ± 0.052
2.859IlePro: 2.859 ± 0.068
1.408IleGln: 1.408 ± 0.049
2.977IleArg: 2.977 ± 0.075
3.214IleSer: 3.214 ± 0.088
3.246IleThr: 3.246 ± 0.085
5.002IleVal: 5.002 ± 0.1
0.592IleTrp: 0.592 ± 0.039
0.995IleTyr: 0.995 ± 0.043
0.0IleXaa: 0.0 ± 0.0
Lys
4.012LysAla: 4.012 ± 0.11
0.202LysCys: 0.202 ± 0.018
2.099LysAsp: 2.099 ± 0.071
2.068LysGlu: 2.068 ± 0.075
0.772LysPhe: 0.772 ± 0.039
2.47LysGly: 2.47 ± 0.079
0.638LysHis: 0.638 ± 0.034
1.751LysIle: 1.751 ± 0.054
1.819LysLys: 1.819 ± 0.078
2.712LysLeu: 2.712 ± 0.087
0.834LysMet: 0.834 ± 0.035
1.199LysAsn: 1.199 ± 0.053
1.695LysPro: 1.695 ± 0.06
1.117LysGln: 1.117 ± 0.066
2.303LysArg: 2.303 ± 0.068
1.854LysSer: 1.854 ± 0.061
2.294LysThr: 2.294 ± 0.08
2.721LysVal: 2.721 ± 0.073
0.47LysTrp: 0.47 ± 0.029
0.744LysTyr: 0.744 ± 0.04
0.0LysXaa: 0.0 ± 0.0
Leu
11.78LeuAla: 11.78 ± 0.166
0.584LeuCys: 0.584 ± 0.033
6.259LeuAsp: 6.259 ± 0.115
5.516LeuGlu: 5.516 ± 0.114
2.739LeuPhe: 2.739 ± 0.093
7.991LeuGly: 7.991 ± 0.136
1.88LeuHis: 1.88 ± 0.066
5.134LeuIle: 5.134 ± 0.117
3.091LeuLys: 3.091 ± 0.083
8.542LeuLeu: 8.542 ± 0.181
2.068LeuMet: 2.068 ± 0.059
2.74LeuAsn: 2.74 ± 0.069
4.693LeuPro: 4.693 ± 0.087
2.211LeuGln: 2.211 ± 0.064
5.815LeuArg: 5.815 ± 0.121
6.291LeuSer: 6.291 ± 0.103
6.047LeuThr: 6.047 ± 0.092
8.017LeuVal: 8.017 ± 0.121
1.024LeuTrp: 1.024 ± 0.044
1.714LeuTyr: 1.714 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
2.488MetAla: 2.488 ± 0.067
0.2MetCys: 0.2 ± 0.019
1.253MetAsp: 1.253 ± 0.047
1.115MetGlu: 1.115 ± 0.041
0.667MetPhe: 0.667 ± 0.035
1.786MetGly: 1.786 ± 0.057
0.429MetHis: 0.429 ± 0.027
1.167MetIle: 1.167 ± 0.049
0.892MetLys: 0.892 ± 0.042
1.981MetLeu: 1.981 ± 0.069
0.57MetMet: 0.57 ± 0.031
0.852MetAsn: 0.852 ± 0.039
1.308MetPro: 1.308 ± 0.048
0.591MetGln: 0.591 ± 0.027
1.702MetArg: 1.702 ± 0.055
1.929MetSer: 1.929 ± 0.059
1.949MetThr: 1.949 ± 0.057
1.746MetVal: 1.746 ± 0.059
0.305MetTrp: 0.305 ± 0.026
0.387MetTyr: 0.387 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.235AsnAla: 3.235 ± 0.079
0.193AsnCys: 0.193 ± 0.017
1.706AsnAsp: 1.706 ± 0.055
1.699AsnGlu: 1.699 ± 0.058
0.796AsnPhe: 0.796 ± 0.041
2.549AsnGly: 2.549 ± 0.097
0.538AsnHis: 0.538 ± 0.033
1.338AsnIle: 1.338 ± 0.053
0.824AsnLys: 0.824 ± 0.052
2.613AsnLeu: 2.613 ± 0.069
0.61AsnMet: 0.61 ± 0.035
0.747AsnAsn: 0.747 ± 0.046
2.038AsnPro: 2.038 ± 0.061
0.882AsnGln: 0.882 ± 0.045
1.477AsnArg: 1.477 ± 0.054
1.561AsnSer: 1.561 ± 0.053
1.615AsnThr: 1.615 ± 0.058
2.329AsnVal: 2.329 ± 0.068
0.486AsnTrp: 0.486 ± 0.031
0.695AsnTyr: 0.695 ± 0.038
0.0AsnXaa: 0.0 ± 0.0
Pro
5.42ProAla: 5.42 ± 0.119
0.31ProCys: 0.31 ± 0.026
3.254ProAsp: 3.254 ± 0.076
3.523ProGlu: 3.523 ± 0.086
1.49ProPhe: 1.49 ± 0.05
3.85ProGly: 3.85 ± 0.088
1.11ProHis: 1.11 ± 0.044
2.453ProIle: 2.453 ± 0.068
1.599ProLys: 1.599 ± 0.06
4.199ProLeu: 4.199 ± 0.091
0.955ProMet: 0.955 ± 0.036
1.436ProAsn: 1.436 ± 0.054
1.472ProPro: 1.472 ± 0.059
1.704ProGln: 1.704 ± 0.065
2.519ProArg: 2.519 ± 0.07
3.383ProSer: 3.383 ± 0.085
3.296ProThr: 3.296 ± 0.077
4.078ProVal: 4.078 ± 0.096
0.768ProTrp: 0.768 ± 0.038
1.153ProTyr: 1.153 ± 0.052
0.0ProXaa: 0.0 ± 0.0
Gln
4.293GlnAla: 4.293 ± 0.107
0.223GlnCys: 0.223 ± 0.019
1.519GlnAsp: 1.519 ± 0.06
1.697GlnGlu: 1.697 ± 0.063
1.045GlnPhe: 1.045 ± 0.042
2.361GlnGly: 2.361 ± 0.064
0.557GlnHis: 0.557 ± 0.033
2.113GlnIle: 2.113 ± 0.066
1.111GlnLys: 1.111 ± 0.059
3.118GlnLeu: 3.118 ± 0.086
1.009GlnMet: 1.009 ± 0.039
0.8GlnAsn: 0.8 ± 0.041
1.455GlnPro: 1.455 ± 0.053
1.218GlnGln: 1.218 ± 0.054
2.24GlnArg: 2.24 ± 0.066
1.981GlnSer: 1.981 ± 0.077
1.986GlnThr: 1.986 ± 0.068
2.854GlnVal: 2.854 ± 0.075
0.618GlnTrp: 0.618 ± 0.034
0.679GlnTyr: 0.679 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
6.516ArgAla: 6.516 ± 0.12
0.416ArgCys: 0.416 ± 0.026
3.437ArgAsp: 3.437 ± 0.091
4.035ArgGlu: 4.035 ± 0.101
2.068ArgPhe: 2.068 ± 0.054
4.486ArgGly: 4.486 ± 0.098
1.317ArgHis: 1.317 ± 0.053
3.651ArgIle: 3.651 ± 0.089
2.038ArgLys: 2.038 ± 0.07
5.833ArgLeu: 5.833 ± 0.112
1.591ArgMet: 1.591 ± 0.055
1.514ArgAsn: 1.514 ± 0.056
2.81ArgPro: 2.81 ± 0.083
2.329ArgGln: 2.329 ± 0.073
4.648ArgArg: 4.648 ± 0.138
3.848ArgSer: 3.848 ± 0.094
3.604ArgThr: 3.604 ± 0.081
4.766ArgVal: 4.766 ± 0.116
0.934ArgTrp: 0.934 ± 0.042
1.479ArgTyr: 1.479 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
7.237SerAla: 7.237 ± 0.15
0.47SerCys: 0.47 ± 0.034
3.989SerAsp: 3.989 ± 0.081
3.524SerGlu: 3.524 ± 0.083
2.188SerPhe: 2.188 ± 0.061
5.942SerGly: 5.942 ± 0.113
1.406SerHis: 1.406 ± 0.051
3.373SerIle: 3.373 ± 0.092
1.984SerLys: 1.984 ± 0.058
5.878SerLeu: 5.878 ± 0.107
1.592SerMet: 1.592 ± 0.055
1.768SerAsn: 1.768 ± 0.062
2.929SerPro: 2.929 ± 0.085
2.329SerGln: 2.329 ± 0.074
3.662SerArg: 3.662 ± 0.083
4.671SerSer: 4.671 ± 0.123
4.143SerThr: 4.143 ± 0.11
5.012SerVal: 5.012 ± 0.116
0.976SerTrp: 0.976 ± 0.039
1.448SerTyr: 1.448 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
6.09ThrAla: 6.09 ± 0.123
0.535ThrCys: 0.535 ± 0.031
3.693ThrAsp: 3.693 ± 0.097
3.202ThrGlu: 3.202 ± 0.076
1.972ThrPhe: 1.972 ± 0.058
5.359ThrGly: 5.359 ± 0.104
1.47ThrHis: 1.47 ± 0.052
3.592ThrIle: 3.592 ± 0.09
2.08ThrLys: 2.08 ± 0.076
6.165ThrLeu: 6.165 ± 0.115
1.294ThrMet: 1.294 ± 0.044
1.77ThrAsn: 1.77 ± 0.065
3.888ThrPro: 3.888 ± 0.089
2.077ThrGln: 2.077 ± 0.062
3.465ThrArg: 3.465 ± 0.085
4.138ThrSer: 4.138 ± 0.088
3.993ThrThr: 3.993 ± 0.108
5.139ThrVal: 5.139 ± 0.126
1.059ThrTrp: 1.059 ± 0.05
1.573ThrTyr: 1.573 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
9.78ValAla: 9.78 ± 0.157
0.754ValCys: 0.754 ± 0.037
5.533ValAsp: 5.533 ± 0.117
5.31ValGlu: 5.31 ± 0.133
2.693ValPhe: 2.693 ± 0.073
6.338ValGly: 6.338 ± 0.106
1.604ValHis: 1.604 ± 0.048
4.672ValIle: 4.672 ± 0.084
2.751ValLys: 2.751 ± 0.07
7.502ValLeu: 7.502 ± 0.137
1.836ValMet: 1.836 ± 0.057
2.293ValAsn: 2.293 ± 0.065
3.894ValPro: 3.894 ± 0.076
2.152ValGln: 2.152 ± 0.063
4.986ValArg: 4.986 ± 0.099
5.451ValSer: 5.451 ± 0.104
5.138ValThr: 5.138 ± 0.103
7.207ValVal: 7.207 ± 0.143
1.005ValTrp: 1.005 ± 0.048
1.537ValTyr: 1.537 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
1.469TrpAla: 1.469 ± 0.061
0.155TrpCys: 0.155 ± 0.016
0.834TrpAsp: 0.834 ± 0.035
0.787TrpGlu: 0.787 ± 0.035
0.472TrpPhe: 0.472 ± 0.028
1.017TrpGly: 1.017 ± 0.044
0.303TrpHis: 0.303 ± 0.025
0.84TrpIle: 0.84 ± 0.036
0.549TrpLys: 0.549 ± 0.034
1.364TrpLeu: 1.364 ± 0.058
0.422TrpMet: 0.422 ± 0.028
0.594TrpAsn: 0.594 ± 0.037
0.575TrpPro: 0.575 ± 0.032
0.523TrpGln: 0.523 ± 0.028
0.862TrpArg: 0.862 ± 0.042
0.852TrpSer: 0.852 ± 0.04
0.887TrpThr: 0.887 ± 0.037
1.199TrpVal: 1.199 ± 0.051
0.319TrpTrp: 0.319 ± 0.026
0.371TrpTyr: 0.371 ± 0.034
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.383TyrAla: 2.383 ± 0.067
0.186TyrCys: 0.186 ± 0.018
1.307TyrAsp: 1.307 ± 0.054
1.287TyrGlu: 1.287 ± 0.044
0.81TyrPhe: 0.81 ± 0.04
1.976TyrGly: 1.976 ± 0.057
0.427TyrHis: 0.427 ± 0.027
0.897TyrIle: 0.897 ± 0.036
0.645TyrLys: 0.645 ± 0.035
2.11TyrLeu: 2.11 ± 0.06
0.453TyrMet: 0.453 ± 0.028
0.557TyrAsn: 0.557 ± 0.035
1.078TyrPro: 1.078 ± 0.05
0.817TyrGln: 0.817 ± 0.04
1.477TyrArg: 1.477 ± 0.05
1.491TyrSer: 1.491 ± 0.058
1.3TyrThr: 1.3 ± 0.052
1.787TyrVal: 1.787 ± 0.054
0.31TyrTrp: 0.31 ± 0.023
0.573TyrTyr: 0.573 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1709 proteins (574013 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
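Bootstrapping at the protein level can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. The function names, the toy sequences, and the use of the standard deviation as the reported ± value are assumptions for illustration only.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std) in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return mean(estimates), stdev(estimates)

# Toy input (hypothetical sequences, one-letter codes)
toy_proteins = ["MAAL", "ALA", "MKV", "AALL"]
boot_mean, boot_err = bootstrap_error(toy_proteins, "AL")
print(f"AL: {boot_mean:.3f} ± {boot_err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.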

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski