Amino acid dipeptide frequency for Kitasatospora atroaurantiaca

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
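The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs never span two proteins, and every non-standard
    residue is replaced by 'X'.
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"
```

With this convention, the AlaAla value in the table (22.544) would be classified as "over" and CysCys (0.128) as "under".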

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
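As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for illustration) turn a per mille value from the table into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value / 10.0
```

For instance, the AlaAla entry of 22.544‰ corresponds to a fraction of 0.022544, that is, about 2.25% of all dipeptides.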

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
22.544AlaAla: 22.544 ± 0.179
1.032AlaCys: 1.032 ± 0.02
8.345AlaAsp: 8.345 ± 0.082
9.191AlaGlu: 9.191 ± 0.095
3.541AlaPhe: 3.541 ± 0.044
13.446AlaGly: 13.446 ± 0.088
2.647AlaHis: 2.647 ± 0.035
3.653AlaIle: 3.653 ± 0.044
3.026AlaLys: 3.026 ± 0.041
14.546AlaLeu: 14.546 ± 0.11
2.455AlaMet: 2.455 ± 0.036
2.249AlaAsn: 2.249 ± 0.04
7.198AlaPro: 7.198 ± 0.084
4.136AlaGln: 4.136 ± 0.054
9.519AlaArg: 9.519 ± 0.087
5.996AlaSer: 5.996 ± 0.065
7.315AlaThr: 7.315 ± 0.074
12.485AlaVal: 12.485 ± 0.101
1.91AlaTrp: 1.91 ± 0.029
2.782AlaTyr: 2.782 ± 0.04
0.0AlaXaa: 0.0 ± 0.0
Cys
1.055CysAla: 1.055 ± 0.022
0.128CysCys: 0.128 ± 0.008
0.434CysAsp: 0.434 ± 0.014
0.378CysGlu: 0.378 ± 0.014
0.227CysPhe: 0.227 ± 0.011
0.95CysGly: 0.95 ± 0.023
0.204CysHis: 0.204 ± 0.01
0.196CysIle: 0.196 ± 0.01
0.12CysLys: 0.12 ± 0.007
0.799CysLeu: 0.799 ± 0.02
0.126CysMet: 0.126 ± 0.007
0.155CysAsn: 0.155 ± 0.009
0.527CysPro: 0.527 ± 0.021
0.185CysGln: 0.185 ± 0.009
0.612CysArg: 0.612 ± 0.017
0.492CysSer: 0.492 ± 0.018
0.557CysThr: 0.557 ± 0.016
0.57CysVal: 0.57 ± 0.015
0.15CysTrp: 0.15 ± 0.008
0.16CysTyr: 0.16 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
6.92AspAla: 6.92 ± 0.056
0.433AspCys: 0.433 ± 0.015
2.98AspAsp: 2.98 ± 0.047
3.37AspGlu: 3.37 ± 0.046
1.622AspPhe: 1.622 ± 0.028
6.082AspGly: 6.082 ± 0.068
1.396AspHis: 1.396 ± 0.027
1.758AspIle: 1.758 ± 0.03
1.033AspLys: 1.033 ± 0.022
6.18AspLeu: 6.18 ± 0.059
0.69AspMet: 0.69 ± 0.018
1.002AspAsn: 1.002 ± 0.024
4.407AspPro: 4.407 ± 0.047
1.72AspGln: 1.72 ± 0.029
4.648AspArg: 4.648 ± 0.051
2.64AspSer: 2.64 ± 0.041
2.993AspThr: 2.993 ± 0.04
4.166AspVal: 4.166 ± 0.042
0.996AspTrp: 0.996 ± 0.025
1.135AspTyr: 1.135 ± 0.021
0.0AspXaa: 0.0 ± 0.0
Glu
7.222GluAla: 7.222 ± 0.078
0.358GluCys: 0.358 ± 0.014
2.559GluAsp: 2.559 ± 0.036
3.234GluGlu: 3.234 ± 0.051
1.443GluPhe: 1.443 ± 0.027
3.831GluGly: 3.831 ± 0.046
1.515GluHis: 1.515 ± 0.03
2.191GluIle: 2.191 ± 0.035
1.24GluLys: 1.24 ± 0.027
7.271GluLeu: 7.271 ± 0.078
0.781GluMet: 0.781 ± 0.018
0.962GluAsn: 0.962 ± 0.02
3.279GluPro: 3.279 ± 0.046
2.517GluGln: 2.517 ± 0.042
4.829GluArg: 4.829 ± 0.061
2.419GluSer: 2.419 ± 0.032
2.675GluThr: 2.675 ± 0.037
4.401GluVal: 4.401 ± 0.052
0.757GluTrp: 0.757 ± 0.02
1.138GluTyr: 1.138 ± 0.023
0.0GluXaa: 0.0 ± 0.0
Phe
3.677PheAla: 3.677 ± 0.05
0.287PheCys: 0.287 ± 0.011
1.909PheAsp: 1.909 ± 0.029
1.434PheGlu: 1.434 ± 0.025
0.842PhePhe: 0.842 ± 0.022
2.996PheGly: 2.996 ± 0.041
0.613PheHis: 0.613 ± 0.017
0.777PheIle: 0.777 ± 0.018
0.53PheLys: 0.53 ± 0.018
2.537PheLeu: 2.537 ± 0.04
0.369PheMet: 0.369 ± 0.013
0.657PheAsn: 0.657 ± 0.021
1.385PhePro: 1.385 ± 0.026
0.782PheGln: 0.782 ± 0.021
1.79PheArg: 1.79 ± 0.031
1.498PheSer: 1.498 ± 0.029
2.055PheThr: 2.055 ± 0.032
2.013PheVal: 2.013 ± 0.031
0.432PheTrp: 0.432 ± 0.014
0.581PheTyr: 0.581 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
10.744GlyAla: 10.744 ± 0.08
0.841GlyCys: 0.841 ± 0.021
4.588GlyAsp: 4.588 ± 0.05
4.893GlyGlu: 4.893 ± 0.055
2.852GlyPhe: 2.852 ± 0.037
8.728GlyGly: 8.728 ± 0.098
2.252GlyHis: 2.252 ± 0.037
3.424GlyIle: 3.424 ± 0.038
2.388GlyLys: 2.388 ± 0.041
9.897GlyLeu: 9.897 ± 0.079
1.884GlyMet: 1.884 ± 0.031
1.912GlyAsn: 1.912 ± 0.041
5.128GlyPro: 5.128 ± 0.056
2.983GlyGln: 2.983 ± 0.038
7.606GlyArg: 7.606 ± 0.073
5.805GlySer: 5.805 ± 0.069
6.292GlyThr: 6.292 ± 0.075
7.236GlyVal: 7.236 ± 0.07
1.808GlyTrp: 1.808 ± 0.031
2.471GlyTyr: 2.471 ± 0.037
0.0GlyXaa: 0.0 ± 0.0
His
2.526HisAla: 2.526 ± 0.039
0.22HisCys: 0.22 ± 0.009
1.234HisAsp: 1.234 ± 0.026
1.103HisGlu: 1.103 ± 0.025
0.631HisPhe: 0.631 ± 0.018
2.344HisGly: 2.344 ± 0.039
0.688HisHis: 0.688 ± 0.02
0.635HisIle: 0.635 ± 0.017
0.343HisLys: 0.343 ± 0.013
2.478HisLeu: 2.478 ± 0.037
0.309HisMet: 0.309 ± 0.012
0.415HisAsn: 0.415 ± 0.014
1.852HisPro: 1.852 ± 0.033
0.729HisGln: 0.729 ± 0.019
2.113HisArg: 2.113 ± 0.035
1.061HisSer: 1.061 ± 0.023
1.256HisThr: 1.256 ± 0.028
1.52HisVal: 1.52 ± 0.029
0.4HisTrp: 0.4 ± 0.015
0.493HisTyr: 0.493 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
4.961IleAla: 4.961 ± 0.052
0.302IleCys: 0.302 ± 0.011
2.208IleAsp: 2.208 ± 0.028
1.99IleGlu: 1.99 ± 0.036
0.761IlePhe: 0.761 ± 0.02
3.702IleGly: 3.702 ± 0.048
0.62IleHis: 0.62 ± 0.018
0.936IleIle: 0.936 ± 0.023
0.742IleLys: 0.742 ± 0.021
2.52IleLeu: 2.52 ± 0.037
0.405IleMet: 0.405 ± 0.012
0.778IleAsn: 0.778 ± 0.021
1.934IlePro: 1.934 ± 0.03
0.771IleGln: 0.771 ± 0.02
2.288IleArg: 2.288 ± 0.036
1.846IleSer: 1.846 ± 0.028
2.353IleThr: 2.353 ± 0.036
2.528IleVal: 2.528 ± 0.042
0.421IleTrp: 0.421 ± 0.013
0.555IleTyr: 0.555 ± 0.016
0.0IleXaa: 0.0 ± 0.0
Lys
3.007LysAla: 3.007 ± 0.049
0.114LysCys: 0.114 ± 0.008
1.183LysAsp: 1.183 ± 0.025
1.015LysGlu: 1.015 ± 0.024
0.479LysPhe: 0.479 ± 0.015
1.679LysGly: 1.679 ± 0.034
0.46LysHis: 0.46 ± 0.014
0.867LysIle: 0.867 ± 0.021
0.662LysLys: 0.662 ± 0.022
2.141LysLeu: 2.141 ± 0.032
0.363LysMet: 0.363 ± 0.013
0.5LysAsn: 0.5 ± 0.017
1.395LysPro: 1.395 ± 0.031
0.762LysGln: 0.762 ± 0.019
1.32LysArg: 1.32 ± 0.028
1.132LysSer: 1.132 ± 0.022
1.173LysThr: 1.173 ± 0.028
1.899LysVal: 1.899 ± 0.035
0.258LysTrp: 0.258 ± 0.011
0.486LysTyr: 0.486 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
16.301LeuAla: 16.301 ± 0.117
0.867LeuCys: 0.867 ± 0.022
6.59LeuAsp: 6.59 ± 0.064
4.621LeuGlu: 4.621 ± 0.047
2.616LeuPhe: 2.616 ± 0.039
9.657LeuGly: 9.657 ± 0.088
2.324LeuHis: 2.324 ± 0.036
3.48LeuIle: 3.48 ± 0.045
1.981LeuLys: 1.981 ± 0.037
12.053LeuLeu: 12.053 ± 0.122
1.579LeuMet: 1.579 ± 0.03
1.893LeuAsn: 1.893 ± 0.033
6.84LeuPro: 6.84 ± 0.065
2.638LeuGln: 2.638 ± 0.038
8.549LeuArg: 8.549 ± 0.077
5.448LeuSer: 5.448 ± 0.048
7.07LeuThr: 7.07 ± 0.063
8.882LeuVal: 8.882 ± 0.079
1.301LeuTrp: 1.301 ± 0.026
1.848LeuTyr: 1.848 ± 0.033
0.0LeuXaa: 0.0 ± 0.0
Met
2.178MetAla: 2.178 ± 0.032
0.107MetCys: 0.107 ± 0.007
0.797MetAsp: 0.797 ± 0.018
0.654MetGlu: 0.654 ± 0.018
0.442MetPhe: 0.442 ± 0.015
1.242MetGly: 1.242 ± 0.028
0.364MetHis: 0.364 ± 0.012
0.629MetIle: 0.629 ± 0.019
0.377MetLys: 0.377 ± 0.013
1.681MetLeu: 1.681 ± 0.027
0.274MetMet: 0.274 ± 0.011
0.408MetAsn: 0.408 ± 0.012
1.079MetPro: 1.079 ± 0.022
0.452MetGln: 0.452 ± 0.013
1.258MetArg: 1.258 ± 0.024
1.263MetSer: 1.263 ± 0.024
1.479MetThr: 1.479 ± 0.026
1.253MetVal: 1.253 ± 0.025
0.187MetTrp: 0.187 ± 0.009
0.297MetTyr: 0.297 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.339AsnAla: 2.339 ± 0.04
0.185AsnCys: 0.185 ± 0.009
0.982AsnAsp: 0.982 ± 0.024
0.825AsnGlu: 0.825 ± 0.021
0.554AsnPhe: 0.554 ± 0.018
2.174AsnGly: 2.174 ± 0.047
0.42AsnHis: 0.42 ± 0.015
0.676AsnIle: 0.676 ± 0.02
0.393AsnLys: 0.393 ± 0.017
1.934AsnLeu: 1.934 ± 0.035
0.3AsnMet: 0.3 ± 0.01
0.517AsnAsn: 0.517 ± 0.018
1.507AsnPro: 1.507 ± 0.031
0.63AsnGln: 0.63 ± 0.017
1.304AsnArg: 1.304 ± 0.024
1.115AsnSer: 1.115 ± 0.029
1.217AsnThr: 1.217 ± 0.032
1.414AsnVal: 1.414 ± 0.032
0.326AsnTrp: 0.326 ± 0.015
0.455AsnTyr: 0.455 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
9.381ProAla: 9.381 ± 0.082
0.357ProCys: 0.357 ± 0.014
4.159ProAsp: 4.159 ± 0.052
4.248ProGlu: 4.248 ± 0.046
1.489ProPhe: 1.489 ± 0.027
6.65ProGly: 6.65 ± 0.067
1.279ProHis: 1.279 ± 0.029
1.471ProIle: 1.471 ± 0.026
1.21ProLys: 1.21 ± 0.026
5.385ProLeu: 5.385 ± 0.058
0.952ProMet: 0.952 ± 0.021
1.07ProAsn: 1.07 ± 0.025
3.475ProPro: 3.475 ± 0.062
2.033ProGln: 2.033 ± 0.041
3.676ProArg: 3.676 ± 0.045
3.48ProSer: 3.48 ± 0.047
3.514ProThr: 3.514 ± 0.045
5.66ProVal: 5.66 ± 0.056
0.922ProTrp: 0.922 ± 0.023
1.441ProTyr: 1.441 ± 0.027
0.0ProXaa: 0.0 ± 0.0
Gln
4.263GlnAla: 4.263 ± 0.06
0.191GlnCys: 0.191 ± 0.009
1.505GlnAsp: 1.505 ± 0.026
1.454GlnGlu: 1.454 ± 0.026
0.747GlnPhe: 0.747 ± 0.018
2.38GlnGly: 2.38 ± 0.036
0.735GlnHis: 0.735 ± 0.02
1.197GlnIle: 1.197 ± 0.026
0.6GlnLys: 0.6 ± 0.02
3.754GlnLeu: 3.754 ± 0.039
0.479GlnMet: 0.479 ± 0.016
0.576GlnAsn: 0.576 ± 0.019
2.057GlnPro: 2.057 ± 0.039
1.628GlnGln: 1.628 ± 0.041
2.517GlnArg: 2.517 ± 0.044
1.409GlnSer: 1.409 ± 0.027
1.542GlnThr: 1.542 ± 0.028
2.714GlnVal: 2.714 ± 0.034
0.529GlnTrp: 0.529 ± 0.018
0.702GlnTyr: 0.702 ± 0.02
0.0GlnXaa: 0.0 ± 0.0
Arg
9.406ArgAla: 9.406 ± 0.091
0.609ArgCys: 0.609 ± 0.018
3.566ArgAsp: 3.566 ± 0.046
4.305ArgGlu: 4.305 ± 0.057
2.182ArgPhe: 2.182 ± 0.032
5.179ArgGly: 5.179 ± 0.059
1.982ArgHis: 1.982 ± 0.031
3.253ArgIle: 3.253 ± 0.046
1.541ArgLys: 1.541 ± 0.03
8.743ArgLeu: 8.743 ± 0.083
1.612ArgMet: 1.612 ± 0.03
1.362ArgAsn: 1.362 ± 0.024
4.945ArgPro: 4.945 ± 0.059
2.42ArgGln: 2.42 ± 0.038
7.33ArgArg: 7.33 ± 0.079
4.248ArgSer: 4.248 ± 0.046
5.123ArgThr: 5.123 ± 0.049
5.329ArgVal: 5.329 ± 0.059
1.398ArgTrp: 1.398 ± 0.029
1.85ArgTyr: 1.85 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
7.182SerAla: 7.182 ± 0.067
0.484SerCys: 0.484 ± 0.015
2.704SerAsp: 2.704 ± 0.035
2.37SerGlu: 2.37 ± 0.034
1.646SerPhe: 1.646 ± 0.031
6.062SerGly: 6.062 ± 0.072
1.033SerHis: 1.033 ± 0.027
1.57SerIle: 1.57 ± 0.03
1.091SerLys: 1.091 ± 0.021
4.961SerLeu: 4.961 ± 0.049
1.09SerMet: 1.09 ± 0.022
1.04SerAsn: 1.04 ± 0.027
3.367SerPro: 3.367 ± 0.05
1.348SerGln: 1.348 ± 0.024
3.621SerArg: 3.621 ± 0.044
3.292SerSer: 3.292 ± 0.062
3.45SerThr: 3.45 ± 0.046
4.292SerVal: 4.292 ± 0.046
1.031SerTrp: 1.031 ± 0.023
1.307SerTyr: 1.307 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
9.379ThrAla: 9.379 ± 0.074
0.444ThrCys: 0.444 ± 0.015
3.529ThrAsp: 3.529 ± 0.041
3.092ThrGlu: 3.092 ± 0.038
1.621ThrPhe: 1.621 ± 0.031
6.685ThrGly: 6.685 ± 0.068
1.119ThrHis: 1.119 ± 0.021
1.86ThrIle: 1.86 ± 0.033
1.195ThrLys: 1.195 ± 0.027
5.824ThrLeu: 5.824 ± 0.051
0.893ThrMet: 0.893 ± 0.021
1.109ThrAsn: 1.109 ± 0.026
4.241ThrPro: 4.241 ± 0.051
1.406ThrGln: 1.406 ± 0.025
3.698ThrArg: 3.698 ± 0.039
3.286ThrSer: 3.286 ± 0.053
4.081ThrThr: 4.081 ± 0.073
6.283ThrVal: 6.283 ± 0.065
0.977ThrTrp: 0.977 ± 0.026
1.305ThrTyr: 1.305 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
10.505ValAla: 10.505 ± 0.092
0.698ValCys: 0.698 ± 0.018
4.683ValAsp: 4.683 ± 0.045
4.578ValGlu: 4.578 ± 0.053
2.288ValPhe: 2.288 ± 0.034
6.507ValGly: 6.507 ± 0.065
1.907ValHis: 1.907 ± 0.029
2.954ValIle: 2.954 ± 0.04
1.71ValLys: 1.71 ± 0.029
9.654ValLeu: 9.654 ± 0.086
1.358ValMet: 1.358 ± 0.024
1.762ValAsn: 1.762 ± 0.033
5.204ValPro: 5.204 ± 0.058
2.353ValGln: 2.353 ± 0.037
6.513ValArg: 6.513 ± 0.061
4.409ValSer: 4.409 ± 0.053
5.509ValThr: 5.509 ± 0.064
7.613ValVal: 7.613 ± 0.07
1.106ValTrp: 1.106 ± 0.025
1.565ValTyr: 1.565 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.783TrpAla: 1.783 ± 0.031
0.165TrpCys: 0.165 ± 0.008
0.79TrpAsp: 0.79 ± 0.021
0.735TrpGlu: 0.735 ± 0.021
0.495TrpPhe: 0.495 ± 0.016
1.084TrpGly: 1.084 ± 0.021
0.374TrpHis: 0.374 ± 0.016
0.576TrpIle: 0.576 ± 0.017
0.346TrpLys: 0.346 ± 0.014
1.84TrpLeu: 1.84 ± 0.028
0.27TrpMet: 0.27 ± 0.01
0.408TrpAsn: 0.408 ± 0.016
0.855TrpPro: 0.855 ± 0.02
0.718TrpGln: 0.718 ± 0.018
1.278TrpArg: 1.278 ± 0.025
1.019TrpSer: 1.019 ± 0.026
1.119TrpThr: 1.119 ± 0.028
1.011TrpVal: 1.011 ± 0.021
0.363TrpTrp: 0.363 ± 0.011
0.41TrpTyr: 0.41 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.646TyrAla: 2.646 ± 0.038
0.203TyrCys: 0.203 ± 0.012
1.427TyrAsp: 1.427 ± 0.036
1.132TyrGlu: 1.132 ± 0.024
0.684TyrPhe: 0.684 ± 0.017
2.351TyrGly: 2.351 ± 0.036
0.415TyrHis: 0.415 ± 0.014
0.516TyrIle: 0.516 ± 0.015
0.369TyrLys: 0.369 ± 0.013
2.341TyrLeu: 2.341 ± 0.034
0.231TyrMet: 0.231 ± 0.012
0.491TyrAsn: 0.491 ± 0.019
1.177TyrPro: 1.177 ± 0.025
0.769TyrGln: 0.769 ± 0.023
1.917TyrArg: 1.917 ± 0.033
1.081TyrSer: 1.081 ± 0.026
1.256TyrThr: 1.256 ± 0.028
1.607TyrVal: 1.607 ± 0.025
0.366TyrTrp: 0.366 ± 0.013
0.499TyrTyr: 0.499 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6749 proteins (2226480 amino acids)
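To get a feeling for the absolute numbers behind the per mille values, one can estimate how many dipeptide positions the proteome contains. The sketch below assumes that dipeptides are counted as overlapping pairs inside each protein, so a protein of length n contributes n - 1 pairs; this convention and the function names are assumptions, not something stated on this page.

```python
def dipeptide_windows(n_residues, n_proteins):
    """Number of overlapping residue pairs, assuming each protein of
    length n contributes n - 1 pairs and no pair spans two proteins."""
    return n_residues - n_proteins


def approx_count(per_mille, n_windows):
    """Approximate absolute count for a dipeptide given its per mille value."""
    return per_mille / 1000.0 * n_windows


windows = dipeptide_windows(2226480, 6749)  # figures from the statistics line
ala_ala = approx_count(22.544, windows)     # AlaAla value from the table
```

Under these assumptions the proteome contains 2219731 dipeptide positions, and AlaAla would account for roughly fifty thousand of them.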

Note: The error has been estimated by bootstrapping (x100) at the protein level.
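Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions: the function names are hypothetical, dipeptides are counted as overlapping pairs, and the error is taken to be the standard deviation across resamples, which the page does not specify.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille
    frequency over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    # Precompute per-protein counts so each resample only sums them up.
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins rather than treating every dipeptide as independent.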

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski