Amino acid dipeptide frequency for Streptomyces albireticuli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
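As a rough illustration of the calculation described above, the following Python sketch counts dipeptides and compares each frequency with the uniform expectation of 2.5‰. It is not the Proteome-pI implementation: the function names, the use of one-letter codes, the counting of overlapping pairs within each protein, and the mapping of every non-standard letter to X are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard residues as one-letter codes (assumed input alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs are counted inside each protein only, never across two proteins.
    Any letter outside the 20 standard ones is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"    # shown in red on the page
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"   # shown in blue on the page
    return "expected"


if __name__ == "__main__":
    toy = ["MAAL", "AALG"]  # hypothetical toy sequences, not real data
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

On a real proteome the input would be the protein sequences parsed from a FASTA file; the toy list here only demonstrates the call.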

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
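The unit conversion can be checked with a few lines of Python. This is only a small sketch; the helper names are made up for illustration, and the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 23.144  # AlaAla frequency from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
```

With these units, the uniform expectation of 0.25% corresponds to 2.5‰, which is the value to compare the table entries against.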

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
23.144AlaAla: 23.144 ± 0.207
1.155AlaCys: 1.155 ± 0.023
8.467AlaAsp: 8.467 ± 0.068
9.152AlaGlu: 9.152 ± 0.1
3.719AlaPhe: 3.719 ± 0.048
14.751AlaGly: 14.751 ± 0.11
3.084AlaHis: 3.084 ± 0.036
2.928AlaIle: 2.928 ± 0.049
2.832AlaLys: 2.832 ± 0.059
15.264AlaLeu: 15.264 ± 0.124
2.536AlaMet: 2.536 ± 0.036
1.743AlaAsn: 1.743 ± 0.029
8.049AlaPro: 8.049 ± 0.094
3.351AlaGln: 3.351 ± 0.043
11.415AlaArg: 11.415 ± 0.094
5.658AlaSer: 5.658 ± 0.049
7.09AlaThr: 7.09 ± 0.055
12.936AlaVal: 12.936 ± 0.106
1.872AlaTrp: 1.872 ± 0.027
2.714AlaTyr: 2.714 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
1.246CysAla: 1.246 ± 0.024
0.092CysCys: 0.092 ± 0.007
0.523CysAsp: 0.523 ± 0.016
0.424CysGlu: 0.424 ± 0.015
0.226CysPhe: 0.226 ± 0.01
1.002CysGly: 1.002 ± 0.025
0.196CysHis: 0.196 ± 0.01
0.131CysIle: 0.131 ± 0.009
0.118CysLys: 0.118 ± 0.008
0.783CysLeu: 0.783 ± 0.022
0.131CysMet: 0.131 ± 0.008
0.119CysAsn: 0.119 ± 0.007
0.516CysPro: 0.516 ± 0.015
0.164CysGln: 0.164 ± 0.008
0.683CysArg: 0.683 ± 0.017
0.42CysSer: 0.42 ± 0.016
0.477CysThr: 0.477 ± 0.016
0.718CysVal: 0.718 ± 0.018
0.125CysTrp: 0.125 ± 0.008
0.156CysTyr: 0.156 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
7.963AspAla: 7.963 ± 0.069
0.426AspCys: 0.426 ± 0.014
3.353AspAsp: 3.353 ± 0.043
3.657AspGlu: 3.657 ± 0.044
1.617AspPhe: 1.617 ± 0.031
6.532AspGly: 6.532 ± 0.064
1.461AspHis: 1.461 ± 0.026
1.742AspIle: 1.742 ± 0.03
1.179AspLys: 1.179 ± 0.034
5.943AspLeu: 5.943 ± 0.055
0.752AspMet: 0.752 ± 0.019
0.803AspAsn: 0.803 ± 0.022
4.628AspPro: 4.628 ± 0.046
1.3AspGln: 1.3 ± 0.025
5.107AspArg: 5.107 ± 0.053
2.325AspSer: 2.325 ± 0.03
3.076AspThr: 3.076 ± 0.041
4.483AspVal: 4.483 ± 0.045
0.975AspTrp: 0.975 ± 0.022
1.051AspTyr: 1.051 ± 0.024
0.0AspXaa: 0.0 ± 0.0
Glu
7.806GluAla: 7.806 ± 0.081
0.396GluCys: 0.396 ± 0.013
2.833GluAsp: 2.833 ± 0.038
3.736GluGlu: 3.736 ± 0.052
1.457GluPhe: 1.457 ± 0.029
4.604GluGly: 4.604 ± 0.047
1.486GluHis: 1.486 ± 0.026
2.12GluIle: 2.12 ± 0.028
1.444GluLys: 1.444 ± 0.033
6.914GluLeu: 6.914 ± 0.06
0.823GluMet: 0.823 ± 0.02
0.964GluAsn: 0.964 ± 0.02
3.466GluPro: 3.466 ± 0.049
1.891GluGln: 1.891 ± 0.031
5.84GluArg: 5.84 ± 0.06
2.394GluSer: 2.394 ± 0.033
2.809GluThr: 2.809 ± 0.039
4.303GluVal: 4.303 ± 0.052
0.778GluTrp: 0.778 ± 0.022
0.991GluTyr: 0.991 ± 0.021
0.0GluXaa: 0.0 ± 0.0
Phe
3.817PheAla: 3.817 ± 0.044
0.281PheCys: 0.281 ± 0.012
1.954PheAsp: 1.954 ± 0.035
1.37PheGlu: 1.37 ± 0.022
0.892PhePhe: 0.892 ± 0.021
2.996PheGly: 2.996 ± 0.034
0.671PheHis: 0.671 ± 0.016
0.644PheIle: 0.644 ± 0.02
0.463PheLys: 0.463 ± 0.017
2.62PheLeu: 2.62 ± 0.033
0.397PheMet: 0.397 ± 0.015
0.497PheAsn: 0.497 ± 0.014
1.453PhePro: 1.453 ± 0.027
0.651PheGln: 0.651 ± 0.018
1.929PheArg: 1.929 ± 0.031
1.441PheSer: 1.441 ± 0.027
2.023PheThr: 2.023 ± 0.033
2.153PheVal: 2.153 ± 0.033
0.404PheTrp: 0.404 ± 0.014
0.551PheTyr: 0.551 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
12.243GlyAla: 12.243 ± 0.089
0.899GlyCys: 0.899 ± 0.024
5.285GlyAsp: 5.285 ± 0.058
5.325GlyGlu: 5.325 ± 0.048
2.905GlyPhe: 2.905 ± 0.041
9.699GlyGly: 9.699 ± 0.093
2.492GlyHis: 2.492 ± 0.035
3.22GlyIle: 3.22 ± 0.041
2.521GlyLys: 2.521 ± 0.046
9.925GlyLeu: 9.925 ± 0.074
2.075GlyMet: 2.075 ± 0.034
1.632GlyAsn: 1.632 ± 0.032
6.067GlyPro: 6.067 ± 0.063
2.541GlyGln: 2.541 ± 0.041
8.59GlyArg: 8.59 ± 0.068
5.236GlySer: 5.236 ± 0.056
6.75GlyThr: 6.75 ± 0.07
7.768GlyVal: 7.768 ± 0.076
1.675GlyTrp: 1.675 ± 0.03
2.17GlyTyr: 2.17 ± 0.036
0.0GlyXaa: 0.0 ± 0.0
His
2.774HisAla: 2.774 ± 0.034
0.232HisCys: 0.232 ± 0.011
1.388HisAsp: 1.388 ± 0.026
1.196HisGlu: 1.196 ± 0.026
0.632HisPhe: 0.632 ± 0.018
2.513HisGly: 2.513 ± 0.037
0.731HisHis: 0.731 ± 0.018
0.628HisIle: 0.628 ± 0.017
0.352HisLys: 0.352 ± 0.014
2.55HisLeu: 2.55 ± 0.036
0.338HisMet: 0.338 ± 0.01
0.323HisAsn: 0.323 ± 0.013
1.893HisPro: 1.893 ± 0.032
0.619HisGln: 0.619 ± 0.017
2.179HisArg: 2.179 ± 0.034
1.036HisSer: 1.036 ± 0.025
1.351HisThr: 1.351 ± 0.026
1.676HisVal: 1.676 ± 0.03
0.37HisTrp: 0.37 ± 0.012
0.479HisTyr: 0.479 ± 0.016
0.0HisXaa: 0.0 ± 0.0
Ile
4.269IleAla: 4.269 ± 0.047
0.271IleCys: 0.271 ± 0.012
1.91IleAsp: 1.91 ± 0.03
1.745IleGlu: 1.745 ± 0.028
0.645IlePhe: 0.645 ± 0.018
3.282IleGly: 3.282 ± 0.042
0.555IleHis: 0.555 ± 0.018
0.71IleIle: 0.71 ± 0.022
0.675IleLys: 0.675 ± 0.02
2.109IleLeu: 2.109 ± 0.031
0.404IleMet: 0.404 ± 0.015
0.606IleAsn: 0.606 ± 0.017
1.587IlePro: 1.587 ± 0.029
0.618IleGln: 0.618 ± 0.017
2.057IleArg: 2.057 ± 0.034
1.534IleSer: 1.534 ± 0.026
1.911IleThr: 1.911 ± 0.031
2.42IleVal: 2.42 ± 0.038
0.343IleTrp: 0.343 ± 0.013
0.445IleTyr: 0.445 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
2.917LysAla: 2.917 ± 0.051
0.132LysCys: 0.132 ± 0.008
1.344LysAsp: 1.344 ± 0.036
1.19LysGlu: 1.19 ± 0.026
0.426LysPhe: 0.426 ± 0.016
1.903LysGly: 1.903 ± 0.039
0.405LysHis: 0.405 ± 0.012
0.792LysIle: 0.792 ± 0.021
0.823LysLys: 0.823 ± 0.034
1.928LysLeu: 1.928 ± 0.039
0.387LysMet: 0.387 ± 0.015
0.494LysAsn: 0.494 ± 0.017
1.334LysPro: 1.334 ± 0.032
0.613LysGln: 0.613 ± 0.019
1.453LysArg: 1.453 ± 0.025
1.099LysSer: 1.099 ± 0.026
1.139LysThr: 1.139 ± 0.029
1.797LysVal: 1.797 ± 0.037
0.256LysTrp: 0.256 ± 0.011
0.407LysTyr: 0.407 ± 0.013
0.0LysXaa: 0.0 ± 0.0
Leu
15.764LeuAla: 15.764 ± 0.116
0.898LeuCys: 0.898 ± 0.023
6.674LeuAsp: 6.674 ± 0.062
4.684LeuGlu: 4.684 ± 0.05
2.607LeuPhe: 2.607 ± 0.042
9.617LeuGly: 9.617 ± 0.09
2.356LeuHis: 2.356 ± 0.036
2.967LeuIle: 2.967 ± 0.049
1.898LeuLys: 1.898 ± 0.036
11.453LeuLeu: 11.453 ± 0.1
1.652LeuMet: 1.652 ± 0.028
1.509LeuAsn: 1.509 ± 0.027
6.764LeuPro: 6.764 ± 0.06
1.985LeuGln: 1.985 ± 0.028
9.176LeuArg: 9.176 ± 0.078
5.07LeuSer: 5.07 ± 0.05
7.057LeuThr: 7.057 ± 0.06
8.727LeuVal: 8.727 ± 0.073
1.319LeuTrp: 1.319 ± 0.025
1.794LeuTyr: 1.794 ± 0.029
0.0LeuXaa: 0.0 ± 0.0
Met
2.297MetAla: 2.297 ± 0.033
0.14MetCys: 0.14 ± 0.008
0.939MetAsp: 0.939 ± 0.021
0.752MetGlu: 0.752 ± 0.019
0.44MetPhe: 0.44 ± 0.016
1.351MetGly: 1.351 ± 0.027
0.336MetHis: 0.336 ± 0.013
0.623MetIle: 0.623 ± 0.017
0.383MetLys: 0.383 ± 0.014
1.662MetLeu: 1.662 ± 0.027
0.296MetMet: 0.296 ± 0.012
0.4MetAsn: 0.4 ± 0.014
1.097MetPro: 1.097 ± 0.024
0.387MetGln: 0.387 ± 0.013
1.436MetArg: 1.436 ± 0.028
1.251MetSer: 1.251 ± 0.02
1.494MetThr: 1.494 ± 0.028
1.27MetVal: 1.27 ± 0.025
0.213MetTrp: 0.213 ± 0.01
0.317MetTyr: 0.317 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
1.986AsnAla: 1.986 ± 0.034
0.152AsnCys: 0.152 ± 0.009
0.824AsnAsp: 0.824 ± 0.022
0.721AsnGlu: 0.721 ± 0.019
0.434AsnPhe: 0.434 ± 0.015
1.762AsnGly: 1.762 ± 0.037
0.371AsnHis: 0.371 ± 0.015
0.594AsnIle: 0.594 ± 0.016
0.374AsnLys: 0.374 ± 0.015
1.525AsnLeu: 1.525 ± 0.025
0.281AsnMet: 0.281 ± 0.01
0.343AsnAsn: 0.343 ± 0.013
1.228AsnPro: 1.228 ± 0.024
0.417AsnGln: 0.417 ± 0.014
1.175AsnArg: 1.175 ± 0.022
0.825AsnSer: 0.825 ± 0.022
0.987AsnThr: 0.987 ± 0.025
1.258AsnVal: 1.258 ± 0.026
0.238AsnTrp: 0.238 ± 0.011
0.393AsnTyr: 0.393 ± 0.014
0.0AsnXaa: 0.0 ± 0.0
Pro
10.016ProAla: 10.016 ± 0.097
0.379ProCys: 0.379 ± 0.011
4.486ProAsp: 4.486 ± 0.047
4.569ProGlu: 4.569 ± 0.054
1.582ProPhe: 1.582 ± 0.029
7.71ProGly: 7.71 ± 0.08
1.427ProHis: 1.427 ± 0.029
1.062ProIle: 1.062 ± 0.021
1.176ProLys: 1.176 ± 0.025
5.576ProLeu: 5.576 ± 0.056
1.012ProMet: 1.012 ± 0.021
0.804ProAsn: 0.804 ± 0.022
3.947ProPro: 3.947 ± 0.07
1.575ProGln: 1.575 ± 0.03
4.464ProArg: 4.464 ± 0.051
3.107ProSer: 3.107 ± 0.039
2.933ProThr: 2.933 ± 0.035
5.793ProVal: 5.793 ± 0.064
0.942ProTrp: 0.942 ± 0.021
1.372ProTyr: 1.372 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
3.371GlnAla: 3.371 ± 0.045
0.15GlnCys: 0.15 ± 0.009
1.301GlnAsp: 1.301 ± 0.025
1.387GlnGlu: 1.387 ± 0.029
0.607GlnPhe: 0.607 ± 0.018
2.208GlnGly: 2.208 ± 0.037
0.591GlnHis: 0.591 ± 0.016
0.902GlnIle: 0.902 ± 0.019
0.552GlnLys: 0.552 ± 0.019
2.692GlnLeu: 2.692 ± 0.036
0.426GlnMet: 0.426 ± 0.013
0.418GlnAsn: 0.418 ± 0.014
1.493GlnPro: 1.493 ± 0.032
1.053GlnGln: 1.053 ± 0.027
2.275GlnArg: 2.275 ± 0.032
1.038GlnSer: 1.038 ± 0.024
1.048GlnThr: 1.048 ± 0.023
2.069GlnVal: 2.069 ± 0.03
0.402GlnTrp: 0.402 ± 0.014
0.509GlnTyr: 0.509 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
11.078ArgAla: 11.078 ± 0.105
0.673ArgCys: 0.673 ± 0.017
4.384ArgAsp: 4.384 ± 0.049
5.117ArgGlu: 5.117 ± 0.058
2.451ArgPhe: 2.451 ± 0.041
6.281ArgGly: 6.281 ± 0.058
2.243ArgHis: 2.243 ± 0.033
2.919ArgIle: 2.919 ± 0.037
1.721ArgLys: 1.721 ± 0.035
9.223ArgLeu: 9.223 ± 0.076
1.792ArgMet: 1.792 ± 0.029
1.318ArgAsn: 1.318 ± 0.025
5.579ArgPro: 5.579 ± 0.064
2.175ArgGln: 2.175 ± 0.031
8.075ArgArg: 8.075 ± 0.081
3.922ArgSer: 3.922 ± 0.05
5.744ArgThr: 5.744 ± 0.053
6.226ArgVal: 6.226 ± 0.059
1.409ArgTrp: 1.409 ± 0.027
1.813ArgTyr: 1.813 ± 0.03
0.0ArgXaa: 0.0 ± 0.0
Ser
6.542SerAla: 6.542 ± 0.065
0.4SerCys: 0.4 ± 0.013
2.329SerAsp: 2.329 ± 0.035
2.164SerGlu: 2.164 ± 0.032
1.476SerPhe: 1.476 ± 0.026
5.937SerGly: 5.937 ± 0.071
0.967SerHis: 0.967 ± 0.021
1.181SerIle: 1.181 ± 0.025
0.941SerLys: 0.941 ± 0.023
4.742SerLeu: 4.742 ± 0.053
0.978SerMet: 0.978 ± 0.021
0.78SerAsn: 0.78 ± 0.019
3.247SerPro: 3.247 ± 0.044
1.027SerGln: 1.027 ± 0.024
3.623SerArg: 3.623 ± 0.044
2.549SerSer: 2.549 ± 0.044
2.693SerThr: 2.693 ± 0.039
4.146SerVal: 4.146 ± 0.05
0.866SerTrp: 0.866 ± 0.02
1.18SerTyr: 1.18 ± 0.026
0.0SerXaa: 0.0 ± 0.0
Thr
9.295ThrAla: 9.295 ± 0.076
0.452ThrCys: 0.452 ± 0.015
3.48ThrAsp: 3.48 ± 0.037
3.196ThrGlu: 3.196 ± 0.036
1.589ThrPhe: 1.589 ± 0.025
7.193ThrGly: 7.193 ± 0.062
1.207ThrHis: 1.207 ± 0.024
1.379ThrIle: 1.379 ± 0.027
1.045ThrLys: 1.045 ± 0.025
5.672ThrLeu: 5.672 ± 0.055
0.882ThrMet: 0.882 ± 0.022
0.842ThrAsn: 0.842 ± 0.019
4.132ThrPro: 4.132 ± 0.05
1.143ThrGln: 1.143 ± 0.026
4.002ThrArg: 4.002 ± 0.046
2.923ThrSer: 2.923 ± 0.037
3.613ThrThr: 3.613 ± 0.049
5.91ThrVal: 5.91 ± 0.055
0.847ThrTrp: 0.847 ± 0.023
1.264ThrTyr: 1.264 ± 0.027
0.0ThrXaa: 0.0 ± 0.0
Val
10.9ValAla: 10.9 ± 0.088
0.767ValCys: 0.767 ± 0.018
4.764ValAsp: 4.764 ± 0.053
4.695ValGlu: 4.695 ± 0.049
2.401ValPhe: 2.401 ± 0.039
6.448ValGly: 6.448 ± 0.064
1.904ValHis: 1.904 ± 0.032
2.639ValIle: 2.639 ± 0.044
1.645ValLys: 1.645 ± 0.033
9.586ValLeu: 9.586 ± 0.087
1.408ValMet: 1.408 ± 0.028
1.582ValAsn: 1.582 ± 0.029
5.568ValPro: 5.568 ± 0.05
1.89ValGln: 1.89 ± 0.029
7.459ValArg: 7.459 ± 0.064
4.2ValSer: 4.2 ± 0.043
5.78ValThr: 5.78 ± 0.058
7.932ValVal: 7.932 ± 0.087
1.092ValTrp: 1.092 ± 0.024
1.436ValTyr: 1.436 ± 0.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.689TrpAla: 1.689 ± 0.03
0.167TrpCys: 0.167 ± 0.008
0.81TrpAsp: 0.81 ± 0.02
0.737TrpGlu: 0.737 ± 0.016
0.486TrpPhe: 0.486 ± 0.015
1.038TrpGly: 1.038 ± 0.021
0.387TrpHis: 0.387 ± 0.013
0.482TrpIle: 0.482 ± 0.015
0.331TrpLys: 0.331 ± 0.014
1.758TrpLeu: 1.758 ± 0.029
0.274TrpMet: 0.274 ± 0.012
0.354TrpAsn: 0.354 ± 0.014
0.815TrpPro: 0.815 ± 0.02
0.57TrpGln: 0.57 ± 0.017
1.379TrpArg: 1.379 ± 0.029
0.897TrpSer: 0.897 ± 0.019
1.027TrpThr: 1.027 ± 0.025
0.955TrpVal: 0.955 ± 0.021
0.338TrpTrp: 0.338 ± 0.014
0.327TrpTyr: 0.327 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.745TyrAla: 2.745 ± 0.04
0.186TyrCys: 0.186 ± 0.009
1.325TyrAsp: 1.325 ± 0.023
1.238TyrGlu: 1.238 ± 0.028
0.615TyrPhe: 0.615 ± 0.017
2.192TyrGly: 2.192 ± 0.037
0.389TyrHis: 0.389 ± 0.014
0.434TyrIle: 0.434 ± 0.015
0.367TyrLys: 0.367 ± 0.014
2.047TyrLeu: 2.047 ± 0.032
0.241TyrMet: 0.241 ± 0.012
0.358TyrAsn: 0.358 ± 0.011
1.066TyrPro: 1.066 ± 0.019
0.523TyrGln: 0.523 ± 0.016
1.827TyrArg: 1.827 ± 0.031
0.843TyrSer: 0.843 ± 0.02
1.064TyrThr: 1.064 ± 0.022
1.556TyrVal: 1.556 ± 0.027
0.355TyrTrp: 0.355 ± 0.013
0.406TyrTyr: 0.406 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6966 proteins (2327136 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
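A protein-level bootstrap of this kind can be sketched as follows. The page does not say how the ± value is derived from the replicates, so this example assumes it is the standard deviation of the per-replicate frequencies; the function names, the fixed seed, and the one-letter sequence format are likewise assumptions rather than the database's actual code.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter dipeptide in seq."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_per_mille(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein and not the individual dipeptide. Needs at least two
    usable replicates, otherwise statistics.stdev raises an error.
    """
    rng = random.Random(seed)
    # Per-protein (occurrences of the pair, number of dipeptide positions).
    per_protein = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:  # skip replicates that contain no dipeptides at all
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MAALAA", "GAAV", "MLLK", "AAAA"]  # hypothetical toy sequences
    mean, sd = bootstrap_per_mille(toy, "AA")
    print(f"AA: {mean:.3f} ± {sd:.3f} per mille")
```

Resampling proteins rather than residues keeps the dipeptides of one protein together, which matters because pairs within a protein are not independent of each other.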

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski