Amino acid dipeptide frequency for Streptomyces glauciniger

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
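The comparison against the uniform expectation can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, and counting overlapping residue pairs within each protein (never across protein boundaries) is an assumption.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else is treated as Xaa ('X').
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to 'X' (assumption for this sketch).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted separately for each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"   # would be marked red
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["AAAL", "ALA"])
labels = {pair: classify(f) for pair, f in freqs.items()}
```

Applied to values from the table, `classify(23.181)` labels AlaAla as overrepresented and `classify(0.095)` labels CysCys as underrepresented.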

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
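As a small worked example of the unit conversion, the snippet below converts the AlaAla value from the table (23.181‰) to a fraction and a percentage, and compares it with the 2.5‰ expected for 400 equally likely dipeptides. The helper name `per_mille_to_fraction` is only illustrative.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = 23.181                                   # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)          # about 0.023181
percent = fraction * 100                           # about 2.3181 %
fold_over_uniform = ala_ala / (1000.0 / 400)       # about 9.27 times the uniform expectation
```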

For more information see sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second residue; each cell is frequency ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 23.181 ± 0.165 | 1.087 ± 0.023 | 8.423 ± 0.057 | 8.667 ± 0.068 | 3.62 ± 0.041 | 14.043 ± 0.095 | 2.918 ± 0.04 | 3.313 ± 0.036 | 2.548 ± 0.039 | 15.012 ± 0.115 | 2.588 ± 0.029 | 1.817 ± 0.031 | 7.498 ± 0.064 | 3.615 ± 0.04 | 10.675 ± 0.077 | 6.034 ± 0.047 | 7.038 ± 0.05 | 13.185 ± 0.087 | 1.987 ± 0.026 | 2.779 ± 0.037 | 0.0 ± 0.0
Cys | 1.074 ± 0.019 | 0.095 ± 0.006 | 0.465 ± 0.014 | 0.398 ± 0.011 | 0.213 ± 0.008 | 0.972 ± 0.019 | 0.193 ± 0.009 | 0.163 ± 0.008 | 0.1 ± 0.006 | 0.724 ± 0.017 | 0.115 ± 0.007 | 0.126 ± 0.007 | 0.471 ± 0.015 | 0.165 ± 0.007 | 0.636 ± 0.016 | 0.429 ± 0.012 | 0.501 ± 0.014 | 0.639 ± 0.017 | 0.129 ± 0.007 | 0.147 ± 0.008 | 0.0 ± 0.0
Asp | 7.967 ± 0.07 | 0.414 ± 0.013 | 3.596 ± 0.045 | 3.759 ± 0.034 | 1.646 ± 0.024 | 6.644 ± 0.055 | 1.404 ± 0.028 | 1.725 ± 0.027 | 1.027 ± 0.024 | 6.265 ± 0.055 | 0.775 ± 0.018 | 0.923 ± 0.023 | 4.606 ± 0.039 | 1.421 ± 0.024 | 5.015 ± 0.046 | 2.397 ± 0.031 | 3.166 ± 0.035 | 4.943 ± 0.044 | 1.024 ± 0.02 | 1.101 ± 0.022 | 0.0 ± 0.0
Glu | 7.455 ± 0.07 | 0.382 ± 0.011 | 2.954 ± 0.038 | 3.513 ± 0.045 | 1.402 ± 0.026 | 4.374 ± 0.04 | 1.563 ± 0.024 | 2.032 ± 0.031 | 1.14 ± 0.027 | 6.583 ± 0.057 | 0.818 ± 0.018 | 0.919 ± 0.018 | 3.352 ± 0.045 | 2.134 ± 0.031 | 5.508 ± 0.05 | 2.401 ± 0.031 | 2.674 ± 0.03 | 4.473 ± 0.045 | 0.765 ± 0.017 | 1.085 ± 0.022 | 0.0 ± 0.0
Phe | 3.712 ± 0.04 | 0.248 ± 0.009 | 1.907 ± 0.031 | 1.356 ± 0.024 | 0.861 ± 0.021 | 3.065 ± 0.037 | 0.635 ± 0.013 | 0.654 ± 0.017 | 0.443 ± 0.013 | 2.583 ± 0.036 | 0.39 ± 0.012 | 0.526 ± 0.014 | 1.415 ± 0.024 | 0.654 ± 0.015 | 1.932 ± 0.031 | 1.399 ± 0.021 | 2.013 ± 0.027 | 2.252 ± 0.031 | 0.43 ± 0.014 | 0.554 ± 0.014 | 0.0 ± 0.0
Gly | 11.571 ± 0.09 | 0.825 ± 0.02 | 5.502 ± 0.052 | 5.126 ± 0.049 | 2.9 ± 0.038 | 9.473 ± 0.084 | 2.481 ± 0.026 | 3.347 ± 0.04 | 2.237 ± 0.041 | 9.683 ± 0.067 | 2.039 ± 0.029 | 1.803 ± 0.032 | 5.521 ± 0.05 | 2.48 ± 0.031 | 8.379 ± 0.063 | 5.43 ± 0.049 | 6.783 ± 0.064 | 7.722 ± 0.067 | 1.742 ± 0.023 | 2.283 ± 0.033 | 0.0 ± 0.0
His | 2.826 ± 0.035 | 0.2 ± 0.007 | 1.412 ± 0.025 | 1.23 ± 0.02 | 0.676 ± 0.014 | 2.536 ± 0.035 | 0.718 ± 0.015 | 0.644 ± 0.017 | 0.32 ± 0.011 | 2.436 ± 0.031 | 0.367 ± 0.013 | 0.35 ± 0.013 | 1.884 ± 0.029 | 0.601 ± 0.016 | 2.128 ± 0.032 | 1.033 ± 0.019 | 1.304 ± 0.021 | 1.824 ± 0.027 | 0.369 ± 0.012 | 0.484 ± 0.014 | 0.0 ± 0.0
Ile | 4.657 ± 0.042 | 0.255 ± 0.009 | 2.107 ± 0.032 | 1.744 ± 0.029 | 0.626 ± 0.017 | 3.449 ± 0.037 | 0.579 ± 0.015 | 0.775 ± 0.021 | 0.64 ± 0.017 | 2.19 ± 0.031 | 0.404 ± 0.012 | 0.625 ± 0.016 | 1.661 ± 0.022 | 0.615 ± 0.016 | 2.18 ± 0.031 | 1.564 ± 0.025 | 2.085 ± 0.029 | 2.569 ± 0.032 | 0.369 ± 0.011 | 0.485 ± 0.014 | 0.0 ± 0.0
Lys | 2.719 ± 0.042 | 0.1 ± 0.007 | 1.175 ± 0.024 | 1.007 ± 0.021 | 0.387 ± 0.012 | 1.7 ± 0.037 | 0.395 ± 0.012 | 0.721 ± 0.017 | 0.645 ± 0.022 | 1.759 ± 0.031 | 0.31 ± 0.01 | 0.431 ± 0.014 | 1.173 ± 0.023 | 0.591 ± 0.015 | 1.292 ± 0.024 | 1.008 ± 0.022 | 1.084 ± 0.023 | 1.745 ± 0.032 | 0.24 ± 0.01 | 0.393 ± 0.012 | 0.0 ± 0.0
Leu | 15.342 ± 0.107 | 0.808 ± 0.019 | 6.681 ± 0.054 | 4.802 ± 0.046 | 2.587 ± 0.039 | 9.269 ± 0.064 | 2.308 ± 0.031 | 3.021 ± 0.036 | 1.726 ± 0.028 | 11.277 ± 0.085 | 1.609 ± 0.025 | 1.566 ± 0.026 | 6.712 ± 0.063 | 2.238 ± 0.027 | 9.062 ± 0.073 | 5.253 ± 0.044 | 6.798 ± 0.052 | 8.972 ± 0.072 | 1.322 ± 0.023 | 1.868 ± 0.028 | 0.0 ± 0.0
Met | 2.287 ± 0.031 | 0.134 ± 0.007 | 0.929 ± 0.019 | 0.739 ± 0.02 | 0.411 ± 0.014 | 1.321 ± 0.022 | 0.366 ± 0.012 | 0.59 ± 0.016 | 0.383 ± 0.013 | 1.635 ± 0.027 | 0.278 ± 0.011 | 0.415 ± 0.012 | 1.092 ± 0.018 | 0.416 ± 0.012 | 1.48 ± 0.025 | 1.276 ± 0.021 | 1.482 ± 0.023 | 1.276 ± 0.021 | 0.209 ± 0.008 | 0.311 ± 0.011 | 0.0 ± 0.0
Asn | 2.168 ± 0.031 | 0.153 ± 0.008 | 0.924 ± 0.019 | 0.776 ± 0.018 | 0.461 ± 0.013 | 1.913 ± 0.035 | 0.382 ± 0.012 | 0.553 ± 0.016 | 0.343 ± 0.013 | 1.581 ± 0.029 | 0.266 ± 0.009 | 0.429 ± 0.016 | 1.306 ± 0.025 | 0.448 ± 0.013 | 1.254 ± 0.018 | 0.837 ± 0.021 | 1.055 ± 0.024 | 1.326 ± 0.028 | 0.263 ± 0.011 | 0.376 ± 0.012 | 0.0 ± 0.0
Pro | 9.259 ± 0.078 | 0.34 ± 0.012 | 4.419 ± 0.037 | 4.326 ± 0.045 | 1.543 ± 0.026 | 7.233 ± 0.057 | 1.432 ± 0.022 | 1.227 ± 0.027 | 1.036 ± 0.022 | 5.49 ± 0.047 | 0.977 ± 0.023 | 0.834 ± 0.019 | 3.673 ± 0.055 | 1.784 ± 0.029 | 4.282 ± 0.043 | 3.206 ± 0.045 | 2.967 ± 0.038 | 5.667 ± 0.05 | 0.957 ± 0.02 | 1.379 ± 0.023 | 0.0 ± 0.0
Gln | 3.711 ± 0.043 | 0.172 ± 0.008 | 1.406 ± 0.027 | 1.384 ± 0.023 | 0.651 ± 0.015 | 2.252 ± 0.031 | 0.606 ± 0.015 | 0.997 ± 0.022 | 0.51 ± 0.016 | 2.717 ± 0.031 | 0.434 ± 0.013 | 0.468 ± 0.014 | 1.628 ± 0.027 | 1.13 ± 0.031 | 2.312 ± 0.03 | 1.206 ± 0.022 | 1.268 ± 0.024 | 2.296 ± 0.032 | 0.472 ± 0.013 | 0.584 ± 0.015 | 0.0 ± 0.0
Arg | 10.18 ± 0.081 | 0.616 ± 0.014 | 4.392 ± 0.042 | 4.862 ± 0.046 | 2.416 ± 0.027 | 6.223 ± 0.047 | 2.352 ± 0.032 | 3.216 ± 0.039 | 1.526 ± 0.024 | 9.243 ± 0.068 | 1.787 ± 0.029 | 1.334 ± 0.024 | 5.203 ± 0.046 | 2.291 ± 0.027 | 8.209 ± 0.07 | 4.093 ± 0.039 | 5.442 ± 0.041 | 6.084 ± 0.054 | 1.44 ± 0.022 | 1.815 ± 0.027 | 0.0 ± 0.0
Ser | 6.837 ± 0.058 | 0.423 ± 0.013 | 2.572 ± 0.037 | 2.235 ± 0.031 | 1.511 ± 0.022 | 6.072 ± 0.053 | 1.005 ± 0.023 | 1.33 ± 0.025 | 0.899 ± 0.02 | 4.729 ± 0.045 | 1.004 ± 0.018 | 0.825 ± 0.018 | 3.235 ± 0.041 | 1.179 ± 0.02 | 3.759 ± 0.038 | 2.72 ± 0.041 | 2.909 ± 0.04 | 4.346 ± 0.036 | 0.906 ± 0.019 | 1.162 ± 0.021 | 0.0 ± 0.0
Thr | 9.292 ± 0.066 | 0.427 ± 0.013 | 3.506 ± 0.042 | 3.031 ± 0.034 | 1.527 ± 0.027 | 6.876 ± 0.058 | 1.179 ± 0.021 | 1.6 ± 0.026 | 1.047 ± 0.025 | 5.556 ± 0.048 | 0.928 ± 0.019 | 0.919 ± 0.022 | 4.082 ± 0.04 | 1.301 ± 0.024 | 3.974 ± 0.038 | 3.087 ± 0.033 | 3.768 ± 0.043 | 6.196 ± 0.061 | 0.934 ± 0.022 | 1.297 ± 0.025 | 0.0 ± 0.0
Val | 11.26 ± 0.071 | 0.751 ± 0.017 | 5.084 ± 0.042 | 4.64 ± 0.052 | 2.454 ± 0.03 | 6.561 ± 0.065 | 2.056 ± 0.025 | 2.677 ± 0.03 | 1.616 ± 0.028 | 9.929 ± 0.078 | 1.418 ± 0.024 | 1.689 ± 0.028 | 5.639 ± 0.049 | 1.977 ± 0.027 | 7.357 ± 0.056 | 4.362 ± 0.045 | 5.902 ± 0.049 | 8.399 ± 0.073 | 1.206 ± 0.023 | 1.574 ± 0.025 | 0.0 ± 0.0
Trp | 1.739 ± 0.027 | 0.151 ± 0.009 | 0.875 ± 0.018 | 0.765 ± 0.018 | 0.528 ± 0.014 | 1.113 ± 0.026 | 0.394 ± 0.012 | 0.53 ± 0.015 | 0.346 ± 0.012 | 1.775 ± 0.026 | 0.275 ± 0.01 | 0.429 ± 0.013 | 0.821 ± 0.017 | 0.59 ± 0.016 | 1.383 ± 0.022 | 0.989 ± 0.021 | 1.057 ± 0.017 | 0.99 ± 0.02 | 0.33 ± 0.01 | 0.377 ± 0.011 | 0.0 ± 0.0
Tyr | 2.789 ± 0.032 | 0.177 ± 0.007 | 1.491 ± 0.03 | 1.166 ± 0.021 | 0.607 ± 0.015 | 2.235 ± 0.033 | 0.377 ± 0.012 | 0.462 ± 0.012 | 0.344 ± 0.012 | 2.054 ± 0.023 | 0.251 ± 0.012 | 0.387 ± 0.014 | 1.059 ± 0.021 | 0.572 ± 0.015 | 1.908 ± 0.029 | 0.934 ± 0.021 | 1.231 ± 0.022 | 1.643 ± 0.027 | 0.365 ± 0.012 | 0.449 ± 0.015 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 8673 proteins (2848599 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
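A protein-level bootstrap of this kind could look roughly like the sketch below. It resamples whole proteins with replacement 100 times and recomputes the dipeptide frequency each time. The function names are hypothetical, and reporting the standard deviation across replicates as the error is an assumption; the source does not state which statistic the ± value represents.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    # Precompute (occurrences of `pair`, total pairs) for each protein.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))

    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, t in sample)
        total = sum(t for h, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)

    # Assumption: the spread is summarised as the sample standard deviation.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["AAALAG", "GALAA", "LLAAG", "AGAGA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level keeps the dipeptides of one protein together, which reflects that pairs within the same sequence are not independent observations.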

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski