Amino acid dipeptide frequency for Nonomuraea sp. ATCC 55076

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
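As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the exact procedure used to build the table on this page is not described here.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: every pair of adjacent residues within a protein counts
    once, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter code, not real data.
toy_proteome = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The toy input contains five dipeptides in total (MA, AA, AL, AA, AG), so AA accounts for two fifths of them.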

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
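The arithmetic behind the expected value can be sketched as follows. The example assumes that "expected" means the uniform value of 1/400 and that a simple greater-than or less-than comparison decides the colour; the page does not state the exact threshold, so the `classify` helper is hypothetical. The two observed values are taken from the table (AlaAla and CysCys).

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # including Xaa

# Uniform expectation: every ordered pair equally likely.
expected_permille = 1000.0 / N_STANDARD ** 2            # 2.5 per mille = 0.25 %
expected_permille_with_xaa = 1000.0 / N_WITH_XAA ** 2   # about 2.27 per mille


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"


print(expected_permille)   # 2.5
print(classify(21.689))    # AlaAla -> over
print(classify(0.099))     # CysCys -> under
```

A uniform baseline ignores the single amino acid composition of the proteome, which is one reason why dipeptides built from abundant residues such as Ala or Leu appear strongly overrepresented.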

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
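For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are made up for this example; the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 21.689  # AlaAla, in per mille, from the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```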

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
21.689AlaAla: 21.689 ± 0.115
1.11AlaCys: 1.11 ± 0.018
7.648AlaAsp: 7.648 ± 0.051
8.483AlaGlu: 8.483 ± 0.071
3.861AlaPhe: 3.861 ± 0.033
13.927AlaGly: 13.927 ± 0.081
2.698AlaHis: 2.698 ± 0.028
4.259AlaIle: 4.259 ± 0.038
2.614AlaLys: 2.614 ± 0.031
14.667AlaLeu: 14.667 ± 0.089
2.754AlaMet: 2.754 ± 0.03
1.942AlaAsn: 1.942 ± 0.026
6.472AlaPro: 6.472 ± 0.055
3.61AlaGln: 3.61 ± 0.035
10.715AlaArg: 10.715 ± 0.061
5.56AlaSer: 5.56 ± 0.048
7.123AlaThr: 7.123 ± 0.049
11.634AlaVal: 11.634 ± 0.067
2.049AlaTrp: 2.049 ± 0.025
2.876AlaTyr: 2.876 ± 0.028
0.0AlaXaa: 0.0 ± 0.0
Cys
1.03CysAla: 1.03 ± 0.017
0.099CysCys: 0.099 ± 0.006
0.477CysAsp: 0.477 ± 0.013
0.408CysGlu: 0.408 ± 0.012
0.222CysPhe: 0.222 ± 0.008
0.891CysGly: 0.891 ± 0.017
0.203CysHis: 0.203 ± 0.007
0.126CysIle: 0.126 ± 0.006
0.1CysLys: 0.1 ± 0.006
0.766CysLeu: 0.766 ± 0.017
0.118CysMet: 0.118 ± 0.005
0.124CysAsn: 0.124 ± 0.006
0.495CysPro: 0.495 ± 0.012
0.173CysGln: 0.173 ± 0.007
0.572CysArg: 0.572 ± 0.014
0.395CysSer: 0.395 ± 0.01
0.43CysThr: 0.43 ± 0.012
0.648CysVal: 0.648 ± 0.014
0.128CysTrp: 0.128 ± 0.006
0.168CysTyr: 0.168 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
6.771AspAla: 6.771 ± 0.05
0.357AspCys: 0.357 ± 0.011
3.575AspAsp: 3.575 ± 0.035
3.569AspGlu: 3.569 ± 0.035
1.528AspPhe: 1.528 ± 0.023
6.16AspGly: 6.16 ± 0.062
1.373AspHis: 1.373 ± 0.019
1.659AspIle: 1.659 ± 0.022
1.035AspLys: 1.035 ± 0.017
6.65AspLeu: 6.65 ± 0.05
0.747AspMet: 0.747 ± 0.015
0.896AspAsn: 0.896 ± 0.017
4.547AspPro: 4.547 ± 0.041
1.549AspGln: 1.549 ± 0.022
4.646AspArg: 4.646 ± 0.039
2.021AspSer: 2.021 ± 0.024
2.674AspThr: 2.674 ± 0.028
4.964AspVal: 4.964 ± 0.042
0.951AspTrp: 0.951 ± 0.016
1.197AspTyr: 1.197 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
6.941GluAla: 6.941 ± 0.056
0.345GluCys: 0.345 ± 0.01
2.324GluAsp: 2.324 ± 0.029
3.535GluGlu: 3.535 ± 0.038
1.394GluPhe: 1.394 ± 0.02
3.963GluGly: 3.963 ± 0.033
1.664GluHis: 1.664 ± 0.024
2.301GluIle: 2.301 ± 0.031
1.058GluLys: 1.058 ± 0.019
7.242GluLeu: 7.242 ± 0.057
0.867GluMet: 0.867 ± 0.013
0.825GluAsn: 0.825 ± 0.016
3.581GluPro: 3.581 ± 0.036
2.29GluGln: 2.29 ± 0.027
6.03GluArg: 6.03 ± 0.048
2.482GluSer: 2.482 ± 0.028
2.67GluThr: 2.67 ± 0.027
4.68GluVal: 4.68 ± 0.042
0.814GluTrp: 0.814 ± 0.016
1.023GluTyr: 1.023 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
3.817PheAla: 3.817 ± 0.035
0.252PheCys: 0.252 ± 0.007
2.043PheAsp: 2.043 ± 0.025
1.502PheGlu: 1.502 ± 0.022
0.887PhePhe: 0.887 ± 0.016
3.037PheGly: 3.037 ± 0.031
0.63PheHis: 0.63 ± 0.013
0.76PheIle: 0.76 ± 0.016
0.511PheLys: 0.511 ± 0.011
2.708PheLeu: 2.708 ± 0.031
0.453PheMet: 0.453 ± 0.011
0.564PheAsn: 0.564 ± 0.013
1.424PhePro: 1.424 ± 0.021
0.714PheGln: 0.714 ± 0.015
1.805PheArg: 1.805 ± 0.023
1.494PheSer: 1.494 ± 0.026
2.064PheThr: 2.064 ± 0.027
2.374PheVal: 2.374 ± 0.025
0.48PheTrp: 0.48 ± 0.011
0.64PheTyr: 0.64 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
10.619GlyAla: 10.619 ± 0.062
0.801GlyCys: 0.801 ± 0.015
5.214GlyAsp: 5.214 ± 0.042
5.222GlyGlu: 5.222 ± 0.044
2.888GlyPhe: 2.888 ± 0.024
8.929GlyGly: 8.929 ± 0.071
2.31GlyHis: 2.31 ± 0.025
3.254GlyIle: 3.254 ± 0.032
2.252GlyLys: 2.252 ± 0.032
10.009GlyLeu: 10.009 ± 0.071
2.055GlyMet: 2.055 ± 0.022
1.706GlyAsn: 1.706 ± 0.027
5.186GlyPro: 5.186 ± 0.044
2.794GlyGln: 2.794 ± 0.034
8.009GlyArg: 8.009 ± 0.048
4.79GlySer: 4.79 ± 0.039
5.86GlyThr: 5.86 ± 0.048
7.942GlyVal: 7.942 ± 0.058
1.821GlyTrp: 1.821 ± 0.023
2.398GlyTyr: 2.398 ± 0.03
0.0GlyXaa: 0.0 ± 0.0
His
2.738HisAla: 2.738 ± 0.031
0.203HisCys: 0.203 ± 0.007
1.435HisAsp: 1.435 ± 0.021
1.223HisGlu: 1.223 ± 0.021
0.6HisPhe: 0.6 ± 0.014
2.25HisGly: 2.25 ± 0.025
0.71HisHis: 0.71 ± 0.015
0.611HisIle: 0.611 ± 0.012
0.294HisLys: 0.294 ± 0.009
2.586HisLeu: 2.586 ± 0.03
0.319HisMet: 0.319 ± 0.01
0.385HisAsn: 0.385 ± 0.01
1.743HisPro: 1.743 ± 0.025
0.635HisGln: 0.635 ± 0.014
2.011HisArg: 2.011 ± 0.027
0.846HisSer: 0.846 ± 0.016
1.128HisThr: 1.128 ± 0.018
1.93HisVal: 1.93 ± 0.021
0.374HisTrp: 0.374 ± 0.01
0.506HisTyr: 0.506 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
5.001IleAla: 5.001 ± 0.038
0.302IleCys: 0.302 ± 0.008
2.396IleAsp: 2.396 ± 0.027
2.173IleGlu: 2.173 ± 0.028
0.834IlePhe: 0.834 ± 0.018
3.618IleGly: 3.618 ± 0.033
0.626IleHis: 0.626 ± 0.013
1.047IleIle: 1.047 ± 0.019
0.821IleLys: 0.821 ± 0.016
2.586IleLeu: 2.586 ± 0.029
0.592IleMet: 0.592 ± 0.013
0.732IleAsn: 0.732 ± 0.015
1.832IlePro: 1.832 ± 0.027
0.752IleGln: 0.752 ± 0.015
2.412IleArg: 2.412 ± 0.028
1.878IleSer: 1.878 ± 0.024
2.425IleThr: 2.425 ± 0.026
3.124IleVal: 3.124 ± 0.028
0.472IleTrp: 0.472 ± 0.011
0.639IleTyr: 0.639 ± 0.013
0.0IleXaa: 0.0 ± 0.0
Lys
2.596LysAla: 2.596 ± 0.034
0.11LysCys: 0.11 ± 0.006
1.092LysAsp: 1.092 ± 0.019
1.165LysGlu: 1.165 ± 0.019
0.395LysPhe: 0.395 ± 0.011
1.733LysGly: 1.733 ± 0.024
0.395LysHis: 0.395 ± 0.01
0.883LysIle: 0.883 ± 0.016
0.603LysLys: 0.603 ± 0.018
1.989LysLeu: 1.989 ± 0.024
0.337LysMet: 0.337 ± 0.009
0.425LysAsn: 0.425 ± 0.013
1.243LysPro: 1.243 ± 0.023
0.668LysGln: 0.668 ± 0.016
1.351LysArg: 1.351 ± 0.022
0.977LysSer: 0.977 ± 0.018
1.141LysThr: 1.141 ± 0.023
1.894LysVal: 1.894 ± 0.03
0.28LysTrp: 0.28 ± 0.009
0.412LysTyr: 0.412 ± 0.012
0.0LysXaa: 0.0 ± 0.0
Leu
16.468LeuAla: 16.468 ± 0.089
0.785LeuCys: 0.785 ± 0.016
6.742LeuAsp: 6.742 ± 0.054
4.879LeuGlu: 4.879 ± 0.042
2.775LeuPhe: 2.775 ± 0.032
9.633LeuGly: 9.633 ± 0.06
2.208LeuHis: 2.208 ± 0.027
3.762LeuIle: 3.762 ± 0.036
1.92LeuLys: 1.92 ± 0.027
11.754LeuLeu: 11.754 ± 0.08
1.809LeuMet: 1.809 ± 0.023
1.738LeuAsn: 1.738 ± 0.024
6.708LeuPro: 6.708 ± 0.049
2.143LeuGln: 2.143 ± 0.024
9.346LeuArg: 9.346 ± 0.055
5.641LeuSer: 5.641 ± 0.04
6.935LeuThr: 6.935 ± 0.046
9.071LeuVal: 9.071 ± 0.062
1.389LeuTrp: 1.389 ± 0.022
1.965LeuTyr: 1.965 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.44MetAla: 2.44 ± 0.023
0.123MetCys: 0.123 ± 0.005
0.897MetAsp: 0.897 ± 0.015
0.848MetGlu: 0.848 ± 0.016
0.508MetPhe: 0.508 ± 0.012
1.379MetGly: 1.379 ± 0.021
0.347MetHis: 0.347 ± 0.01
0.843MetIle: 0.843 ± 0.014
0.425MetLys: 0.425 ± 0.01
1.916MetLeu: 1.916 ± 0.023
0.34MetMet: 0.34 ± 0.012
0.449MetAsn: 0.449 ± 0.01
1.14MetPro: 1.14 ± 0.016
0.405MetGln: 0.405 ± 0.011
1.689MetArg: 1.689 ± 0.02
1.308MetSer: 1.308 ± 0.02
1.51MetThr: 1.51 ± 0.021
1.369MetVal: 1.369 ± 0.022
0.244MetTrp: 0.244 ± 0.007
0.333MetTyr: 0.333 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.187AsnAla: 2.187 ± 0.027
0.154AsnCys: 0.154 ± 0.006
0.969AsnAsp: 0.969 ± 0.015
0.801AsnGlu: 0.801 ± 0.017
0.47AsnPhe: 0.47 ± 0.011
1.88AsnGly: 1.88 ± 0.028
0.358AsnHis: 0.358 ± 0.01
0.592AsnIle: 0.592 ± 0.013
0.362AsnLys: 0.362 ± 0.013
1.788AsnLeu: 1.788 ± 0.025
0.271AsnMet: 0.271 ± 0.01
0.413AsnAsn: 0.413 ± 0.013
1.362AsnPro: 1.362 ± 0.022
0.492AsnGln: 0.492 ± 0.012
1.228AsnArg: 1.228 ± 0.02
0.757AsnSer: 0.757 ± 0.017
1.0AsnThr: 1.0 ± 0.02
1.556AsnVal: 1.556 ± 0.022
0.274AsnTrp: 0.274 ± 0.009
0.394AsnTyr: 0.394 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
8.842ProAla: 8.842 ± 0.059
0.344ProCys: 0.344 ± 0.01
4.361ProAsp: 4.361 ± 0.036
4.156ProGlu: 4.156 ± 0.036
1.679ProPhe: 1.679 ± 0.02
6.846ProGly: 6.846 ± 0.054
1.324ProHis: 1.324 ± 0.02
1.739ProIle: 1.739 ± 0.022
1.124ProLys: 1.124 ± 0.019
5.518ProLeu: 5.518 ± 0.046
1.167ProMet: 1.167 ± 0.017
0.924ProAsn: 0.924 ± 0.019
3.892ProPro: 3.892 ± 0.047
1.645ProGln: 1.645 ± 0.028
3.915ProArg: 3.915 ± 0.04
3.151ProSer: 3.151 ± 0.039
2.899ProThr: 2.899 ± 0.042
5.213ProVal: 5.213 ± 0.039
0.951ProTrp: 0.951 ± 0.018
1.534ProTyr: 1.534 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
4.187GlnAla: 4.187 ± 0.039
0.153GlnCys: 0.153 ± 0.007
1.28GlnAsp: 1.28 ± 0.021
1.502GlnGlu: 1.502 ± 0.019
0.663GlnPhe: 0.663 ± 0.014
2.228GlnGly: 2.228 ± 0.024
0.619GlnHis: 0.619 ± 0.013
1.094GlnIle: 1.094 ± 0.019
0.489GlnLys: 0.489 ± 0.012
2.777GlnLeu: 2.777 ± 0.027
0.439GlnMet: 0.439 ± 0.011
0.474GlnAsn: 0.474 ± 0.014
1.735GlnPro: 1.735 ± 0.027
1.088GlnGln: 1.088 ± 0.028
2.327GlnArg: 2.327 ± 0.023
1.155GlnSer: 1.155 ± 0.02
1.28GlnThr: 1.28 ± 0.02
2.675GlnVal: 2.675 ± 0.027
0.48GlnTrp: 0.48 ± 0.013
0.533GlnTyr: 0.533 ± 0.013
0.0GlnXaa: 0.0 ± 0.0
Arg
10.217ArgAla: 10.217 ± 0.066
0.577ArgCys: 0.577 ± 0.013
4.404ArgAsp: 4.404 ± 0.028
4.779ArgGlu: 4.779 ± 0.045
2.461ArgPhe: 2.461 ± 0.028
5.725ArgGly: 5.725 ± 0.04
2.331ArgHis: 2.331 ± 0.028
3.153ArgIle: 3.153 ± 0.027
1.574ArgLys: 1.574 ± 0.02
9.935ArgLeu: 9.935 ± 0.06
1.887ArgMet: 1.887 ± 0.021
1.336ArgAsn: 1.336 ± 0.019
5.177ArgPro: 5.177 ± 0.041
2.485ArgGln: 2.485 ± 0.029
8.129ArgArg: 8.129 ± 0.066
3.773ArgSer: 3.773 ± 0.034
4.816ArgThr: 4.816 ± 0.04
6.359ArgVal: 6.359 ± 0.04
1.494ArgTrp: 1.494 ± 0.024
1.925ArgTyr: 1.925 ± 0.026
0.0ArgXaa: 0.0 ± 0.0
Ser
6.233SerAla: 6.233 ± 0.05
0.395SerCys: 0.395 ± 0.012
2.396SerAsp: 2.396 ± 0.025
2.285SerGlu: 2.285 ± 0.026
1.536SerPhe: 1.536 ± 0.018
5.798SerGly: 5.798 ± 0.047
0.906SerHis: 0.906 ± 0.016
1.637SerIle: 1.637 ± 0.021
0.951SerLys: 0.951 ± 0.019
4.678SerLeu: 4.678 ± 0.037
1.176SerMet: 1.176 ± 0.017
0.815SerAsn: 0.815 ± 0.018
3.23SerPro: 3.23 ± 0.03
1.168SerGln: 1.168 ± 0.022
3.644SerArg: 3.644 ± 0.033
2.592SerSer: 2.592 ± 0.034
2.778SerThr: 2.778 ± 0.03
3.965SerVal: 3.965 ± 0.038
0.972SerTrp: 0.972 ± 0.015
1.187SerTyr: 1.187 ± 0.019
0.0SerXaa: 0.0 ± 0.0
Thr
7.831ThrAla: 7.831 ± 0.049
0.456ThrCys: 0.456 ± 0.011
2.954ThrAsp: 2.954 ± 0.03
2.698ThrGlu: 2.698 ± 0.028
1.84ThrPhe: 1.84 ± 0.024
6.376ThrGly: 6.376 ± 0.042
1.112ThrHis: 1.112 ± 0.016
2.144ThrIle: 2.144 ± 0.027
1.111ThrLys: 1.111 ± 0.025
5.961ThrLeu: 5.961 ± 0.046
1.123ThrMet: 1.123 ± 0.018
0.955ThrAsn: 0.955 ± 0.021
4.025ThrPro: 4.025 ± 0.041
1.312ThrGln: 1.312 ± 0.02
4.005ThrArg: 4.005 ± 0.036
3.1ThrSer: 3.1 ± 0.034
3.666ThrThr: 3.666 ± 0.046
5.532ThrVal: 5.532 ± 0.046
1.054ThrTrp: 1.054 ± 0.016
1.404ThrTyr: 1.404 ± 0.025
0.0ThrXaa: 0.0 ± 0.0
Val
11.521ValAla: 11.521 ± 0.071
0.675ValCys: 0.675 ± 0.014
4.343ValAsp: 4.343 ± 0.035
4.781ValGlu: 4.781 ± 0.037
2.46ValPhe: 2.46 ± 0.027
6.312ValGly: 6.312 ± 0.046
1.835ValHis: 1.835 ± 0.025
3.303ValIle: 3.303 ± 0.034
1.75ValLys: 1.75 ± 0.025
9.827ValLeu: 9.827 ± 0.068
1.462ValMet: 1.462 ± 0.022
1.722ValAsn: 1.722 ± 0.025
5.365ValPro: 5.365 ± 0.048
1.979ValGln: 1.979 ± 0.026
7.225ValArg: 7.225 ± 0.043
4.477ValSer: 4.477 ± 0.038
6.002ValThr: 6.002 ± 0.054
8.051ValVal: 8.051 ± 0.067
1.161ValTrp: 1.161 ± 0.02
1.637ValTyr: 1.637 ± 0.02
0.0ValXaa: 0.0 ± 0.0
Trp
1.778TrpAla: 1.778 ± 0.023
0.153TrpCys: 0.153 ± 0.007
0.842TrpAsp: 0.842 ± 0.019
0.757TrpGlu: 0.757 ± 0.016
0.504TrpPhe: 0.504 ± 0.013
1.09TrpGly: 1.09 ± 0.019
0.429TrpHis: 0.429 ± 0.012
0.598TrpIle: 0.598 ± 0.012
0.33TrpLys: 0.33 ± 0.01
1.944TrpLeu: 1.944 ± 0.024
0.332TrpMet: 0.332 ± 0.01
0.436TrpAsn: 0.436 ± 0.012
0.927TrpPro: 0.927 ± 0.016
0.633TrpGln: 0.633 ± 0.013
1.552TrpArg: 1.552 ± 0.026
1.006TrpSer: 1.006 ± 0.017
1.022TrpThr: 1.022 ± 0.016
1.058TrpVal: 1.058 ± 0.018
0.387TrpTrp: 0.387 ± 0.011
0.386TrpTyr: 0.386 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.786TyrAla: 2.786 ± 0.028
0.181TyrCys: 0.181 ± 0.007
1.477TyrAsp: 1.477 ± 0.025
1.265TyrGlu: 1.265 ± 0.02
0.651TyrPhe: 0.651 ± 0.013
2.285TyrGly: 2.285 ± 0.027
0.449TyrHis: 0.449 ± 0.012
0.499TyrIle: 0.499 ± 0.012
0.365TyrLys: 0.365 ± 0.012
2.373TyrLeu: 2.373 ± 0.025
0.266TyrMet: 0.266 ± 0.008
0.447TyrAsn: 0.447 ± 0.012
1.138TyrPro: 1.138 ± 0.019
0.643TyrGln: 0.643 ± 0.016
1.931TyrArg: 1.931 ± 0.026
0.938TyrSer: 0.938 ± 0.019
1.232TyrThr: 1.232 ± 0.02
1.849TyrVal: 1.849 ± 0.022
0.384TyrTrp: 0.384 ± 0.011
0.497TyrTyr: 0.497 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11313 proteins (3677589 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
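A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. This is only a sketch under assumptions: the function names and toy sequences are invented, and taking the standard deviation over 100 resamples as the reported ± value is a guess, since the note above does not say which statistic is used.

```python
import random
import statistics


def dipeptide_permille(sequences, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy "proteome" in one-letter code, not real data.
toy_proteome = ["MAALAA", "GAAV", "MKLV", "AAGAA"]
point = dipeptide_permille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins instead of treating neighbouring dipeptides as independent observations.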

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski