Amino acid dipepetide frequency for Arthrobacter sp. MN05-02

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.674AlaAla: 19.674 ± 0.211
0.795AlaCys: 0.795 ± 0.029
7.742AlaAsp: 7.742 ± 0.081
7.646AlaGlu: 7.646 ± 0.088
3.822AlaPhe: 3.822 ± 0.061
13.125AlaGly: 13.125 ± 0.136
2.367AlaHis: 2.367 ± 0.049
4.614AlaIle: 4.614 ± 0.078
2.482AlaLys: 2.482 ± 0.05
13.43AlaLeu: 13.43 ± 0.146
2.605AlaMet: 2.605 ± 0.047
2.172AlaAsn: 2.172 ± 0.049
6.612AlaPro: 6.612 ± 0.113
3.752AlaGln: 3.752 ± 0.062
8.948AlaArg: 8.948 ± 0.089
7.119AlaSer: 7.119 ± 0.087
6.799AlaThr: 6.799 ± 0.089
11.882AlaVal: 11.882 ± 0.115
1.67AlaTrp: 1.67 ± 0.042
2.314AlaTyr: 2.314 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.727CysAla: 0.727 ± 0.028
0.071CysCys: 0.071 ± 0.009
0.325CysAsp: 0.325 ± 0.017
0.28CysGlu: 0.28 ± 0.019
0.166CysPhe: 0.166 ± 0.012
0.718CysGly: 0.718 ± 0.025
0.133CysHis: 0.133 ± 0.011
0.249CysIle: 0.249 ± 0.016
0.079CysLys: 0.079 ± 0.01
0.555CysLeu: 0.555 ± 0.022
0.095CysMet: 0.095 ± 0.009
0.118CysAsn: 0.118 ± 0.012
0.392CysPro: 0.392 ± 0.02
0.147CysGln: 0.147 ± 0.011
0.484CysArg: 0.484 ± 0.024
0.434CysSer: 0.434 ± 0.021
0.412CysThr: 0.412 ± 0.023
0.454CysVal: 0.454 ± 0.022
0.083CysTrp: 0.083 ± 0.009
0.151CysTyr: 0.151 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
8.233AspAla: 8.233 ± 0.11
0.285AspCys: 0.285 ± 0.016
4.001AspAsp: 4.001 ± 0.066
3.76AspGlu: 3.76 ± 0.066
1.78AspPhe: 1.78 ± 0.046
6.311AspGly: 6.311 ± 0.09
1.313AspHis: 1.313 ± 0.035
2.381AspIle: 2.381 ± 0.062
1.006AspLys: 1.006 ± 0.034
6.417AspLeu: 6.417 ± 0.069
0.838AspMet: 0.838 ± 0.031
0.994AspAsn: 0.994 ± 0.047
4.101AspPro: 4.101 ± 0.068
1.678AspGln: 1.678 ± 0.041
4.372AspArg: 4.372 ± 0.066
2.721AspSer: 2.721 ± 0.05
3.214AspThr: 3.214 ± 0.06
5.455AspVal: 5.455 ± 0.074
0.801AspTrp: 0.801 ± 0.029
1.35AspTyr: 1.35 ± 0.04
0.0AspXaa: 0.0 ± 0.0
Glu
7.124GluAla: 7.124 ± 0.099
0.287GluCys: 0.287 ± 0.017
3.548GluAsp: 3.548 ± 0.062
3.529GluGlu: 3.529 ± 0.056
1.596GluPhe: 1.596 ± 0.038
4.488GluGly: 4.488 ± 0.068
1.5GluHis: 1.5 ± 0.039
2.336GluIle: 2.336 ± 0.057
1.407GluLys: 1.407 ± 0.039
6.265GluLeu: 6.265 ± 0.074
0.876GluMet: 0.876 ± 0.027
1.317GluAsn: 1.317 ± 0.037
3.055GluPro: 3.055 ± 0.059
2.441GluGln: 2.441 ± 0.057
4.549GluArg: 4.549 ± 0.074
2.799GluSer: 2.799 ± 0.054
2.805GluThr: 2.805 ± 0.049
4.478GluVal: 4.478 ± 0.065
0.701GluTrp: 0.701 ± 0.029
1.112GluTyr: 1.112 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
3.79PheAla: 3.79 ± 0.059
0.204PheCys: 0.204 ± 0.016
2.297PheAsp: 2.297 ± 0.048
1.707PheGlu: 1.707 ± 0.039
1.088PhePhe: 1.088 ± 0.037
3.221PheGly: 3.221 ± 0.057
0.55PheHis: 0.55 ± 0.024
1.212PheIle: 1.212 ± 0.038
0.521PheLys: 0.521 ± 0.025
3.124PheLeu: 3.124 ± 0.058
0.466PheMet: 0.466 ± 0.024
0.764PheAsn: 0.764 ± 0.025
1.436PhePro: 1.436 ± 0.038
0.788PheGln: 0.788 ± 0.032
1.899PheArg: 1.899 ± 0.042
1.957PheSer: 1.957 ± 0.041
2.077PheThr: 2.077 ± 0.047
2.533PheVal: 2.533 ± 0.049
0.468PheTrp: 0.468 ± 0.021
0.66PheTyr: 0.66 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
10.778GlyAla: 10.778 ± 0.12
0.647GlyCys: 0.647 ± 0.028
4.761GlyAsp: 4.761 ± 0.082
4.79GlyGlu: 4.79 ± 0.078
3.129GlyPhe: 3.129 ± 0.054
8.12GlyGly: 8.12 ± 0.127
2.014GlyHis: 2.014 ± 0.047
4.663GlyIle: 4.663 ± 0.083
2.228GlyLys: 2.228 ± 0.053
9.006GlyLeu: 9.006 ± 0.114
1.981GlyMet: 1.981 ± 0.046
2.017GlyAsn: 2.017 ± 0.04
4.589GlyPro: 4.589 ± 0.075
2.8GlyGln: 2.8 ± 0.052
7.114GlyArg: 7.114 ± 0.1
6.188GlySer: 6.188 ± 0.088
6.586GlyThr: 6.586 ± 0.089
7.505GlyVal: 7.505 ± 0.101
1.493GlyTrp: 1.493 ± 0.041
2.242GlyTyr: 2.242 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.367HisAla: 2.367 ± 0.05
0.136HisCys: 0.136 ± 0.01
1.269HisAsp: 1.269 ± 0.038
1.133HisGlu: 1.133 ± 0.029
0.63HisPhe: 0.63 ± 0.023
2.115HisGly: 2.115 ± 0.051
0.677HisHis: 0.677 ± 0.028
0.69HisIle: 0.69 ± 0.025
0.296HisLys: 0.296 ± 0.017
2.28HisLeu: 2.28 ± 0.048
0.318HisMet: 0.318 ± 0.016
0.367HisAsn: 0.367 ± 0.019
1.604HisPro: 1.604 ± 0.038
0.658HisGln: 0.658 ± 0.025
1.921HisArg: 1.921 ± 0.041
1.063HisSer: 1.063 ± 0.032
1.015HisThr: 1.015 ± 0.032
1.63HisVal: 1.63 ± 0.042
0.298HisTrp: 0.298 ± 0.017
0.461HisTyr: 0.461 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
5.546IleAla: 5.546 ± 0.087
0.293IleCys: 0.293 ± 0.017
3.051IleAsp: 3.051 ± 0.046
2.451IleGlu: 2.451 ± 0.048
1.221IlePhe: 1.221 ± 0.034
4.133IleGly: 4.133 ± 0.075
0.697IleHis: 0.697 ± 0.028
1.783IleIle: 1.783 ± 0.046
0.83IleLys: 0.83 ± 0.031
3.949IleLeu: 3.949 ± 0.072
0.642IleMet: 0.642 ± 0.022
1.001IleAsn: 1.001 ± 0.032
2.301IlePro: 2.301 ± 0.045
0.915IleGln: 0.915 ± 0.031
2.614IleArg: 2.614 ± 0.052
2.415IleSer: 2.415 ± 0.052
2.778IleThr: 2.778 ± 0.054
3.461IleVal: 3.461 ± 0.062
0.428IleTrp: 0.428 ± 0.022
0.697IleTyr: 0.697 ± 0.026
0.0IleXaa: 0.0 ± 0.0
Lys
2.597LysAla: 2.597 ± 0.06
0.074LysCys: 0.074 ± 0.008
1.333LysAsp: 1.333 ± 0.037
1.098LysGlu: 1.098 ± 0.038
0.494LysPhe: 0.494 ± 0.021
1.707LysGly: 1.707 ± 0.04
0.415LysHis: 0.415 ± 0.019
0.903LysIle: 0.903 ± 0.031
0.701LysLys: 0.701 ± 0.032
1.941LysLeu: 1.941 ± 0.052
0.395LysMet: 0.395 ± 0.019
0.552LysAsn: 0.552 ± 0.025
1.208LysPro: 1.208 ± 0.036
0.706LysGln: 0.706 ± 0.026
1.434LysArg: 1.434 ± 0.041
1.139LysSer: 1.139 ± 0.038
1.175LysThr: 1.175 ± 0.034
1.688LysVal: 1.688 ± 0.05
0.232LysTrp: 0.232 ± 0.014
0.488LysTyr: 0.488 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
14.072LeuAla: 14.072 ± 0.139
0.613LeuCys: 0.613 ± 0.024
6.559LeuAsp: 6.559 ± 0.09
5.552LeuGlu: 5.552 ± 0.092
2.937LeuPhe: 2.937 ± 0.071
9.337LeuGly: 9.337 ± 0.11
2.089LeuHis: 2.089 ± 0.047
3.992LeuIle: 3.992 ± 0.067
1.993LeuLys: 1.993 ± 0.053
10.908LeuLeu: 10.908 ± 0.14
1.78LeuMet: 1.78 ± 0.037
2.005LeuAsn: 2.005 ± 0.046
5.862LeuPro: 5.862 ± 0.08
3.031LeuGln: 3.031 ± 0.056
7.607LeuArg: 7.607 ± 0.098
5.799LeuSer: 5.799 ± 0.077
6.403LeuThr: 6.403 ± 0.089
9.113LeuVal: 9.113 ± 0.119
1.203LeuTrp: 1.203 ± 0.038
1.756LeuTyr: 1.756 ± 0.043
0.0LeuXaa: 0.0 ± 0.0
Met
2.334MetAla: 2.334 ± 0.052
0.104MetCys: 0.104 ± 0.01
0.966MetAsp: 0.966 ± 0.033
0.855MetGlu: 0.855 ± 0.028
0.512MetPhe: 0.512 ± 0.02
1.446MetGly: 1.446 ± 0.039
0.372MetHis: 0.372 ± 0.015
0.848MetIle: 0.848 ± 0.032
0.517MetLys: 0.517 ± 0.022
1.785MetLeu: 1.785 ± 0.045
0.338MetMet: 0.338 ± 0.016
0.519MetAsn: 0.519 ± 0.023
1.062MetPro: 1.062 ± 0.029
0.51MetGln: 0.51 ± 0.021
1.191MetArg: 1.191 ± 0.031
1.437MetSer: 1.437 ± 0.033
1.564MetThr: 1.564 ± 0.04
1.421MetVal: 1.421 ± 0.036
0.172MetTrp: 0.172 ± 0.014
0.302MetTyr: 0.302 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.473AsnAla: 2.473 ± 0.053
0.147AsnCys: 0.147 ± 0.012
1.183AsnAsp: 1.183 ± 0.038
1.063AsnGlu: 1.063 ± 0.029
0.653AsnPhe: 0.653 ± 0.025
2.029AsnGly: 2.029 ± 0.042
0.453AsnHis: 0.453 ± 0.02
0.912AsnIle: 0.912 ± 0.029
0.433AsnLys: 0.433 ± 0.022
2.097AsnLeu: 2.097 ± 0.038
0.341AsnMet: 0.341 ± 0.021
0.549AsnAsn: 0.549 ± 0.028
1.584AsnPro: 1.584 ± 0.041
0.677AsnGln: 0.677 ± 0.028
1.436AsnArg: 1.436 ± 0.035
1.131AsnSer: 1.131 ± 0.047
1.257AsnThr: 1.257 ± 0.033
1.559AsnVal: 1.559 ± 0.038
0.346AsnTrp: 0.346 ± 0.017
0.476AsnTyr: 0.476 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
8.099ProAla: 8.099 ± 0.109
0.281ProCys: 0.281 ± 0.017
3.951ProAsp: 3.951 ± 0.067
3.903ProGlu: 3.903 ± 0.064
1.714ProPhe: 1.714 ± 0.041
5.824ProGly: 5.824 ± 0.082
1.185ProHis: 1.185 ± 0.034
1.714ProIle: 1.714 ± 0.047
0.988ProLys: 0.988 ± 0.031
5.017ProLeu: 5.017 ± 0.076
1.019ProMet: 1.019 ± 0.029
0.973ProAsn: 0.973 ± 0.034
2.648ProPro: 2.648 ± 0.062
1.637ProGln: 1.637 ± 0.047
3.511ProArg: 3.511 ± 0.065
3.686ProSer: 3.686 ± 0.062
3.206ProThr: 3.206 ± 0.072
5.243ProVal: 5.243 ± 0.073
0.797ProTrp: 0.797 ± 0.029
1.057ProTyr: 1.057 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
3.789GlnAla: 3.789 ± 0.068
0.15GlnCys: 0.15 ± 0.012
1.849GlnAsp: 1.849 ± 0.044
1.7GlnGlu: 1.7 ± 0.043
0.856GlnPhe: 0.856 ± 0.029
2.548GlnGly: 2.548 ± 0.053
0.758GlnHis: 0.758 ± 0.026
1.209GlnIle: 1.209 ± 0.04
0.65GlnLys: 0.65 ± 0.025
3.305GlnLeu: 3.305 ± 0.06
0.552GlnMet: 0.552 ± 0.022
0.641GlnAsn: 0.641 ± 0.029
1.714GlnPro: 1.714 ± 0.043
1.415GlnGln: 1.415 ± 0.042
2.503GlnArg: 2.503 ± 0.046
1.481GlnSer: 1.481 ± 0.04
1.38GlnThr: 1.38 ± 0.036
2.499GlnVal: 2.499 ± 0.054
0.505GlnTrp: 0.505 ± 0.025
0.663GlnTyr: 0.663 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
8.366ArgAla: 8.366 ± 0.09
0.463ArgCys: 0.463 ± 0.023
4.025ArgAsp: 4.025 ± 0.066
3.986ArgGlu: 3.986 ± 0.065
2.324ArgPhe: 2.324 ± 0.047
5.481ArgGly: 5.481 ± 0.077
1.757ArgHis: 1.757 ± 0.04
3.49ArgIle: 3.49 ± 0.052
1.642ArgLys: 1.642 ± 0.043
7.328ArgLeu: 7.328 ± 0.091
1.649ArgMet: 1.649 ± 0.039
1.675ArgAsn: 1.675 ± 0.04
4.158ArgPro: 4.158 ± 0.064
2.405ArgGln: 2.405 ± 0.044
7.272ArgArg: 7.272 ± 0.119
4.762ArgSer: 4.762 ± 0.07
4.775ArgThr: 4.775 ± 0.071
5.306ArgVal: 5.306 ± 0.084
1.153ArgTrp: 1.153 ± 0.037
1.564ArgTyr: 1.564 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
7.315SerAla: 7.315 ± 0.089
0.389SerCys: 0.389 ± 0.019
3.132SerAsp: 3.132 ± 0.064
2.97SerGlu: 2.97 ± 0.058
1.998SerPhe: 1.998 ± 0.041
6.093SerGly: 6.093 ± 0.082
1.094SerHis: 1.094 ± 0.029
2.537SerIle: 2.537 ± 0.052
1.174SerLys: 1.174 ± 0.031
5.486SerLeu: 5.486 ± 0.08
1.325SerMet: 1.325 ± 0.04
1.199SerAsn: 1.199 ± 0.035
3.333SerPro: 3.333 ± 0.057
1.478SerGln: 1.478 ± 0.037
4.182SerArg: 4.182 ± 0.067
4.06SerSer: 4.06 ± 0.079
4.016SerThr: 4.016 ± 0.065
4.817SerVal: 4.817 ± 0.067
0.95SerTrp: 0.95 ± 0.03
1.341SerTyr: 1.341 ± 0.04
0.0SerXaa: 0.0 ± 0.0
Thr
8.109ThrAla: 8.109 ± 0.11
0.336ThrCys: 0.336 ± 0.019
3.54ThrAsp: 3.54 ± 0.06
3.108ThrGlu: 3.108 ± 0.055
1.865ThrPhe: 1.865 ± 0.041
6.15ThrGly: 6.15 ± 0.082
1.113ThrHis: 1.113 ± 0.034
2.605ThrIle: 2.605 ± 0.052
1.172ThrLys: 1.172 ± 0.036
5.901ThrLeu: 5.901 ± 0.079
1.019ThrMet: 1.019 ± 0.029
1.191ThrAsn: 1.191 ± 0.034
3.983ThrPro: 3.983 ± 0.072
1.495ThrGln: 1.495 ± 0.036
3.842ThrArg: 3.842 ± 0.064
3.656ThrSer: 3.656 ± 0.057
4.115ThrThr: 4.115 ± 0.07
5.61ThrVal: 5.61 ± 0.084
0.782ThrTrp: 0.782 ± 0.028
1.312ThrTyr: 1.312 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
10.427ValAla: 10.427 ± 0.097
0.535ValCys: 0.535 ± 0.024
5.471ValAsp: 5.471 ± 0.078
4.917ValGlu: 4.917 ± 0.073
2.693ValPhe: 2.693 ± 0.058
6.921ValGly: 6.921 ± 0.078
1.805ValHis: 1.805 ± 0.043
3.782ValIle: 3.782 ± 0.067
1.451ValLys: 1.451 ± 0.046
9.917ValLeu: 9.917 ± 0.116
1.477ValMet: 1.477 ± 0.033
1.843ValAsn: 1.843 ± 0.052
5.077ValPro: 5.077 ± 0.072
2.407ValGln: 2.407 ± 0.043
6.019ValArg: 6.019 ± 0.084
4.84ValSer: 4.84 ± 0.067
5.276ValThr: 5.276 ± 0.082
8.453ValVal: 8.453 ± 0.103
0.978ValTrp: 0.978 ± 0.027
1.456ValTyr: 1.456 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
1.386TrpAla: 1.386 ± 0.035
0.109TrpCys: 0.109 ± 0.01
0.727TrpAsp: 0.727 ± 0.027
0.594TrpGlu: 0.594 ± 0.023
0.514TrpPhe: 0.514 ± 0.025
0.95TrpGly: 0.95 ± 0.033
0.326TrpHis: 0.326 ± 0.017
0.613TrpIle: 0.613 ± 0.024
0.35TrpLys: 0.35 ± 0.02
1.655TrpLeu: 1.655 ± 0.042
0.285TrpMet: 0.285 ± 0.017
0.461TrpAsn: 0.461 ± 0.021
0.629TrpPro: 0.629 ± 0.026
0.592TrpGln: 0.592 ± 0.026
1.039TrpArg: 1.039 ± 0.036
0.921TrpSer: 0.921 ± 0.033
0.902TrpThr: 0.902 ± 0.026
1.048TrpVal: 1.048 ± 0.029
0.322TrpTrp: 0.322 ± 0.02
0.285TrpTyr: 0.285 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.364TyrAla: 2.364 ± 0.051
0.154TyrCys: 0.154 ± 0.012
1.28TyrAsp: 1.28 ± 0.034
1.169TyrGlu: 1.169 ± 0.032
0.769TyrPhe: 0.769 ± 0.028
1.936TyrGly: 1.936 ± 0.041
0.31TyrHis: 0.31 ± 0.017
0.672TyrIle: 0.672 ± 0.028
0.36TyrLys: 0.36 ± 0.02
2.246TyrLeu: 2.246 ± 0.048
0.255TyrMet: 0.255 ± 0.017
0.443TyrAsn: 0.443 ± 0.024
1.104TyrPro: 1.104 ± 0.034
0.638TyrGln: 0.638 ± 0.025
1.644TyrArg: 1.644 ± 0.04
1.279TyrSer: 1.279 ± 0.03
1.149TyrThr: 1.149 ± 0.035
1.59TyrVal: 1.59 ± 0.038
0.326TyrTrp: 0.326 ± 0.018
0.507TyrTyr: 0.507 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3539 proteins (1059819 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski