Amino acid dipeptide frequency for Aeriscardovia aeriphila

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
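The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein, and the "expected" value is assumed to be the uniform 1/400 baseline mentioned in the text.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform baseline from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein, and
    pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform baseline (red/blue in the table)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"
```

With the values from the table, `classify(11.083)` (AlaAla) gives "over" and `classify(0.095)` (CysCys) gives "under".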

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.083 ± 0.254
AlaCys: 0.815 ± 0.042
AlaAsp: 6.22 ± 0.109
AlaGlu: 5.691 ± 0.153
AlaPhe: 3.359 ± 0.094
AlaGly: 7.663 ± 0.153
AlaHis: 2.9 ± 0.089
AlaIle: 5.007 ± 0.114
AlaLys: 4.021 ± 0.104
AlaLeu: 10.011 ± 0.176
AlaMet: 2.546 ± 0.08
AlaAsn: 3.29 ± 0.09
AlaPro: 3.831 ± 0.113
AlaGln: 6.873 ± 0.182
AlaArg: 6.163 ± 0.126
AlaSer: 7.492 ± 0.174
AlaThr: 5.546 ± 0.121
AlaVal: 7.031 ± 0.135
AlaTrp: 1.337 ± 0.057
AlaTyr: 2.471 ± 0.08
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.792 ± 0.045
CysCys: 0.095 ± 0.016
CysAsp: 0.457 ± 0.034
CysGlu: 0.444 ± 0.031
CysPhe: 0.308 ± 0.028
CysGly: 0.668 ± 0.041
CysHis: 0.217 ± 0.02
CysIle: 0.322 ± 0.027
CysLys: 0.225 ± 0.021
CysLeu: 0.649 ± 0.039
CysMet: 0.166 ± 0.018
CysAsn: 0.187 ± 0.021
CysPro: 0.377 ± 0.03
CysGln: 0.257 ± 0.025
CysArg: 0.326 ± 0.025
CysSer: 0.577 ± 0.035
CysThr: 0.472 ± 0.038
CysVal: 0.611 ± 0.035
CysTrp: 0.133 ± 0.018
CysTyr: 0.198 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.655 ± 0.125
AspCys: 0.434 ± 0.032
AspAsp: 3.817 ± 0.133
AspGlu: 4.752 ± 0.133
AspPhe: 2.327 ± 0.071
AspGly: 4.427 ± 0.118
AspHis: 1.26 ± 0.054
AspIle: 2.854 ± 0.096
AspLys: 2.492 ± 0.084
AspLeu: 4.842 ± 0.104
AspMet: 1.487 ± 0.059
AspAsn: 2.169 ± 0.074
AspPro: 3.437 ± 0.082
AspGln: 2.317 ± 0.067
AspArg: 2.614 ± 0.092
AspSer: 4.044 ± 0.102
AspThr: 2.991 ± 0.097
AspVal: 4.337 ± 0.109
AspTrp: 0.931 ± 0.052
AspTyr: 1.868 ± 0.069
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.853 ± 0.14
GluCys: 0.354 ± 0.028
GluAsp: 3.046 ± 0.103
GluGlu: 4.284 ± 0.123
GluPhe: 1.736 ± 0.071
GluGly: 3.69 ± 0.096
GluHis: 1.725 ± 0.073
GluIle: 3.073 ± 0.095
GluLys: 3.945 ± 0.097
GluLeu: 5.649 ± 0.136
GluMet: 1.55 ± 0.071
GluAsn: 2.692 ± 0.09
GluPro: 2.66 ± 0.093
GluGln: 3.574 ± 0.119
GluArg: 3.528 ± 0.099
GluSer: 3.124 ± 0.094
GluThr: 2.909 ± 0.091
GluVal: 3.819 ± 0.099
GluTrp: 0.72 ± 0.043
GluTyr: 1.594 ± 0.069
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.804 ± 0.107
PheCys: 0.305 ± 0.026
PheAsp: 2.5 ± 0.071
PheGlu: 1.864 ± 0.065
PhePhe: 1.472 ± 0.068
PheGly: 2.808 ± 0.079
PheHis: 0.779 ± 0.046
PheIle: 1.769 ± 0.08
PheLys: 1.043 ± 0.059
PheLeu: 3.128 ± 0.081
PheMet: 0.794 ± 0.045
PheAsn: 1.308 ± 0.048
PhePro: 1.356 ± 0.059
PheGln: 1.161 ± 0.058
PheArg: 1.535 ± 0.059
PheSer: 2.997 ± 0.09
PheThr: 2.45 ± 0.085
PheVal: 2.692 ± 0.077
PheTrp: 0.487 ± 0.04
PheTyr: 0.941 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.815 ± 0.131
GlyCys: 0.51 ± 0.034
GlyAsp: 3.69 ± 0.105
GlyGlu: 4.625 ± 0.108
GlyPhe: 3.086 ± 0.082
GlyGly: 4.587 ± 0.129
GlyHis: 1.573 ± 0.061
GlyIle: 4.179 ± 0.089
GlyLys: 4.012 ± 0.098
GlyLeu: 6.399 ± 0.13
GlyMet: 1.952 ± 0.07
GlyAsn: 2.704 ± 0.1
GlyPro: 2.106 ± 0.081
GlyGln: 2.827 ± 0.082
GlyArg: 3.431 ± 0.085
GlySer: 4.977 ± 0.117
GlyThr: 4.236 ± 0.104
GlyVal: 5.491 ± 0.122
GlyTrp: 1.188 ± 0.055
GlyTyr: 2.428 ± 0.087
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.222 ± 0.076
HisCys: 0.164 ± 0.017
HisAsp: 1.584 ± 0.058
HisGlu: 1.527 ± 0.066
HisPhe: 0.792 ± 0.045
HisGly: 1.782 ± 0.069
HisHis: 0.813 ± 0.046
HisIle: 1.319 ± 0.053
HisLys: 0.811 ± 0.041
HisLeu: 2.108 ± 0.06
HisMet: 0.663 ± 0.041
HisAsn: 0.923 ± 0.049
HisPro: 1.327 ± 0.047
HisGln: 0.95 ± 0.04
HisArg: 1.533 ± 0.058
HisSer: 1.656 ± 0.071
HisThr: 1.415 ± 0.053
HisVal: 1.816 ± 0.06
HisTrp: 0.341 ± 0.03
HisTyr: 0.75 ± 0.042
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.691 ± 0.117
IleCys: 0.558 ± 0.033
IleAsp: 3.372 ± 0.091
IleGlu: 2.791 ± 0.076
IlePhe: 1.672 ± 0.071
IleGly: 3.692 ± 0.103
IleHis: 1.102 ± 0.051
IleIle: 3.031 ± 0.092
IleLys: 1.502 ± 0.06
IleLeu: 4.168 ± 0.088
IleMet: 0.967 ± 0.048
IleAsn: 1.959 ± 0.068
IlePro: 2.94 ± 0.078
IleGln: 1.495 ± 0.05
IleArg: 2.818 ± 0.081
IleSer: 3.844 ± 0.106
IleThr: 3.41 ± 0.091
IleVal: 4.038 ± 0.095
IleTrp: 0.548 ± 0.039
IleTyr: 1.234 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.878 ± 0.128
LysCys: 0.173 ± 0.017
LysAsp: 2.511 ± 0.09
LysGlu: 2.271 ± 0.07
LysPhe: 1.059 ± 0.047
LysGly: 2.749 ± 0.085
LysHis: 1.026 ± 0.05
LysIle: 1.955 ± 0.074
LysLys: 2.879 ± 0.088
LysLeu: 3.991 ± 0.109
LysMet: 1.125 ± 0.054
LysAsn: 1.965 ± 0.074
LysPro: 2.511 ± 0.061
LysGln: 2.125 ± 0.071
LysArg: 2.563 ± 0.083
LysSer: 2.572 ± 0.077
LysThr: 2.487 ± 0.079
LysVal: 2.784 ± 0.092
LysTrp: 0.493 ± 0.034
LysTyr: 1.158 ± 0.049
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.055 ± 0.164
LeuCys: 0.796 ± 0.042
LeuAsp: 5.586 ± 0.14
LeuGlu: 4.766 ± 0.104
LeuPhe: 3.366 ± 0.099
LeuGly: 6.401 ± 0.114
LeuHis: 2.146 ± 0.068
LeuIle: 4.714 ± 0.119
LeuLys: 3.802 ± 0.107
LeuLeu: 8.596 ± 0.202
LeuMet: 2.014 ± 0.071
LeuAsn: 3.385 ± 0.078
LeuPro: 4.712 ± 0.103
LeuGln: 3.212 ± 0.086
LeuArg: 5.217 ± 0.111
LeuSer: 7.016 ± 0.144
LeuThr: 6.064 ± 0.119
LeuVal: 6.913 ± 0.145
LeuTrp: 1.22 ± 0.053
LeuTyr: 2.062 ± 0.067
LeuXaa: 0.002 ± 0.002
Met
MetAla: 2.599 ± 0.076
MetCys: 0.179 ± 0.019
MetAsp: 1.302 ± 0.062
MetGlu: 1.013 ± 0.048
MetPhe: 0.632 ± 0.042
MetGly: 1.74 ± 0.06
MetHis: 0.524 ± 0.035
MetIle: 1.106 ± 0.048
MetLys: 1.278 ± 0.052
MetLeu: 2.182 ± 0.083
MetMet: 0.659 ± 0.033
MetAsn: 1.142 ± 0.057
MetPro: 1.38 ± 0.054
MetGln: 0.872 ± 0.043
MetArg: 1.535 ± 0.058
MetSer: 1.976 ± 0.063
MetThr: 1.523 ± 0.059
MetVal: 1.717 ± 0.058
MetTrp: 0.301 ± 0.027
MetTyr: 0.539 ± 0.033
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.732 ± 0.09
AsnCys: 0.246 ± 0.022
AsnAsp: 1.969 ± 0.074
AsnGlu: 2.011 ± 0.076
AsnPhe: 1.323 ± 0.057
AsnGly: 2.949 ± 0.108
AsnHis: 0.746 ± 0.044
AsnIle: 1.717 ± 0.06
AsnLys: 1.613 ± 0.067
AsnLeu: 3.524 ± 0.079
AsnMet: 0.914 ± 0.05
AsnAsn: 1.546 ± 0.072
AsnPro: 2.49 ± 0.088
AsnGln: 1.516 ± 0.06
AsnArg: 1.752 ± 0.075
AsnSer: 2.694 ± 0.082
AsnThr: 2.271 ± 0.073
AsnVal: 2.313 ± 0.07
AsnTrp: 0.609 ± 0.036
AsnTyr: 1.051 ± 0.052
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.27 ± 0.157
ProCys: 0.255 ± 0.022
ProAsp: 2.938 ± 0.082
ProGlu: 3.265 ± 0.079
ProPhe: 1.588 ± 0.068
ProGly: 3.269 ± 0.091
ProHis: 1.304 ± 0.048
ProIle: 1.974 ± 0.072
ProLys: 1.342 ± 0.062
ProLeu: 4.095 ± 0.103
ProMet: 0.828 ± 0.045
ProAsn: 1.436 ± 0.056
ProPro: 1.085 ± 0.062
ProGln: 2.852 ± 0.102
ProArg: 2.527 ± 0.079
ProSer: 3.391 ± 0.091
ProThr: 2.78 ± 0.072
ProVal: 3.958 ± 0.096
ProTrp: 0.659 ± 0.038
ProTyr: 1.228 ± 0.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.961 ± 0.166
GlnCys: 0.251 ± 0.021
GlnAsp: 2.016 ± 0.064
GlnGlu: 2.717 ± 0.085
GlnPhe: 1.382 ± 0.056
GlnGly: 2.961 ± 0.086
GlnHis: 1.137 ± 0.047
GlnIle: 2.424 ± 0.073
GlnLys: 2.127 ± 0.075
GlnLeu: 4.52 ± 0.112
GlnMet: 1.382 ± 0.054
GlnAsn: 1.443 ± 0.056
GlnPro: 2.382 ± 0.115
GlnGln: 2.871 ± 0.106
GlnArg: 2.806 ± 0.081
GlnSer: 2.405 ± 0.074
GlnThr: 2.41 ± 0.07
GlnVal: 3.208 ± 0.08
GlnTrp: 0.714 ± 0.044
GlnTyr: 1.171 ± 0.056
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.154 ± 0.136
ArgCys: 0.32 ± 0.026
ArgAsp: 3.039 ± 0.085
ArgGlu: 4.006 ± 0.102
ArgPhe: 2.216 ± 0.072
ArgGly: 3.195 ± 0.092
ArgHis: 1.464 ± 0.058
ArgIle: 2.98 ± 0.084
ArgLys: 2.561 ± 0.079
ArgLeu: 5.066 ± 0.125
ArgMet: 1.588 ± 0.061
ArgAsn: 2.1 ± 0.072
ArgPro: 2.182 ± 0.066
ArgGln: 2.506 ± 0.081
ArgArg: 3.871 ± 0.138
ArgSer: 3.64 ± 0.091
ArgThr: 3.159 ± 0.088
ArgVal: 4.147 ± 0.091
ArgTrp: 0.849 ± 0.038
ArgTyr: 1.689 ± 0.062
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.818 ± 0.197
SerCys: 0.503 ± 0.033
SerAsp: 4.505 ± 0.124
SerGlu: 4.181 ± 0.119
SerPhe: 2.53 ± 0.086
SerGly: 5.03 ± 0.113
SerHis: 1.879 ± 0.059
SerIle: 3.233 ± 0.078
SerLys: 2.376 ± 0.08
SerLeu: 6.807 ± 0.133
SerMet: 1.634 ± 0.051
SerAsn: 2.249 ± 0.078
SerPro: 2.881 ± 0.085
SerGln: 3.652 ± 0.106
SerArg: 3.932 ± 0.103
SerSer: 6.27 ± 0.202
SerThr: 4.482 ± 0.111
SerVal: 5.261 ± 0.117
SerTrp: 0.988 ± 0.043
SerTyr: 1.733 ± 0.064
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.771 ± 0.11
ThrCys: 0.423 ± 0.027
ThrAsp: 3.284 ± 0.088
ThrGlu: 2.557 ± 0.083
ThrPhe: 2.138 ± 0.079
ThrGly: 4.533 ± 0.093
ThrHis: 1.447 ± 0.054
ThrIle: 3.151 ± 0.085
ThrLys: 2.18 ± 0.071
ThrLeu: 5.752 ± 0.129
ThrMet: 1.198 ± 0.052
ThrAsn: 2.024 ± 0.071
ThrPro: 3.235 ± 0.092
ThrGln: 2.557 ± 0.08
ThrArg: 3.178 ± 0.083
ThrSer: 4.674 ± 0.12
ThrThr: 3.608 ± 0.1
ThrVal: 4.741 ± 0.115
ThrTrp: 0.798 ± 0.048
ThrTyr: 1.721 ± 0.062
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.425 ± 0.155
ValCys: 0.699 ± 0.04
ValAsp: 4.729 ± 0.099
ValGlu: 4.488 ± 0.096
ValPhe: 2.612 ± 0.074
ValGly: 4.678 ± 0.106
ValHis: 1.542 ± 0.056
ValIle: 4.113 ± 0.098
ValLys: 3.389 ± 0.087
ValLeu: 6.797 ± 0.155
ValMet: 1.771 ± 0.065
ValAsn: 2.759 ± 0.072
ValPro: 3.292 ± 0.097
ValGln: 2.494 ± 0.091
ValArg: 3.96 ± 0.105
ValSer: 5.681 ± 0.136
ValThr: 4.499 ± 0.12
ValVal: 6.066 ± 0.143
ValTrp: 1.022 ± 0.045
ValTyr: 1.797 ± 0.07
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.243 ± 0.054
TrpCys: 0.118 ± 0.015
TrpAsp: 0.872 ± 0.047
TrpGlu: 0.638 ± 0.034
TrpPhe: 0.548 ± 0.033
TrpGly: 0.866 ± 0.045
TrpHis: 0.35 ± 0.032
TrpIle: 0.714 ± 0.046
TrpLys: 0.729 ± 0.043
TrpLeu: 1.54 ± 0.063
TrpMet: 0.407 ± 0.029
TrpAsn: 0.714 ± 0.046
TrpPro: 0.505 ± 0.038
TrpGln: 0.731 ± 0.042
TrpArg: 0.773 ± 0.048
TrpSer: 0.893 ± 0.042
TrpThr: 0.735 ± 0.039
TrpVal: 0.933 ± 0.045
TrpTrp: 0.308 ± 0.025
TrpTyr: 0.442 ± 0.029
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.527 ± 0.078
TyrCys: 0.282 ± 0.024
TyrAsp: 1.62 ± 0.069
TyrGlu: 1.632 ± 0.065
TyrPhe: 1.049 ± 0.05
TyrGly: 2.132 ± 0.078
TyrHis: 0.564 ± 0.036
TyrIle: 1.198 ± 0.052
TyrLys: 0.948 ± 0.057
TyrLeu: 2.22 ± 0.075
TyrMet: 0.552 ± 0.037
TyrAsn: 0.99 ± 0.049
TyrPro: 1.352 ± 0.055
TyrGln: 1.445 ± 0.062
TyrArg: 1.668 ± 0.067
TyrSer: 2.024 ± 0.085
TyrThr: 1.525 ± 0.065
TyrVal: 1.936 ± 0.065
TyrTrp: 0.413 ± 0.033
TyrTyr: 0.821 ± 0.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.002 ± 0.002
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1287 proteins (474780 amino acids)
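As a rough worked example of the per mille scale, the table values can be turned into fractions and approximate occurrence counts. This sketch assumes that every protein of length L contributes L - 1 overlapping dipeptides, so the proteome would contain 474780 - 1287 dipeptides in total; that total and the helper names are assumptions for illustration and are not stated on this page.

```python
N_PROTEINS = 1287
N_RESIDUES = 474780

# Assumption: one dipeptide fewer than residues in each protein.
N_DIPEPTIDES = N_RESIDUES - N_PROTEINS


def permille_to_fraction(value_permille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_permille * 1e-3


def approx_count(value_permille):
    """Estimate how many times a dipeptide occurs, given its per mille frequency."""
    return round(permille_to_fraction(value_permille) * N_DIPEPTIDES)


print(permille_to_fraction(11.083))  # AlaAla as a fraction of all dipeptides
print(approx_count(11.083))          # rough number of AlaAla occurrences
print(approx_count(0.002))           # LeuXaa or XaaSer
```

Under this assumption, the smallest non-zero entries (0.002‰) correspond to roughly a single occurrence in the whole proteome.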

Note: The error has been estimated by bootstrapping (×100) at the protein level.
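A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates; the exact procedure used for this page is not described here, and `bootstrap_error` is a hypothetical name.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Sample standard deviation across replicates (needs replicates >= 2).
    return statistics.stdev(estimates)
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.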

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski