Amino acid dipeptide frequency for Planctomycetes bacterium Pan216

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
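As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the choice to count overlapping pairs without spanning protein boundaries are assumptions made for this example; the page does not state how Proteome-pI computes its values.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: pairs overlap (positions 1-2, 2-3, ...) and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every letter outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    toy = ["AAA", "AC"]  # toy input, not real proteome data
    freqs = dipeptide_frequencies(toy)
    print(len(freqs), freqs["AA"], freqs["AC"])
```

With real data, `sequences` would be the protein sequences of the proteome, for example read from a FASTA file.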

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
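The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names `to_fraction` and `representation` are hypothetical, and using exactly 1/400 (2.5‰) as the over/under threshold is an assumption based on the expectation stated above, not a documented rule of the page's colouring.

```python
PER_MILLE = 1e-3                     # one per mille expressed as a plain fraction
UNIFORM_EXPECTATION = 1000.0 / 400   # 2.5 per mille if all 400 dipeptides were equally likely


def to_fraction(per_mille):
    """Convert a value in per mille to a plain fraction."""
    return per_mille * PER_MILLE


def representation(per_mille):
    """Classify a per mille value relative to the uniform expectation (assumed threshold)."""
    if per_mille > UNIFORM_EXPECTATION:
        return "over"
    if per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "as expected"


if __name__ == "__main__":
    ala_ala = 9.467  # AlaAla value from the table, in per mille
    cys_cys = 0.231  # CysCys value from the table, in per mille
    print(to_fraction(ala_ala), representation(ala_ala))
    print(to_fraction(cys_cys), representation(cys_cys))
```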

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± error.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.467 ± 0.106 | 1.092 ± 0.025 | 5.64 ± 0.06 | 6.442 ± 0.075 | 3.468 ± 0.041 | 7.216 ± 0.07 | 1.821 ± 0.03 | 5.385 ± 0.05 | 3.93 ± 0.053 | 8.956 ± 0.08 | 2.347 ± 0.035 | 2.948 ± 0.048 | 4.373 ± 0.049 | 2.945 ± 0.037 | 6.372 ± 0.067 | 6.545 ± 0.057 | 5.786 ± 0.054 | 6.425 ± 0.068 | 1.329 ± 0.03 | 2.132 ± 0.034 | 0.0 ± 0.0
Cys | 0.817 ± 0.02 | 0.231 ± 0.014 | 0.63 ± 0.02 | 0.618 ± 0.019 | 0.441 ± 0.014 | 0.915 ± 0.028 | 0.438 ± 0.015 | 0.373 ± 0.013 | 0.279 ± 0.019 | 1.124 ± 0.023 | 0.191 ± 0.01 | 0.239 ± 0.011 | 0.689 ± 0.021 | 0.41 ± 0.015 | 0.89 ± 0.024 | 0.631 ± 0.021 | 0.413 ± 0.015 | 0.735 ± 0.021 | 0.225 ± 0.011 | 0.329 ± 0.015 | 0.0 ± 0.0
Asp | 5.828 ± 0.073 | 0.577 ± 0.017 | 4.159 ± 0.074 | 4.657 ± 0.059 | 2.244 ± 0.039 | 5.607 ± 0.138 | 1.583 ± 0.028 | 2.403 ± 0.038 | 1.733 ± 0.038 | 6.045 ± 0.065 | 0.965 ± 0.022 | 1.466 ± 0.033 | 3.892 ± 0.045 | 2.341 ± 0.033 | 4.823 ± 0.053 | 3.238 ± 0.066 | 2.359 ± 0.076 | 4.627 ± 0.065 | 1.06 ± 0.025 | 1.622 ± 0.031 | 0.0 ± 0.0
Glu | 6.337 ± 0.069 | 0.522 ± 0.017 | 3.164 ± 0.037 | 5.164 ± 0.068 | 2.296 ± 0.035 | 4.734 ± 0.061 | 1.56 ± 0.032 | 3.429 ± 0.05 | 2.832 ± 0.046 | 6.929 ± 0.062 | 1.527 ± 0.028 | 1.675 ± 0.027 | 3.423 ± 0.049 | 2.748 ± 0.038 | 5.577 ± 0.079 | 4.429 ± 0.05 | 3.842 ± 0.046 | 4.705 ± 0.052 | 0.827 ± 0.021 | 1.301 ± 0.028 | 0.0 ± 0.0
Phe | 3.692 ± 0.043 | 0.427 ± 0.016 | 2.784 ± 0.049 | 2.451 ± 0.037 | 1.524 ± 0.035 | 3.434 ± 0.048 | 0.931 ± 0.022 | 1.313 ± 0.026 | 0.876 ± 0.02 | 3.828 ± 0.051 | 0.633 ± 0.018 | 1.068 ± 0.026 | 1.841 ± 0.034 | 1.287 ± 0.027 | 2.547 ± 0.034 | 2.288 ± 0.039 | 1.984 ± 0.044 | 2.957 ± 0.037 | 0.571 ± 0.018 | 0.862 ± 0.021 | 0.0 ± 0.0
Gly | 6.382 ± 0.058 | 1.056 ± 0.029 | 4.958 ± 0.107 | 5.169 ± 0.052 | 3.117 ± 0.041 | 6.568 ± 0.141 | 1.772 ± 0.031 | 4.219 ± 0.05 | 3.62 ± 0.056 | 7.497 ± 0.071 | 1.987 ± 0.035 | 2.708 ± 0.099 | 3.498 ± 0.037 | 2.8 ± 0.058 | 5.495 ± 0.061 | 5.188 ± 0.067 | 4.737 ± 0.061 | 5.658 ± 0.057 | 1.353 ± 0.026 | 2.162 ± 0.048 | 0.0 ± 0.0
His | 2.056 ± 0.038 | 0.302 ± 0.012 | 1.423 ± 0.028 | 1.452 ± 0.028 | 0.956 ± 0.022 | 1.951 ± 0.034 | 0.817 ± 0.022 | 0.716 ± 0.02 | 0.521 ± 0.016 | 2.487 ± 0.038 | 0.297 ± 0.011 | 0.526 ± 0.018 | 1.59 ± 0.031 | 0.943 ± 0.024 | 1.97 ± 0.031 | 1.123 ± 0.023 | 0.83 ± 0.019 | 1.649 ± 0.034 | 0.46 ± 0.016 | 0.666 ± 0.021 | 0.0 ± 0.0
Ile | 5.564 ± 0.057 | 0.535 ± 0.017 | 3.92 ± 0.065 | 3.678 ± 0.049 | 1.59 ± 0.031 | 4.129 ± 0.048 | 1.13 ± 0.022 | 1.901 ± 0.037 | 1.496 ± 0.03 | 4.235 ± 0.053 | 0.683 ± 0.02 | 1.487 ± 0.03 | 2.538 ± 0.033 | 1.297 ± 0.027 | 3.091 ± 0.039 | 2.598 ± 0.038 | 2.399 ± 0.044 | 4.115 ± 0.049 | 0.554 ± 0.017 | 1.062 ± 0.024 | 0.0 ± 0.0
Lys | 3.224 ± 0.048 | 0.258 ± 0.013 | 1.824 ± 0.034 | 2.633 ± 0.043 | 0.941 ± 0.023 | 2.742 ± 0.044 | 0.748 ± 0.021 | 1.658 ± 0.03 | 1.759 ± 0.045 | 3.381 ± 0.045 | 0.78 ± 0.021 | 0.931 ± 0.02 | 2.277 ± 0.038 | 1.386 ± 0.032 | 3.045 ± 0.046 | 2.249 ± 0.038 | 2.14 ± 0.03 | 2.414 ± 0.039 | 0.439 ± 0.015 | 0.774 ± 0.021 | 0.0 ± 0.0
Leu | 11.413 ± 0.101 | 1.033 ± 0.026 | 6.234 ± 0.059 | 6.682 ± 0.074 | 3.691 ± 0.049 | 8.314 ± 0.069 | 2.046 ± 0.034 | 4.296 ± 0.06 | 3.296 ± 0.049 | 10.882 ± 0.101 | 1.857 ± 0.031 | 2.447 ± 0.033 | 5.639 ± 0.061 | 2.794 ± 0.039 | 7.187 ± 0.077 | 6.323 ± 0.056 | 5.48 ± 0.065 | 7.983 ± 0.076 | 1.373 ± 0.029 | 2.124 ± 0.035 | 0.0 ± 0.0
Met | 2.126 ± 0.033 | 0.177 ± 0.012 | 0.911 ± 0.021 | 1.032 ± 0.025 | 0.571 ± 0.018 | 1.625 ± 0.037 | 0.376 ± 0.013 | 1.181 ± 0.028 | 0.867 ± 0.022 | 2.182 ± 0.039 | 0.532 ± 0.015 | 0.699 ± 0.019 | 1.304 ± 0.029 | 0.606 ± 0.018 | 1.482 ± 0.027 | 1.448 ± 0.029 | 1.551 ± 0.027 | 1.502 ± 0.03 | 0.191 ± 0.01 | 0.31 ± 0.012 | 0.0 ± 0.0
Asn | 2.643 ± 0.034 | 0.259 ± 0.012 | 1.952 ± 0.085 | 1.737 ± 0.03 | 0.999 ± 0.024 | 2.455 ± 0.051 | 0.66 ± 0.017 | 1.258 ± 0.026 | 0.835 ± 0.023 | 2.752 ± 0.037 | 0.5 ± 0.017 | 0.924 ± 0.031 | 1.889 ± 0.031 | 1.026 ± 0.025 | 2.002 ± 0.035 | 1.566 ± 0.035 | 1.299 ± 0.03 | 2.19 ± 0.039 | 0.445 ± 0.015 | 0.795 ± 0.023 | 0.0 ± 0.0
Pro | 4.755 ± 0.061 | 0.426 ± 0.017 | 3.491 ± 0.051 | 4.319 ± 0.05 | 2.056 ± 0.037 | 4.179 ± 0.057 | 1.121 ± 0.026 | 2.756 ± 0.044 | 2.404 ± 0.04 | 4.732 ± 0.054 | 1.164 ± 0.026 | 1.839 ± 0.035 | 3.224 ± 0.084 | 1.431 ± 0.027 | 3.347 ± 0.046 | 4.026 ± 0.054 | 3.579 ± 0.048 | 3.844 ± 0.053 | 0.837 ± 0.022 | 1.211 ± 0.028 | 0.0 ± 0.0
Gln | 3.455 ± 0.048 | 0.359 ± 0.015 | 1.349 ± 0.031 | 2.141 ± 0.039 | 1.269 ± 0.024 | 2.446 ± 0.04 | 0.749 ± 0.02 | 1.652 ± 0.03 | 1.085 ± 0.023 | 3.691 ± 0.043 | 0.734 ± 0.017 | 0.764 ± 0.021 | 1.863 ± 0.051 | 1.709 ± 0.066 | 3.083 ± 0.046 | 2.038 ± 0.035 | 1.761 ± 0.032 | 2.293 ± 0.033 | 0.559 ± 0.017 | 0.696 ± 0.019 | 0.0 ± 0.0
Arg | 5.721 ± 0.063 | 0.862 ± 0.025 | 4.39 ± 0.052 | 5.114 ± 0.063 | 3.125 ± 0.041 | 4.824 ± 0.061 | 1.949 ± 0.038 | 3.508 ± 0.04 | 2.51 ± 0.039 | 8.251 ± 0.086 | 1.686 ± 0.03 | 1.761 ± 0.032 | 3.628 ± 0.044 | 2.956 ± 0.047 | 6.863 ± 0.078 | 4.472 ± 0.048 | 3.641 ± 0.049 | 5.042 ± 0.051 | 1.416 ± 0.028 | 1.989 ± 0.03 | 0.0 ± 0.0
Ser | 5.326 ± 0.053 | 0.659 ± 0.02 | 3.979 ± 0.055 | 3.676 ± 0.039 | 2.514 ± 0.038 | 5.35 ± 0.066 | 1.323 ± 0.029 | 3.1 ± 0.048 | 2.192 ± 0.034 | 6.801 ± 0.065 | 1.404 ± 0.026 | 1.809 ± 0.033 | 3.776 ± 0.054 | 2.081 ± 0.032 | 4.317 ± 0.05 | 4.632 ± 0.06 | 3.604 ± 0.046 | 4.497 ± 0.06 | 0.974 ± 0.03 | 1.441 ± 0.028 | 0.0 ± 0.0
Thr | 4.828 ± 0.055 | 0.559 ± 0.019 | 2.992 ± 0.056 | 2.952 ± 0.044 | 2.286 ± 0.043 | 4.52 ± 0.056 | 1.127 ± 0.026 | 3.547 ± 0.064 | 1.998 ± 0.036 | 5.87 ± 0.063 | 1.14 ± 0.023 | 1.667 ± 0.037 | 3.56 ± 0.05 | 1.469 ± 0.029 | 3.262 ± 0.041 | 3.775 ± 0.055 | 3.5 ± 0.061 | 3.969 ± 0.063 | 0.808 ± 0.022 | 1.382 ± 0.035 | 0.0 ± 0.0
Val | 7.763 ± 0.077 | 0.784 ± 0.024 | 5.016 ± 0.06 | 4.887 ± 0.052 | 2.565 ± 0.039 | 6.022 ± 0.065 | 1.516 ± 0.028 | 3.687 ± 0.049 | 2.138 ± 0.04 | 7.098 ± 0.061 | 1.463 ± 0.03 | 2.078 ± 0.035 | 3.835 ± 0.044 | 1.912 ± 0.028 | 5.074 ± 0.058 | 4.512 ± 0.066 | 4.377 ± 0.068 | 6.261 ± 0.062 | 0.93 ± 0.024 | 1.601 ± 0.032 | 0.0 ± 0.0
Trp | 1.206 ± 0.026 | 0.197 ± 0.01 | 0.822 ± 0.023 | 0.785 ± 0.02 | 0.601 ± 0.018 | 0.998 ± 0.025 | 0.416 ± 0.013 | 0.805 ± 0.021 | 0.664 ± 0.017 | 1.73 ± 0.035 | 0.397 ± 0.014 | 0.54 ± 0.018 | 0.714 ± 0.022 | 0.568 ± 0.014 | 1.179 ± 0.026 | 1.073 ± 0.023 | 0.828 ± 0.022 | 0.895 ± 0.021 | 0.296 ± 0.012 | 0.369 ± 0.016 | 0.0 ± 0.0
Tyr | 2.015 ± 0.033 | 0.303 ± 0.013 | 1.593 ± 0.04 | 1.432 ± 0.026 | 1.045 ± 0.025 | 1.916 ± 0.044 | 0.65 ± 0.017 | 0.816 ± 0.019 | 0.567 ± 0.02 | 2.624 ± 0.04 | 0.384 ± 0.014 | 0.606 ± 0.02 | 1.186 ± 0.027 | 0.988 ± 0.02 | 2.103 ± 0.033 | 1.301 ± 0.028 | 1.101 ± 0.031 | 1.759 ± 0.029 | 0.438 ± 0.014 | 0.737 ± 0.023 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 5710 proteins (2113643 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
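A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting values is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions for illustration; the page only states that bootstrapping with 100 resamples at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille, overlapping pairs."""
    pairs = Counter()
    for seq in proteins:
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(pairs.values())
    return 1000.0 * pairs[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the sample standard deviation of the
    frequency across the resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAC", "ACCA", "AAAA", "CAC"]  # toy input, not real proteome data
    print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides within each protein together, so the estimate reflects variation between proteins.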

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski