Amino acid dipeptide frequency for Polynucleobacter sp. VK13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
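As a minimal sketch of how such a frequency could be computed (this is not the Proteome-pI code; the function name `dipeptide_frequencies` and the toy sequences are illustrative assumptions), overlapping residue pairs can be counted within each protein and divided by the total number of pairs. The sketch uses one-letter codes, whereas the table below uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: overlapping pairs are counted within each protein
    (never across protein boundaries) and normalised by the total pair count.
    The normalisation is an assumption, not taken from the source.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made up for illustration): 8 pairs in total, "LL" occurs twice.
toy = ["MKLLA", "MALLK"]
freqs = dipeptide_frequencies(toy)
print(freqs["LL"])  # 250.0
```

Counting within each protein separately avoids creating artificial pairs from the last residue of one protein and the first residue of the next.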

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
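The uniform expectation and the per-mille scaling described above can be sketched as follows. The function names are hypothetical, and the page does not state whether a tolerance is applied before a cell is coloured, so a plain comparison against the expected value is assumed here. The three example values are copied from the table below.

```python
def expected_permille(n_residue_types=20):
    """Expected frequency (in per mille) of each dipeptide if all
    n_residue_types**2 ordered pairs were equally likely."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected=None):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple greater-than / less-than comparison, with no
    tolerance band around the expected value.
    """
    if expected is None:
        expected = expected_permille()
    if observed_permille > expected:
        return "over"   # marked in red on the page
    if observed_permille < expected:
        return "under"  # marked in blue on the page
    return "as expected"


# Values in per mille, taken from the table for this proteome.
observed = {"LeuLeu": 9.926, "AlaAla": 7.354, "CysCys": 0.116}
labels = {pair: classify(value) for pair, value in observed.items()}

# Converting a per-mille value to a plain fraction: multiply by 1e-3.
leu_leu_fraction = observed["LeuLeu"] * 1e-3
```

With 20 residue types the expected value is 2.5‰, so LeuLeu and AlaAla come out as overrepresented and CysCys as underrepresented.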

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.354 ± 0.151
AlaCys: 0.931 ± 0.032
AlaAsp: 4.022 ± 0.07
AlaGlu: 4.286 ± 0.089
AlaPhe: 3.333 ± 0.063
AlaGly: 6.514 ± 0.108
AlaHis: 1.698 ± 0.052
AlaIle: 6.412 ± 0.101
AlaLys: 5.749 ± 0.098
AlaLeu: 8.958 ± 0.122
AlaMet: 2.517 ± 0.051
AlaAsn: 4.083 ± 0.093
AlaPro: 3.405 ± 0.066
AlaGln: 3.877 ± 0.081
AlaArg: 3.639 ± 0.072
AlaSer: 5.352 ± 0.089
AlaThr: 4.513 ± 0.108
AlaVal: 5.264 ± 0.089
AlaTrp: 1.041 ± 0.04
AlaTyr: 2.337 ± 0.052
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.799 ± 0.032
CysCys: 0.116 ± 0.013
CysAsp: 0.473 ± 0.023
CysGlu: 0.455 ± 0.024
CysPhe: 0.455 ± 0.019
CysGly: 0.889 ± 0.038
CysHis: 0.288 ± 0.018
CysIle: 0.602 ± 0.029
CysLys: 0.348 ± 0.02
CysLeu: 0.934 ± 0.034
CysMet: 0.205 ± 0.016
CysAsn: 0.317 ± 0.019
CysPro: 0.457 ± 0.027
CysGln: 0.373 ± 0.019
CysArg: 0.36 ± 0.02
CysSer: 0.638 ± 0.032
CysThr: 0.49 ± 0.026
CysVal: 0.609 ± 0.027
CysTrp: 0.09 ± 0.011
CysTyr: 0.262 ± 0.017
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.198 ± 0.076
AspCys: 0.475 ± 0.024
AspAsp: 2.189 ± 0.053
AspGlu: 2.946 ± 0.062
AspPhe: 2.624 ± 0.053
AspGly: 3.503 ± 0.08
AspHis: 1.173 ± 0.04
AspIle: 3.449 ± 0.06
AspLys: 2.126 ± 0.055
AspLeu: 5.543 ± 0.09
AspMet: 1.189 ± 0.038
AspAsn: 1.525 ± 0.052
AspPro: 2.594 ± 0.055
AspGln: 2.295 ± 0.05
AspArg: 2.267 ± 0.061
AspSer: 2.725 ± 0.056
AspThr: 2.465 ± 0.079
AspVal: 3.493 ± 0.069
AspTrp: 0.833 ± 0.027
AspTyr: 1.569 ± 0.041
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.521 ± 0.085
GluCys: 0.422 ± 0.02
GluAsp: 2.567 ± 0.061
GluGlu: 3.253 ± 0.077
GluPhe: 2.382 ± 0.056
GluGly: 3.129 ± 0.064
GluHis: 1.243 ± 0.04
GluIle: 4.663 ± 0.075
GluLys: 3.955 ± 0.076
GluLeu: 6.281 ± 0.102
GluMet: 1.721 ± 0.05
GluAsn: 2.674 ± 0.061
GluPro: 1.868 ± 0.048
GluGln: 2.552 ± 0.059
GluArg: 2.587 ± 0.057
GluSer: 3.352 ± 0.068
GluThr: 2.643 ± 0.05
GluVal: 4.167 ± 0.08
GluTrp: 0.743 ± 0.029
GluTyr: 1.592 ± 0.045
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.723 ± 0.068
PheCys: 0.526 ± 0.024
PheAsp: 2.531 ± 0.057
PheGlu: 2.462 ± 0.057
PhePhe: 1.953 ± 0.061
PheGly: 3.446 ± 0.079
PheHis: 0.85 ± 0.028
PheIle: 2.968 ± 0.065
PheLys: 2.323 ± 0.053
PheLeu: 4.103 ± 0.085
PheMet: 0.97 ± 0.034
PheAsn: 2.07 ± 0.049
PhePro: 1.784 ± 0.044
PheGln: 1.461 ± 0.041
PheArg: 1.52 ± 0.042
PheSer: 3.138 ± 0.071
PheThr: 2.357 ± 0.059
PheVal: 2.759 ± 0.062
PheTrp: 0.554 ± 0.026
PheTyr: 1.314 ± 0.038
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.777 ± 0.11
GlyCys: 0.729 ± 0.033
GlyAsp: 3.147 ± 0.066
GlyGlu: 3.526 ± 0.079
GlyPhe: 3.478 ± 0.077
GlyGly: 5.72 ± 0.12
GlyHis: 1.618 ± 0.044
GlyIle: 5.323 ± 0.083
GlyLys: 4.502 ± 0.084
GlyLeu: 7.442 ± 0.093
GlyMet: 2.031 ± 0.053
GlyAsn: 3.007 ± 0.1
GlyPro: 2.404 ± 0.056
GlyGln: 2.914 ± 0.058
GlyArg: 3.036 ± 0.07
GlySer: 4.802 ± 0.099
GlyThr: 4.078 ± 0.113
GlyVal: 5.587 ± 0.084
GlyTrp: 1.023 ± 0.037
GlyTyr: 2.513 ± 0.062
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.808 ± 0.047
HisCys: 0.281 ± 0.016
HisAsp: 0.947 ± 0.034
HisGlu: 1.084 ± 0.034
HisPhe: 1.111 ± 0.036
HisGly: 1.607 ± 0.044
HisHis: 0.776 ± 0.032
HisIle: 1.439 ± 0.048
HisLys: 0.702 ± 0.027
HisLeu: 2.698 ± 0.053
HisMet: 0.492 ± 0.026
HisAsn: 0.586 ± 0.028
HisPro: 1.435 ± 0.043
HisGln: 1.284 ± 0.043
HisArg: 0.974 ± 0.032
HisSer: 1.257 ± 0.037
HisThr: 0.98 ± 0.034
HisVal: 1.344 ± 0.04
HisTrp: 0.384 ± 0.017
HisTyr: 0.684 ± 0.03
HisXaa: 0.0 ± 0.0

Ile
IleAla: 7.07 ± 0.111
IleCys: 0.715 ± 0.028
IleAsp: 4.074 ± 0.068
IleGlu: 4.639 ± 0.08
IlePhe: 2.838 ± 0.067
IleGly: 5.421 ± 0.083
IleHis: 1.55 ± 0.038
IleIle: 4.078 ± 0.081
IleLys: 3.83 ± 0.066
IleLeu: 6.423 ± 0.105
IleMet: 1.327 ± 0.045
IleAsn: 3.285 ± 0.077
IlePro: 3.513 ± 0.066
IleGln: 3.073 ± 0.055
IleArg: 3.029 ± 0.058
IleSer: 5.096 ± 0.106
IleThr: 4.082 ± 0.14
IleVal: 4.513 ± 0.07
IleTrp: 0.785 ± 0.028
IleTyr: 1.822 ± 0.048
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.637 ± 0.085
LysCys: 0.258 ± 0.017
LysAsp: 3.107 ± 0.073
LysGlu: 3.584 ± 0.064
LysPhe: 1.944 ± 0.046
LysGly: 3.304 ± 0.068
LysHis: 1.033 ± 0.039
LysIle: 4.304 ± 0.071
LysLys: 3.89 ± 0.077
LysLeu: 5.448 ± 0.088
LysMet: 1.725 ± 0.048
LysAsn: 3.298 ± 0.076
LysPro: 2.741 ± 0.079
LysGln: 2.131 ± 0.052
LysArg: 2.464 ± 0.056
LysSer: 3.851 ± 0.072
LysThr: 3.289 ± 0.053
LysVal: 4.116 ± 0.074
LysTrp: 0.644 ± 0.028
LysTyr: 1.589 ± 0.046
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.464 ± 0.122
LeuCys: 0.969 ± 0.031
LeuAsp: 5.139 ± 0.081
LeuGlu: 5.988 ± 0.099
LeuPhe: 4.074 ± 0.082
LeuGly: 7.674 ± 0.116
LeuHis: 2.041 ± 0.052
LeuIle: 7.19 ± 0.108
LeuLys: 6.196 ± 0.098
LeuLeu: 9.926 ± 0.138
LeuMet: 2.72 ± 0.06
LeuAsn: 4.759 ± 0.083
LeuPro: 5.164 ± 0.092
LeuGln: 3.913 ± 0.07
LeuArg: 4.386 ± 0.084
LeuSer: 7.423 ± 0.108
LeuThr: 5.518 ± 0.109
LeuVal: 6.556 ± 0.098
LeuTrp: 1.085 ± 0.041
LeuTyr: 2.22 ± 0.051
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.374 ± 0.06
MetCys: 0.182 ± 0.014
MetAsp: 1.122 ± 0.04
MetGlu: 1.241 ± 0.042
MetPhe: 0.818 ± 0.032
MetGly: 1.991 ± 0.053
MetHis: 0.55 ± 0.025
MetIle: 1.791 ± 0.049
MetLys: 1.626 ± 0.046
MetLeu: 2.358 ± 0.052
MetMet: 0.878 ± 0.034
MetAsn: 1.38 ± 0.039
MetPro: 1.368 ± 0.043
MetGln: 1.29 ± 0.038
MetArg: 1.296 ± 0.044
MetSer: 1.952 ± 0.045
MetThr: 1.682 ± 0.046
MetVal: 1.645 ± 0.044
MetTrp: 0.226 ± 0.016
MetTyr: 0.505 ± 0.024
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.745 ± 0.078
AsnCys: 0.392 ± 0.021
AsnAsp: 2.05 ± 0.059
AsnGlu: 2.253 ± 0.058
AsnPhe: 1.935 ± 0.046
AsnGly: 3.123 ± 0.095
AsnHis: 1.073 ± 0.038
AsnIle: 3.059 ± 0.08
AsnLys: 2.343 ± 0.053
AsnLeu: 4.799 ± 0.09
AsnMet: 1.039 ± 0.031
AsnAsn: 2.05 ± 0.081
AsnPro: 2.88 ± 0.056
AsnGln: 2.416 ± 0.063
AsnArg: 2.021 ± 0.046
AsnSer: 2.82 ± 0.084
AsnThr: 2.492 ± 0.07
AsnVal: 2.506 ± 0.066
AsnTrp: 0.7 ± 0.029
AsnTyr: 1.39 ± 0.044
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.49 ± 0.074
ProCys: 0.315 ± 0.019
ProAsp: 2.437 ± 0.06
ProGlu: 3.365 ± 0.064
ProPhe: 1.969 ± 0.046
ProGly: 3.252 ± 0.065
ProHis: 1.009 ± 0.033
ProIle: 3.451 ± 0.06
ProLys: 2.75 ± 0.059
ProLeu: 4.349 ± 0.062
ProMet: 1.203 ± 0.04
ProAsn: 2.434 ± 0.064
ProPro: 1.637 ± 0.055
ProGln: 1.714 ± 0.043
ProArg: 1.519 ± 0.043
ProSer: 3.089 ± 0.066
ProThr: 2.768 ± 0.051
ProVal: 3.203 ± 0.066
ProTrp: 0.57 ± 0.026
ProTyr: 1.534 ± 0.045
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.994 ± 0.08
GlnCys: 0.316 ± 0.02
GlnAsp: 1.827 ± 0.045
GlnGlu: 2.415 ± 0.066
GlnPhe: 1.868 ± 0.046
GlnGly: 2.603 ± 0.066
GlnHis: 0.919 ± 0.033
GlnIle: 3.293 ± 0.061
GlnLys: 2.447 ± 0.057
GlnLeu: 4.636 ± 0.084
GlnMet: 1.239 ± 0.035
GlnAsn: 1.741 ± 0.056
GlnPro: 1.561 ± 0.042
GlnGln: 1.908 ± 0.056
GlnArg: 1.841 ± 0.05
GlnSer: 2.953 ± 0.064
GlnThr: 2.417 ± 0.055
GlnVal: 3.107 ± 0.061
GlnTrp: 0.561 ± 0.026
GlnTyr: 1.147 ± 0.037
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.414 ± 0.076
ArgCys: 0.365 ± 0.021
ArgAsp: 2.142 ± 0.051
ArgGlu: 2.477 ± 0.058
ArgPhe: 2.022 ± 0.044
ArgGly: 2.679 ± 0.067
ArgHis: 0.952 ± 0.033
ArgIle: 3.066 ± 0.068
ArgLys: 2.33 ± 0.054
ArgLeu: 4.637 ± 0.076
ArgMet: 1.217 ± 0.035
ArgAsn: 1.83 ± 0.047
ArgPro: 1.827 ± 0.044
ArgGln: 1.788 ± 0.055
ArgArg: 1.998 ± 0.056
ArgSer: 2.573 ± 0.051
ArgThr: 2.054 ± 0.056
ArgVal: 2.977 ± 0.065
ArgTrp: 0.602 ± 0.027
ArgTyr: 1.404 ± 0.044
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.447 ± 0.091
SerCys: 0.585 ± 0.026
SerAsp: 3.089 ± 0.072
SerGlu: 3.657 ± 0.074
SerPhe: 2.972 ± 0.074
SerGly: 5.348 ± 0.098
SerHis: 1.474 ± 0.04
SerIle: 4.752 ± 0.088
SerLys: 3.759 ± 0.069
SerLeu: 6.654 ± 0.112
SerMet: 1.755 ± 0.048
SerAsn: 3.133 ± 0.088
SerPro: 3.078 ± 0.071
SerGln: 2.839 ± 0.056
SerArg: 2.595 ± 0.056
SerSer: 4.723 ± 0.115
SerThr: 3.616 ± 0.086
SerVal: 4.213 ± 0.076
SerTrp: 0.822 ± 0.035
SerTyr: 1.96 ± 0.052
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.389 ± 0.125
ThrCys: 0.462 ± 0.024
ThrAsp: 2.59 ± 0.062
ThrGlu: 2.603 ± 0.057
ThrPhe: 2.224 ± 0.044
ThrGly: 4.49 ± 0.128
ThrHis: 1.318 ± 0.04
ThrIle: 3.942 ± 0.078
ThrLys: 2.81 ± 0.059
ThrLeu: 5.765 ± 0.098
ThrMet: 1.291 ± 0.037
ThrAsn: 2.532 ± 0.089
ThrPro: 3.206 ± 0.06
ThrGln: 2.236 ± 0.05
ThrArg: 2.058 ± 0.049
ThrSer: 3.691 ± 0.117
ThrThr: 3.143 ± 0.107
ThrVal: 3.582 ± 0.091
ThrTrp: 0.627 ± 0.026
ThrTyr: 1.564 ± 0.061
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.927 ± 0.102
ValCys: 0.663 ± 0.029
ValAsp: 3.688 ± 0.066
ValGlu: 3.969 ± 0.082
ValPhe: 2.83 ± 0.063
ValGly: 4.881 ± 0.073
ValHis: 1.345 ± 0.039
ValIle: 5.025 ± 0.078
ValLys: 3.817 ± 0.065
ValLeu: 6.558 ± 0.098
ValMet: 1.904 ± 0.052
ValAsn: 2.954 ± 0.072
ValPro: 3.098 ± 0.064
ValGln: 2.4 ± 0.066
ValArg: 2.714 ± 0.058
ValSer: 4.288 ± 0.071
ValThr: 3.85 ± 0.097
ValVal: 4.991 ± 0.084
ValTrp: 0.808 ± 0.031
ValTyr: 1.628 ± 0.045
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.797 ± 0.035
TrpCys: 0.124 ± 0.012
TrpAsp: 0.579 ± 0.028
TrpGlu: 0.681 ± 0.029
TrpPhe: 0.588 ± 0.029
TrpGly: 0.838 ± 0.033
TrpHis: 0.37 ± 0.017
TrpIle: 0.885 ± 0.032
TrpLys: 0.588 ± 0.025
TrpLeu: 1.584 ± 0.048
TrpMet: 0.387 ± 0.019
TrpAsn: 0.543 ± 0.026
TrpPro: 0.52 ± 0.026
TrpGln: 0.744 ± 0.028
TrpArg: 0.654 ± 0.026
TrpSer: 0.78 ± 0.034
TrpThr: 0.633 ± 0.025
TrpVal: 0.86 ± 0.03
TrpTrp: 0.18 ± 0.016
TrpTyr: 0.361 ± 0.021
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.358 ± 0.055
TyrCys: 0.334 ± 0.02
TyrAsp: 1.463 ± 0.039
TyrGlu: 1.433 ± 0.041
TyrPhe: 1.395 ± 0.042
TyrGly: 2.245 ± 0.057
TyrHis: 0.597 ± 0.027
TyrIle: 1.472 ± 0.048
TyrLys: 1.268 ± 0.041
TyrLeu: 3.306 ± 0.068
TyrMet: 0.465 ± 0.026
TyrAsn: 0.881 ± 0.039
TyrPro: 1.517 ± 0.044
TyrGln: 1.583 ± 0.046
TyrArg: 1.399 ± 0.046
TyrSer: 1.965 ± 0.056
TyrThr: 1.451 ± 0.049
TyrVal: 1.846 ± 0.047
TyrTrp: 0.417 ± 0.023
TyrTyr: 0.806 ± 0.035
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2757 proteins (888343 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
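A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under assumptions, not the Proteome-pI implementation: the function names are hypothetical, and reporting the standard deviation of the resampled estimates as the error is a guess at what the ± values represent.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the frequency (per mille) of `pair` and its bootstrap error.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together. Returns (mean, standard deviation)
    over the n_boot resampled frequency estimates.
    """
    rng = random.Random(seed)
    # Precompute per-protein (hits, total pairs) once.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy sequences, made up for illustration only.
toy_proteome = ["MKLLA", "MALLK", "GGLLG", "MSTAV"]
freq, err = bootstrap_error(toy_proteome, "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between pairs from the same protein, which is presumably why the note specifies the protein level.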

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski