Amino acid dipepetide frequency for Verrucomicrobiales bacterium VVV1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.055AlaAla: 13.055 ± 0.135
1.072AlaCys: 1.072 ± 0.03
5.614AlaAsp: 5.614 ± 0.069
6.123AlaGlu: 6.123 ± 0.09
4.01AlaPhe: 4.01 ± 0.061
9.47AlaGly: 9.47 ± 0.112
1.712AlaHis: 1.712 ± 0.039
5.168AlaIle: 5.168 ± 0.066
4.831AlaLys: 4.831 ± 0.082
10.398AlaLeu: 10.398 ± 0.108
2.559AlaMet: 2.559 ± 0.046
3.498AlaAsn: 3.498 ± 0.061
4.963AlaPro: 4.963 ± 0.066
2.893AlaGln: 2.893 ± 0.051
5.816AlaArg: 5.816 ± 0.089
6.855AlaSer: 6.855 ± 0.086
6.56AlaThr: 6.56 ± 0.103
6.998AlaVal: 6.998 ± 0.072
1.681AlaTrp: 1.681 ± 0.038
2.149AlaTyr: 2.149 ± 0.039
0.001AlaXaa: 0.001 ± 0.001
Cys
0.857CysAla: 0.857 ± 0.028
0.138CysCys: 0.138 ± 0.011
0.515CysAsp: 0.515 ± 0.02
0.535CysGlu: 0.535 ± 0.02
0.417CysPhe: 0.417 ± 0.018
0.96CysGly: 0.96 ± 0.03
0.355CysHis: 0.355 ± 0.02
0.427CysIle: 0.427 ± 0.017
0.277CysLys: 0.277 ± 0.016
0.974CysLeu: 0.974 ± 0.031
0.169CysMet: 0.169 ± 0.012
0.229CysAsn: 0.229 ± 0.011
0.493CysPro: 0.493 ± 0.017
0.285CysGln: 0.285 ± 0.014
0.637CysArg: 0.637 ± 0.019
0.589CysSer: 0.589 ± 0.022
0.447CysThr: 0.447 ± 0.017
0.583CysVal: 0.583 ± 0.021
0.142CysTrp: 0.142 ± 0.01
0.238CysTyr: 0.238 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.422AspAla: 5.422 ± 0.077
0.492AspCys: 0.492 ± 0.02
2.687AspAsp: 2.687 ± 0.049
3.243AspGlu: 3.243 ± 0.051
2.537AspPhe: 2.537 ± 0.039
5.247AspGly: 5.247 ± 0.07
1.285AspHis: 1.285 ± 0.034
2.133AspIle: 2.133 ± 0.041
2.003AspLys: 2.003 ± 0.036
5.719AspLeu: 5.719 ± 0.072
0.829AspMet: 0.829 ± 0.021
1.372AspAsn: 1.372 ± 0.035
3.651AspPro: 3.651 ± 0.057
1.845AspGln: 1.845 ± 0.037
3.499AspArg: 3.499 ± 0.054
3.057AspSer: 3.057 ± 0.045
2.526AspThr: 2.526 ± 0.038
3.302AspVal: 3.302 ± 0.052
1.086AspTrp: 1.086 ± 0.025
1.544AspTyr: 1.544 ± 0.032
0.0AspXaa: 0.0 ± 0.0
Glu
6.237GluAla: 6.237 ± 0.106
0.427GluCys: 0.427 ± 0.021
2.576GluAsp: 2.576 ± 0.041
3.374GluGlu: 3.374 ± 0.062
2.264GluPhe: 2.264 ± 0.038
4.179GluGly: 4.179 ± 0.065
1.137GluHis: 1.137 ± 0.033
3.437GluIle: 3.437 ± 0.059
3.321GluLys: 3.321 ± 0.057
5.88GluLeu: 5.88 ± 0.073
1.348GluMet: 1.348 ± 0.032
1.867GluAsn: 1.867 ± 0.034
2.581GluPro: 2.581 ± 0.05
1.909GluGln: 1.909 ± 0.043
3.799GluArg: 3.799 ± 0.063
3.493GluSer: 3.493 ± 0.057
3.352GluThr: 3.352 ± 0.052
3.985GluVal: 3.985 ± 0.064
0.923GluTrp: 0.923 ± 0.027
1.044GluTyr: 1.044 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
3.843PheAla: 3.843 ± 0.052
0.428PheCys: 0.428 ± 0.015
2.484PheAsp: 2.484 ± 0.035
2.329PheGlu: 2.329 ± 0.043
1.641PhePhe: 1.641 ± 0.037
3.44PheGly: 3.44 ± 0.051
0.974PheHis: 0.974 ± 0.027
1.864PheIle: 1.864 ± 0.038
1.444PheLys: 1.444 ± 0.033
3.798PheLeu: 3.798 ± 0.06
0.748PheMet: 0.748 ± 0.023
1.459PheAsn: 1.459 ± 0.036
1.993PhePro: 1.993 ± 0.041
1.272PheGln: 1.272 ± 0.032
2.418PheArg: 2.418 ± 0.046
2.992PheSer: 2.992 ± 0.05
2.823PheThr: 2.823 ± 0.051
2.525PheVal: 2.525 ± 0.045
0.587PheTrp: 0.587 ± 0.021
0.981PheTyr: 0.981 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
7.885GlyAla: 7.885 ± 0.124
0.832GlyCys: 0.832 ± 0.026
4.439GlyAsp: 4.439 ± 0.062
4.484GlyGlu: 4.484 ± 0.062
3.6GlyPhe: 3.6 ± 0.047
7.914GlyGly: 7.914 ± 0.149
1.755GlyHis: 1.755 ± 0.038
4.347GlyIle: 4.347 ± 0.06
4.589GlyLys: 4.589 ± 0.068
7.964GlyLeu: 7.964 ± 0.072
1.849GlyMet: 1.849 ± 0.037
3.335GlyAsn: 3.335 ± 0.075
3.218GlyPro: 3.218 ± 0.052
2.628GlyGln: 2.628 ± 0.044
4.817GlyArg: 4.817 ± 0.069
6.432GlySer: 6.432 ± 0.119
6.658GlyThr: 6.658 ± 0.174
5.638GlyVal: 5.638 ± 0.067
1.52GlyTrp: 1.52 ± 0.032
2.279GlyTyr: 2.279 ± 0.049
0.001GlyXaa: 0.001 ± 0.001
His
2.097HisAla: 2.097 ± 0.042
0.255HisCys: 0.255 ± 0.014
1.161HisAsp: 1.161 ± 0.028
1.196HisGlu: 1.196 ± 0.033
0.983HisPhe: 0.983 ± 0.028
1.856HisGly: 1.856 ± 0.038
0.651HisHis: 0.651 ± 0.025
0.76HisIle: 0.76 ± 0.026
0.543HisLys: 0.543 ± 0.018
2.237HisLeu: 2.237 ± 0.045
0.322HisMet: 0.322 ± 0.017
0.501HisAsn: 0.501 ± 0.018
1.462HisPro: 1.462 ± 0.032
0.696HisGln: 0.696 ± 0.022
1.431HisArg: 1.431 ± 0.034
1.155HisSer: 1.155 ± 0.029
0.936HisThr: 0.936 ± 0.022
1.294HisVal: 1.294 ± 0.034
0.374HisTrp: 0.374 ± 0.018
0.601HisTyr: 0.601 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
5.518IleAla: 5.518 ± 0.061
0.514IleCys: 0.514 ± 0.019
3.033IleAsp: 3.033 ± 0.046
3.34IleGlu: 3.34 ± 0.056
1.607IlePhe: 1.607 ± 0.039
4.258IleGly: 4.258 ± 0.061
1.104IleHis: 1.104 ± 0.034
2.025IleIle: 2.025 ± 0.04
1.606IleLys: 1.606 ± 0.034
4.676IleLeu: 4.676 ± 0.069
0.651IleMet: 0.651 ± 0.022
1.592IleAsn: 1.592 ± 0.037
2.748IlePro: 2.748 ± 0.046
1.534IleGln: 1.534 ± 0.034
3.209IleArg: 3.209 ± 0.055
3.446IleSer: 3.446 ± 0.055
3.193IleThr: 3.193 ± 0.068
3.374IleVal: 3.374 ± 0.048
0.578IleTrp: 0.578 ± 0.022
1.05IleTyr: 1.05 ± 0.029
0.0IleXaa: 0.0 ± 0.0
Lys
4.505LysAla: 4.505 ± 0.067
0.272LysCys: 0.272 ± 0.014
2.577LysAsp: 2.577 ± 0.046
2.652LysGlu: 2.652 ± 0.048
1.422LysPhe: 1.422 ± 0.031
3.219LysGly: 3.219 ± 0.059
0.85LysHis: 0.85 ± 0.028
2.173LysIle: 2.173 ± 0.037
2.285LysLys: 2.285 ± 0.052
4.489LysLeu: 4.489 ± 0.062
0.975LysMet: 0.975 ± 0.025
1.354LysAsn: 1.354 ± 0.034
2.873LysPro: 2.873 ± 0.053
1.401LysGln: 1.401 ± 0.031
2.676LysArg: 2.676 ± 0.051
2.849LysSer: 2.849 ± 0.058
2.538LysThr: 2.538 ± 0.046
2.945LysVal: 2.945 ± 0.058
0.723LysTrp: 0.723 ± 0.021
0.744LysTyr: 0.744 ± 0.025
0.0LysXaa: 0.0 ± 0.0
Leu
11.662LeuAla: 11.662 ± 0.097
1.036LeuCys: 1.036 ± 0.031
5.638LeuAsp: 5.638 ± 0.059
5.63LeuGlu: 5.63 ± 0.076
3.494LeuPhe: 3.494 ± 0.049
7.926LeuGly: 7.926 ± 0.078
2.019LeuHis: 2.019 ± 0.041
4.683LeuIle: 4.683 ± 0.063
4.593LeuLys: 4.593 ± 0.067
9.815LeuLeu: 9.815 ± 0.123
2.033LeuMet: 2.033 ± 0.044
3.237LeuAsn: 3.237 ± 0.055
5.843LeuPro: 5.843 ± 0.071
2.938LeuGln: 2.938 ± 0.043
6.429LeuArg: 6.429 ± 0.072
6.998LeuSer: 6.998 ± 0.077
6.619LeuThr: 6.619 ± 0.095
6.704LeuVal: 6.704 ± 0.072
1.276LeuTrp: 1.276 ± 0.032
1.884LeuTyr: 1.884 ± 0.035
0.002LeuXaa: 0.002 ± 0.001
Met
2.172MetAla: 2.172 ± 0.039
0.12MetCys: 0.12 ± 0.009
0.997MetAsp: 0.997 ± 0.026
1.167MetGlu: 1.167 ± 0.029
0.672MetPhe: 0.672 ± 0.023
1.452MetGly: 1.452 ± 0.035
0.375MetHis: 0.375 ± 0.016
1.18MetIle: 1.18 ± 0.031
1.384MetLys: 1.384 ± 0.032
2.031MetLeu: 2.031 ± 0.044
0.483MetMet: 0.483 ± 0.02
0.881MetAsn: 0.881 ± 0.024
1.33MetPro: 1.33 ± 0.031
0.659MetGln: 0.659 ± 0.023
1.331MetArg: 1.331 ± 0.032
1.365MetSer: 1.365 ± 0.031
1.314MetThr: 1.314 ± 0.032
1.356MetVal: 1.356 ± 0.029
0.209MetTrp: 0.209 ± 0.012
0.238MetTyr: 0.238 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
3.43AsnAla: 3.43 ± 0.056
0.287AsnCys: 0.287 ± 0.015
1.616AsnAsp: 1.616 ± 0.034
1.554AsnGlu: 1.554 ± 0.034
1.339AsnPhe: 1.339 ± 0.031
3.608AsnGly: 3.608 ± 0.084
0.736AsnHis: 0.736 ± 0.023
1.382AsnIle: 1.382 ± 0.037
0.934AsnLys: 0.934 ± 0.03
3.471AsnLeu: 3.471 ± 0.053
0.461AsnMet: 0.461 ± 0.019
1.281AsnAsn: 1.281 ± 0.045
2.417AsnPro: 2.417 ± 0.047
1.067AsnGln: 1.067 ± 0.027
1.92AsnArg: 1.92 ± 0.038
2.22AsnSer: 2.22 ± 0.059
2.081AsnThr: 2.081 ± 0.069
2.216AsnVal: 2.216 ± 0.05
0.602AsnTrp: 0.602 ± 0.023
0.89AsnTyr: 0.89 ± 0.029
0.0AsnXaa: 0.0 ± 0.0
Pro
6.296ProAla: 6.296 ± 0.08
0.372ProCys: 0.372 ± 0.018
3.474ProAsp: 3.474 ± 0.053
4.036ProGlu: 4.036 ± 0.064
2.046ProPhe: 2.046 ± 0.039
4.761ProGly: 4.761 ± 0.055
1.003ProHis: 1.003 ± 0.032
2.321ProIle: 2.321 ± 0.04
2.357ProLys: 2.357 ± 0.046
4.837ProLeu: 4.837 ± 0.051
1.282ProMet: 1.282 ± 0.032
1.732ProAsn: 1.732 ± 0.04
2.837ProPro: 2.837 ± 0.061
1.597ProGln: 1.597 ± 0.036
2.452ProArg: 2.452 ± 0.043
3.394ProSer: 3.394 ± 0.053
3.063ProThr: 3.063 ± 0.046
4.143ProVal: 4.143 ± 0.061
0.839ProTrp: 0.839 ± 0.023
1.115ProTyr: 1.115 ± 0.028
0.001ProXaa: 0.001 ± 0.001
Gln
3.502GlnAla: 3.502 ± 0.058
0.257GlnCys: 0.257 ± 0.013
1.323GlnAsp: 1.323 ± 0.031
1.558GlnGlu: 1.558 ± 0.04
1.19GlnPhe: 1.19 ± 0.028
2.295GlnGly: 2.295 ± 0.041
0.627GlnHis: 0.627 ± 0.024
1.689GlnIle: 1.689 ± 0.033
1.406GlnLys: 1.406 ± 0.037
3.11GlnLeu: 3.11 ± 0.051
0.7GlnMet: 0.7 ± 0.022
1.002GlnAsn: 1.002 ± 0.029
1.852GlnPro: 1.852 ± 0.04
1.247GlnGln: 1.247 ± 0.035
2.162GlnArg: 2.162 ± 0.043
2.048GlnSer: 2.048 ± 0.043
1.889GlnThr: 1.889 ± 0.043
2.213GlnVal: 2.213 ± 0.038
0.55GlnTrp: 0.55 ± 0.019
0.597GlnTyr: 0.597 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
5.363ArgAla: 5.363 ± 0.065
0.553ArgCys: 0.553 ± 0.02
3.279ArgAsp: 3.279 ± 0.051
4.031ArgGlu: 4.031 ± 0.076
2.966ArgPhe: 2.966 ± 0.046
3.973ArgGly: 3.973 ± 0.057
1.431ArgHis: 1.431 ± 0.039
3.47ArgIle: 3.47 ± 0.05
2.652ArgLys: 2.652 ± 0.054
6.585ArgLeu: 6.585 ± 0.076
1.581ArgMet: 1.581 ± 0.035
1.903ArgAsn: 1.903 ± 0.033
2.922ArgPro: 2.922 ± 0.05
2.175ArgGln: 2.175 ± 0.048
4.248ArgArg: 4.248 ± 0.072
3.747ArgSer: 3.747 ± 0.05
3.088ArgThr: 3.088 ± 0.047
4.064ArgVal: 4.064 ± 0.061
1.113ArgTrp: 1.113 ± 0.027
1.576ArgTyr: 1.576 ± 0.032
0.002ArgXaa: 0.002 ± 0.001
Ser
6.74SerAla: 6.74 ± 0.084
0.625SerCys: 0.625 ± 0.023
3.404SerAsp: 3.404 ± 0.048
3.242SerGlu: 3.242 ± 0.049
2.697SerPhe: 2.697 ± 0.044
7.508SerGly: 7.508 ± 0.161
1.25SerHis: 1.25 ± 0.028
3.325SerIle: 3.325 ± 0.049
2.571SerLys: 2.571 ± 0.046
6.614SerLeu: 6.614 ± 0.079
1.355SerMet: 1.355 ± 0.03
2.314SerAsn: 2.314 ± 0.056
3.748SerPro: 3.748 ± 0.058
1.88SerGln: 1.88 ± 0.035
3.744SerArg: 3.744 ± 0.054
4.721SerSer: 4.721 ± 0.074
4.104SerThr: 4.104 ± 0.086
4.218SerVal: 4.218 ± 0.057
1.017SerTrp: 1.017 ± 0.027
1.534SerTyr: 1.534 ± 0.037
0.001SerXaa: 0.001 ± 0.001
Thr
6.517ThrAla: 6.517 ± 0.102
0.457ThrCys: 0.457 ± 0.017
3.009ThrAsp: 3.009 ± 0.048
2.669ThrGlu: 2.669 ± 0.044
2.668ThrPhe: 2.668 ± 0.056
6.353ThrGly: 6.353 ± 0.124
1.084ThrHis: 1.084 ± 0.028
3.204ThrIle: 3.204 ± 0.061
2.162ThrLys: 2.162 ± 0.039
7.228ThrLeu: 7.228 ± 0.119
1.066ThrMet: 1.066 ± 0.028
2.202ThrAsn: 2.202 ± 0.066
3.724ThrPro: 3.724 ± 0.059
1.594ThrGln: 1.594 ± 0.031
3.215ThrArg: 3.215 ± 0.049
3.935ThrSer: 3.935 ± 0.084
4.186ThrThr: 4.186 ± 0.108
4.606ThrVal: 4.606 ± 0.088
0.959ThrTrp: 0.959 ± 0.031
1.612ThrTyr: 1.612 ± 0.042
0.001ThrXaa: 0.001 ± 0.001
Val
7.091ValAla: 7.091 ± 0.083
0.72ValCys: 0.72 ± 0.022
3.464ValAsp: 3.464 ± 0.055
3.946ValGlu: 3.946 ± 0.06
2.719ValPhe: 2.719 ± 0.045
4.848ValGly: 4.848 ± 0.06
1.253ValHis: 1.253 ± 0.029
3.806ValIle: 3.806 ± 0.056
2.922ValLys: 2.922 ± 0.054
6.533ValLeu: 6.533 ± 0.077
1.536ValMet: 1.536 ± 0.038
2.353ValAsn: 2.353 ± 0.057
3.534ValPro: 3.534 ± 0.055
1.905ValGln: 1.905 ± 0.037
4.091ValArg: 4.091 ± 0.055
4.734ValSer: 4.734 ± 0.065
4.672ValThr: 4.672 ± 0.089
4.935ValVal: 4.935 ± 0.064
0.997ValTrp: 0.997 ± 0.025
1.356ValTyr: 1.356 ± 0.034
0.001ValXaa: 0.001 ± 0.001
Trp
1.205TrpAla: 1.205 ± 0.032
0.168TrpCys: 0.168 ± 0.01
0.798TrpAsp: 0.798 ± 0.023
0.741TrpGlu: 0.741 ± 0.026
0.681TrpPhe: 0.681 ± 0.022
0.966TrpGly: 0.966 ± 0.027
0.365TrpHis: 0.365 ± 0.014
0.899TrpIle: 0.899 ± 0.024
0.948TrpLys: 0.948 ± 0.028
1.832TrpLeu: 1.832 ± 0.041
0.466TrpMet: 0.466 ± 0.019
0.671TrpAsn: 0.671 ± 0.023
0.663TrpPro: 0.663 ± 0.022
0.715TrpGln: 0.715 ± 0.023
1.1TrpArg: 1.1 ± 0.03
1.138TrpSer: 1.138 ± 0.027
0.992TrpThr: 0.992 ± 0.03
0.905TrpVal: 0.905 ± 0.025
0.332TrpTrp: 0.332 ± 0.016
0.309TrpTyr: 0.309 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.028TyrAla: 2.028 ± 0.041
0.241TyrCys: 0.241 ± 0.014
1.39TyrAsp: 1.39 ± 0.031
1.284TyrGlu: 1.284 ± 0.028
1.092TyrPhe: 1.092 ± 0.028
1.952TyrGly: 1.952 ± 0.039
0.545TyrHis: 0.545 ± 0.019
0.764TyrIle: 0.764 ± 0.023
0.692TyrLys: 0.692 ± 0.023
2.267TyrLeu: 2.267 ± 0.043
0.3TyrMet: 0.3 ± 0.013
0.727TyrAsn: 0.727 ± 0.023
1.103TyrPro: 1.103 ± 0.026
0.977TyrGln: 0.977 ± 0.027
1.726TyrArg: 1.726 ± 0.039
1.446TyrSer: 1.446 ± 0.036
1.407TyrThr: 1.407 ± 0.039
1.411TyrVal: 1.411 ± 0.035
0.387TyrTrp: 0.387 ± 0.016
0.657TyrTyr: 0.657 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.001
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.001
0.0XaaLeu: 0.0 ± 0.0
0.001XaaMet: 0.001 ± 0.001
0.0XaaAsn: 0.0 ± 0.0
0.002XaaPro: 0.002 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.001
0.001XaaSer: 0.001 ± 0.001
0.001XaaThr: 0.001 ± 0.001
0.001XaaVal: 0.001 ± 0.001
0.001XaaTrp: 0.001 ± 0.001
0.0XaaTyr: 0.0 ± 0.0
0.018XaaXaa: 0.018 ± 0.006
Statistics based on 5352 proteins (1447052 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski