Amino acid dipeptide frequency for Nitrosospira sp. Nsp14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
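As an illustration of the idea, here is a minimal Python sketch of how dipeptide frequencies could be computed from a set of protein sequences. It is not the code used to build this page. It assumes one-letter amino acid codes (the table uses three-letter codes), overlapping adjacent pairs counted within each protein, and a hypothetical helper name `dipeptide_per_mille`.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each
    protein, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data.
toy = ["AAL", "LA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input yields three dipeptides (AA, AL, LA), each with one third of the total.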

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
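The arithmetic behind the expected value can be sketched in Python as follows. The exact rule the page uses for red and blue colouring is not stated, so the simple comparison against the uniform expectation below is an assumption, and `classify` is a hypothetical helper. The two example values are taken from the table.

```python
N_STANDARD = 20                              # standard amino acids
N_DIPEPTIDES = N_STANDARD ** 2               # 400 ordered pairs
UNIFORM_PER_MILLE = 1000 / N_DIPEPTIDES      # 0.25% expressed in per mille


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "overrepresented"
    if value_per_mille < expected:
        return "underrepresented"
    return "as expected"


print(N_DIPEPTIDES, UNIFORM_PER_MILLE)
print("AlaAla", classify(11.957))   # value from the table
print("CysCys", classify(0.161))    # value from the table
```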

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
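For example, the unit conversion can be written as a short Python sketch. The function names are hypothetical, and the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10.0


ala_ala = 11.957  # AlaAla, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```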

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.957AlaAla: 11.957 ± 0.145
0.981AlaCys: 0.981 ± 0.037
5.118AlaAsp: 5.118 ± 0.078
6.286AlaGlu: 6.286 ± 0.104
3.746AlaPhe: 3.746 ± 0.073
8.663AlaGly: 8.663 ± 0.128
2.302AlaHis: 2.302 ± 0.052
5.628AlaIle: 5.628 ± 0.094
3.819AlaLys: 3.819 ± 0.077
10.884AlaLeu: 10.884 ± 0.129
2.728AlaMet: 2.728 ± 0.056
3.079AlaAsn: 3.079 ± 0.067
4.131AlaPro: 4.131 ± 0.076
3.98AlaGln: 3.98 ± 0.074
6.813AlaArg: 6.813 ± 0.092
5.917AlaSer: 5.917 ± 0.086
5.072AlaThr: 5.072 ± 0.066
7.33AlaVal: 7.33 ± 0.1
1.392AlaTrp: 1.392 ± 0.041
2.564AlaTyr: 2.564 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.979CysAla: 0.979 ± 0.033
0.161CysCys: 0.161 ± 0.014
0.544CysAsp: 0.544 ± 0.03
0.566CysGlu: 0.566 ± 0.027
0.347CysPhe: 0.347 ± 0.018
0.948CysGly: 0.948 ± 0.035
0.365CysHis: 0.365 ± 0.025
0.493CysIle: 0.493 ± 0.024
0.306CysLys: 0.306 ± 0.02
0.893CysLeu: 0.893 ± 0.033
0.192CysMet: 0.192 ± 0.015
0.304CysAsn: 0.304 ± 0.022
0.518CysPro: 0.518 ± 0.034
0.267CysGln: 0.267 ± 0.018
0.686CysArg: 0.686 ± 0.031
0.596CysSer: 0.596 ± 0.027
0.458CysThr: 0.458 ± 0.024
0.614CysVal: 0.614 ± 0.023
0.136CysTrp: 0.136 ± 0.011
0.292CysTyr: 0.292 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
5.324AspAla: 5.324 ± 0.079
0.473AspCys: 0.473 ± 0.022
2.691AspAsp: 2.691 ± 0.082
3.544AspGlu: 3.544 ± 0.07
2.288AspPhe: 2.288 ± 0.057
4.128AspGly: 4.128 ± 0.087
1.212AspHis: 1.212 ± 0.038
3.157AspIle: 3.157 ± 0.063
2.184AspLys: 2.184 ± 0.06
5.461AspLeu: 5.461 ± 0.095
1.228AspMet: 1.228 ± 0.04
1.732AspAsn: 1.732 ± 0.045
2.81AspPro: 2.81 ± 0.054
1.739AspGln: 1.739 ± 0.041
3.215AspArg: 3.215 ± 0.072
2.916AspSer: 2.916 ± 0.057
2.583AspThr: 2.583 ± 0.055
3.736AspVal: 3.736 ± 0.075
0.81AspTrp: 0.81 ± 0.03
1.737AspTyr: 1.737 ± 0.044
0.0AspXaa: 0.0 ± 0.0
Glu
6.205GluAla: 6.205 ± 0.106
0.519GluCys: 0.519 ± 0.024
2.516GluAsp: 2.516 ± 0.056
3.511GluGlu: 3.511 ± 0.086
2.207GluPhe: 2.207 ± 0.043
3.96GluGly: 3.96 ± 0.069
1.466GluHis: 1.466 ± 0.042
3.976GluIle: 3.976 ± 0.069
3.161GluLys: 3.161 ± 0.068
6.193GluLeu: 6.193 ± 0.098
1.619GluMet: 1.619 ± 0.042
2.161GluAsn: 2.161 ± 0.048
2.474GluPro: 2.474 ± 0.05
2.783GluGln: 2.783 ± 0.056
4.353GluArg: 4.353 ± 0.078
3.187GluSer: 3.187 ± 0.059
3.17GluThr: 3.17 ± 0.064
3.965GluVal: 3.965 ± 0.073
0.81GluTrp: 0.81 ± 0.033
1.527GluTyr: 1.527 ± 0.044
0.0GluXaa: 0.0 ± 0.0
Phe
3.656PheAla: 3.656 ± 0.061
0.444PheCys: 0.444 ± 0.025
2.427PheAsp: 2.427 ± 0.058
2.161PheGlu: 2.161 ± 0.051
1.669PhePhe: 1.669 ± 0.051
3.164PheGly: 3.164 ± 0.063
0.93PheHis: 0.93 ± 0.03
2.257PheIle: 2.257 ± 0.049
1.248PheLys: 1.248 ± 0.035
3.519PheLeu: 3.519 ± 0.068
0.847PheMet: 0.847 ± 0.028
1.473PheAsn: 1.473 ± 0.046
1.798PhePro: 1.798 ± 0.047
1.195PheGln: 1.195 ± 0.039
2.23PheArg: 2.23 ± 0.052
2.81PheSer: 2.81 ± 0.056
2.117PheThr: 2.117 ± 0.045
2.658PheVal: 2.658 ± 0.065
0.512PheTrp: 0.512 ± 0.027
1.23PheTyr: 1.23 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
7.162GlyAla: 7.162 ± 0.11
0.866GlyCys: 0.866 ± 0.034
4.097GlyAsp: 4.097 ± 0.091
4.738GlyGlu: 4.738 ± 0.08
3.252GlyPhe: 3.252 ± 0.064
6.683GlyGly: 6.683 ± 0.127
1.858GlyHis: 1.858 ± 0.054
5.047GlyIle: 5.047 ± 0.07
3.953GlyLys: 3.953 ± 0.078
7.729GlyLeu: 7.729 ± 0.094
2.541GlyMet: 2.541 ± 0.072
2.953GlyAsn: 2.953 ± 0.094
2.783GlyPro: 2.783 ± 0.063
2.888GlyGln: 2.888 ± 0.064
4.853GlyArg: 4.853 ± 0.073
4.826GlySer: 4.826 ± 0.094
4.409GlyThr: 4.409 ± 0.085
5.666GlyVal: 5.666 ± 0.073
1.251GlyTrp: 1.251 ± 0.05
2.509GlyTyr: 2.509 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
2.507HisAla: 2.507 ± 0.052
0.297HisCys: 0.297 ± 0.019
1.279HisAsp: 1.279 ± 0.037
1.435HisGlu: 1.435 ± 0.038
0.983HisPhe: 0.983 ± 0.032
2.06HisGly: 2.06 ± 0.05
0.704HisHis: 0.704 ± 0.032
1.23HisIle: 1.23 ± 0.041
0.759HisLys: 0.759 ± 0.03
2.314HisLeu: 2.314 ± 0.059
0.522HisMet: 0.522 ± 0.025
0.719HisAsn: 0.719 ± 0.031
1.45HisPro: 1.45 ± 0.041
0.862HisGln: 0.862 ± 0.03
1.483HisArg: 1.483 ± 0.043
1.28HisSer: 1.28 ± 0.032
1.076HisThr: 1.076 ± 0.033
1.586HisVal: 1.586 ± 0.045
0.335HisTrp: 0.335 ± 0.021
0.769HisTyr: 0.769 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.291IleAla: 6.291 ± 0.092
0.55IleCys: 0.55 ± 0.024
3.333IleAsp: 3.333 ± 0.064
3.7IleGlu: 3.7 ± 0.06
1.994IlePhe: 1.994 ± 0.051
4.533IleGly: 4.533 ± 0.061
1.214IleHis: 1.214 ± 0.037
2.802IleIle: 2.802 ± 0.061
2.273IleLys: 2.273 ± 0.052
5.06IleLeu: 5.06 ± 0.087
1.117IleMet: 1.117 ± 0.036
2.202IleAsn: 2.202 ± 0.057
2.78IlePro: 2.78 ± 0.054
1.664IleGln: 1.664 ± 0.046
3.446IleArg: 3.446 ± 0.064
3.667IleSer: 3.667 ± 0.07
3.227IleThr: 3.227 ± 0.064
3.874IleVal: 3.874 ± 0.068
0.663IleTrp: 0.663 ± 0.033
1.445IleTyr: 1.445 ± 0.036
0.0IleXaa: 0.0 ± 0.0
Lys
3.82LysAla: 3.82 ± 0.074
0.283LysCys: 0.283 ± 0.019
2.037LysAsp: 2.037 ± 0.054
2.347LysGlu: 2.347 ± 0.062
1.29LysPhe: 1.29 ± 0.04
2.688LysGly: 2.688 ± 0.066
0.891LysHis: 0.891 ± 0.035
2.444LysIle: 2.444 ± 0.061
2.061LysLys: 2.061 ± 0.063
4.334LysLeu: 4.334 ± 0.074
1.108LysMet: 1.108 ± 0.036
1.543LysAsn: 1.543 ± 0.04
2.22LysPro: 2.22 ± 0.045
1.704LysGln: 1.704 ± 0.044
2.72LysArg: 2.72 ± 0.057
2.386LysSer: 2.386 ± 0.055
2.436LysThr: 2.436 ± 0.056
2.698LysVal: 2.698 ± 0.061
0.502LysTrp: 0.502 ± 0.024
1.005LysTyr: 1.005 ± 0.032
0.0LysXaa: 0.0 ± 0.0
Leu
11.428LeuAla: 11.428 ± 0.124
1.014LeuCys: 1.014 ± 0.036
5.684LeuAsp: 5.684 ± 0.092
5.994LeuGlu: 5.994 ± 0.096
3.728LeuPhe: 3.728 ± 0.071
8.048LeuGly: 8.048 ± 0.115
2.343LeuHis: 2.343 ± 0.052
5.529LeuIle: 5.529 ± 0.076
4.387LeuLys: 4.387 ± 0.076
10.96LeuLeu: 10.96 ± 0.162
2.395LeuMet: 2.395 ± 0.054
3.581LeuAsn: 3.581 ± 0.063
5.737LeuPro: 5.737 ± 0.086
3.589LeuGln: 3.589 ± 0.065
7.041LeuArg: 7.041 ± 0.091
6.69LeuSer: 6.69 ± 0.098
5.445LeuThr: 5.445 ± 0.08
6.944LeuVal: 6.944 ± 0.096
1.224LeuTrp: 1.224 ± 0.045
2.442LeuTyr: 2.442 ± 0.061
0.0LeuXaa: 0.0 ± 0.0
Met
2.443MetAla: 2.443 ± 0.051
0.186MetCys: 0.186 ± 0.014
1.182MetAsp: 1.182 ± 0.038
1.381MetGlu: 1.381 ± 0.04
0.729MetPhe: 0.729 ± 0.027
1.822MetGly: 1.822 ± 0.05
0.609MetHis: 0.609 ± 0.03
1.25MetIle: 1.25 ± 0.028
1.257MetLys: 1.257 ± 0.034
2.79MetLeu: 2.79 ± 0.053
0.603MetMet: 0.603 ± 0.025
0.979MetAsn: 0.979 ± 0.033
1.353MetPro: 1.353 ± 0.038
1.029MetGln: 1.029 ± 0.033
1.773MetArg: 1.773 ± 0.047
1.613MetSer: 1.613 ± 0.039
1.4MetThr: 1.4 ± 0.039
1.7MetVal: 1.7 ± 0.042
0.204MetTrp: 0.204 ± 0.013
0.377MetTyr: 0.377 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
3.311AsnAla: 3.311 ± 0.073
0.335AsnCys: 0.335 ± 0.019
2.01AsnAsp: 2.01 ± 0.08
1.848AsnGlu: 1.848 ± 0.049
1.376AsnPhe: 1.376 ± 0.039
2.939AsnGly: 2.939 ± 0.087
0.753AsnHis: 0.753 ± 0.027
2.019AsnIle: 2.019 ± 0.051
1.169AsnLys: 1.169 ± 0.036
3.655AsnLeu: 3.655 ± 0.071
0.737AsnMet: 0.737 ± 0.031
1.332AsnAsn: 1.332 ± 0.047
2.1AsnPro: 2.1 ± 0.052
1.145AsnGln: 1.145 ± 0.031
2.255AsnArg: 2.255 ± 0.048
2.021AsnSer: 2.021 ± 0.052
1.724AsnThr: 1.724 ± 0.047
2.277AsnVal: 2.277 ± 0.062
0.513AsnTrp: 0.513 ± 0.024
0.982AsnTyr: 0.982 ± 0.03
0.0AsnXaa: 0.0 ± 0.0
Pro
4.967ProAla: 4.967 ± 0.075
0.435ProCys: 0.435 ± 0.034
3.201ProAsp: 3.201 ± 0.066
3.571ProGlu: 3.571 ± 0.06
1.861ProPhe: 1.861 ± 0.048
4.365ProGly: 4.365 ± 0.082
1.159ProHis: 1.159 ± 0.032
2.281ProIle: 2.281 ± 0.05
1.629ProLys: 1.629 ± 0.046
4.807ProLeu: 4.807 ± 0.075
1.071ProMet: 1.071 ± 0.033
1.616ProAsn: 1.616 ± 0.043
2.616ProPro: 2.616 ± 0.071
1.809ProGln: 1.809 ± 0.046
2.513ProArg: 2.513 ± 0.053
2.801ProSer: 2.801 ± 0.059
2.143ProThr: 2.143 ± 0.045
3.982ProVal: 3.982 ± 0.061
0.626ProTrp: 0.626 ± 0.03
1.358ProTyr: 1.358 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
4.016GlnAla: 4.016 ± 0.076
0.287GlnCys: 0.287 ± 0.017
1.586GlnAsp: 1.586 ± 0.045
2.01GlnGlu: 2.01 ± 0.057
1.348GlnPhe: 1.348 ± 0.035
2.623GlnGly: 2.623 ± 0.053
0.951GlnHis: 0.951 ± 0.032
2.015GlnIle: 2.015 ± 0.051
1.549GlnLys: 1.549 ± 0.042
3.999GlnLeu: 3.999 ± 0.07
0.947GlnMet: 0.947 ± 0.028
1.222GlnAsn: 1.222 ± 0.039
1.86GlnPro: 1.86 ± 0.041
1.669GlnGln: 1.669 ± 0.052
2.642GlnArg: 2.642 ± 0.066
2.026GlnSer: 2.026 ± 0.046
1.817GlnThr: 1.817 ± 0.045
2.829GlnVal: 2.829 ± 0.059
0.488GlnTrp: 0.488 ± 0.021
0.942GlnTyr: 0.942 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
5.943ArgAla: 5.943 ± 0.082
0.616ArgCys: 0.616 ± 0.027
3.534ArgAsp: 3.534 ± 0.061
4.269ArgGlu: 4.269 ± 0.077
2.803ArgPhe: 2.803 ± 0.052
4.365ArgGly: 4.365 ± 0.074
1.749ArgHis: 1.749 ± 0.043
3.879ArgIle: 3.879 ± 0.067
2.723ArgLys: 2.723 ± 0.055
7.166ArgLeu: 7.166 ± 0.089
1.756ArgMet: 1.756 ± 0.041
2.421ArgAsn: 2.421 ± 0.057
2.827ArgPro: 2.827 ± 0.055
2.762ArgGln: 2.762 ± 0.058
4.498ArgArg: 4.498 ± 0.085
3.489ArgSer: 3.489 ± 0.069
2.93ArgThr: 2.93 ± 0.062
4.381ArgVal: 4.381 ± 0.067
0.912ArgTrp: 0.912 ± 0.033
2.154ArgTyr: 2.154 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
6.042SerAla: 6.042 ± 0.093
0.629SerCys: 0.629 ± 0.033
3.043SerAsp: 3.043 ± 0.065
3.071SerGlu: 3.071 ± 0.057
2.394SerPhe: 2.394 ± 0.053
5.849SerGly: 5.849 ± 0.093
1.463SerHis: 1.463 ± 0.038
3.221SerIle: 3.221 ± 0.062
2.151SerLys: 2.151 ± 0.047
6.202SerLeu: 6.202 ± 0.093
1.496SerMet: 1.496 ± 0.035
1.918SerAsn: 1.918 ± 0.046
3.025SerPro: 3.025 ± 0.055
2.026SerGln: 2.026 ± 0.051
3.963SerArg: 3.963 ± 0.059
3.908SerSer: 3.908 ± 0.077
2.918SerThr: 2.918 ± 0.062
4.149SerVal: 4.149 ± 0.079
0.819SerTrp: 0.819 ± 0.029
1.679SerTyr: 1.679 ± 0.047
0.0SerXaa: 0.0 ± 0.0
Thr
5.261ThrAla: 5.261 ± 0.078
0.452ThrCys: 0.452 ± 0.023
2.768ThrAsp: 2.768 ± 0.061
2.807ThrGlu: 2.807 ± 0.057
1.88ThrPhe: 1.88 ± 0.042
5.009ThrGly: 5.009 ± 0.082
1.232ThrHis: 1.232 ± 0.031
2.54ThrIle: 2.54 ± 0.054
1.551ThrLys: 1.551 ± 0.041
6.059ThrLeu: 6.059 ± 0.079
1.096ThrMet: 1.096 ± 0.033
1.469ThrAsn: 1.469 ± 0.041
2.968ThrPro: 2.968 ± 0.056
1.767ThrGln: 1.767 ± 0.043
3.166ThrArg: 3.166 ± 0.059
2.993ThrSer: 2.993 ± 0.059
2.618ThrThr: 2.618 ± 0.056
3.933ThrVal: 3.933 ± 0.081
0.597ThrTrp: 0.597 ± 0.025
1.32ThrTyr: 1.32 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
7.327ValAla: 7.327 ± 0.102
0.691ValCys: 0.691 ± 0.03
3.787ValAsp: 3.787 ± 0.068
4.387ValGlu: 4.387 ± 0.07
2.682ValPhe: 2.682 ± 0.051
5.092ValGly: 5.092 ± 0.093
1.395ValHis: 1.395 ± 0.039
4.125ValIle: 4.125 ± 0.063
2.884ValLys: 2.884 ± 0.06
7.446ValLeu: 7.446 ± 0.09
1.821ValMet: 1.821 ± 0.043
2.475ValAsn: 2.475 ± 0.058
3.478ValPro: 3.478 ± 0.062
2.17ValGln: 2.17 ± 0.051
4.411ValArg: 4.411 ± 0.074
4.413ValSer: 4.413 ± 0.065
4.028ValThr: 4.028 ± 0.058
5.628ValVal: 5.628 ± 0.095
0.804ValTrp: 0.804 ± 0.03
1.655ValTyr: 1.655 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
1.062TrpAla: 1.062 ± 0.035
0.133TrpCys: 0.133 ± 0.012
0.648TrpAsp: 0.648 ± 0.03
0.642TrpGlu: 0.642 ± 0.026
0.52TrpPhe: 0.52 ± 0.022
0.899TrpGly: 0.899 ± 0.035
0.372TrpHis: 0.372 ± 0.019
0.743TrpIle: 0.743 ± 0.037
0.571TrpLys: 0.571 ± 0.025
1.682TrpLeu: 1.682 ± 0.054
0.374TrpMet: 0.374 ± 0.023
0.489TrpAsn: 0.489 ± 0.025
0.529TrpPro: 0.529 ± 0.025
0.674TrpGln: 0.674 ± 0.036
1.052TrpArg: 1.052 ± 0.036
0.76TrpSer: 0.76 ± 0.032
0.567TrpThr: 0.567 ± 0.027
0.93TrpVal: 0.93 ± 0.03
0.209TrpTrp: 0.209 ± 0.014
0.376TrpTyr: 0.376 ± 0.025
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.69TyrAla: 2.69 ± 0.059
0.315TyrCys: 0.315 ± 0.018
1.482TyrAsp: 1.482 ± 0.038
1.496TyrGlu: 1.496 ± 0.045
1.248TyrPhe: 1.248 ± 0.039
2.226TyrGly: 2.226 ± 0.052
0.683TyrHis: 0.683 ± 0.028
1.197TyrIle: 1.197 ± 0.037
0.881TyrLys: 0.881 ± 0.035
3.05TyrLeu: 3.05 ± 0.058
0.482TyrMet: 0.482 ± 0.022
0.835TyrAsn: 0.835 ± 0.031
1.355TyrPro: 1.355 ± 0.041
1.104TyrGln: 1.104 ± 0.042
2.062TyrArg: 2.062 ± 0.046
1.666TyrSer: 1.666 ± 0.042
1.349TyrThr: 1.349 ± 0.045
1.817TyrVal: 1.817 ± 0.045
0.426TyrTrp: 0.426 ± 0.023
0.85TyrTyr: 0.85 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3081 proteins (940644 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
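A protein-level bootstrap of this kind might look like the Python sketch below. It is an illustration under assumptions, not the procedure used for this page: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: the reported error is the standard deviation over
    bootstrap replicates, each built by resampling whole proteins.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["AAL", "LA", "AAAA", "LLAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact within a replicate.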

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski