Amino acid dipepetide frequency for Candidatus Nitrosocosmicus arcticus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.314AlaAla: 3.314 ± 0.093
0.562AlaCys: 0.562 ± 0.036
2.453AlaAsp: 2.453 ± 0.062
2.806AlaGlu: 2.806 ± 0.075
2.255AlaPhe: 2.255 ± 0.063
3.464AlaGly: 3.464 ± 0.081
0.876AlaHis: 0.876 ± 0.039
4.988AlaIle: 4.988 ± 0.113
3.544AlaLys: 3.544 ± 0.078
4.95AlaLeu: 4.95 ± 0.097
1.135AlaMet: 1.135 ± 0.039
2.65AlaAsn: 2.65 ± 0.076
1.463AlaPro: 1.463 ± 0.06
1.662AlaGln: 1.662 ± 0.061
1.889AlaArg: 1.889 ± 0.051
3.728AlaSer: 3.728 ± 0.084
2.74AlaThr: 2.74 ± 0.072
3.073AlaVal: 3.073 ± 0.062
0.416AlaTrp: 0.416 ± 0.028
1.645AlaTyr: 1.645 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.485CysAla: 0.485 ± 0.028
0.188CysCys: 0.188 ± 0.018
0.604CysAsp: 0.604 ± 0.034
0.582CysGlu: 0.582 ± 0.035
0.417CysPhe: 0.417 ± 0.024
0.858CysGly: 0.858 ± 0.044
0.238CysHis: 0.238 ± 0.02
0.975CysIle: 0.975 ± 0.038
0.799CysLys: 0.799 ± 0.036
0.839CysLeu: 0.839 ± 0.032
0.259CysMet: 0.259 ± 0.02
0.725CysAsn: 0.725 ± 0.04
0.573CysPro: 0.573 ± 0.035
0.232CysGln: 0.232 ± 0.018
0.36CysArg: 0.36 ± 0.02
0.815CysSer: 0.815 ± 0.04
0.519CysThr: 0.519 ± 0.033
0.544CysVal: 0.544 ± 0.029
0.094CysTrp: 0.094 ± 0.012
0.387CysTyr: 0.387 ± 0.029
0.0CysXaa: 0.0 ± 0.0
Asp
2.583AspAla: 2.583 ± 0.065
0.606AspCys: 0.606 ± 0.036
3.326AspAsp: 3.326 ± 0.095
3.854AspGlu: 3.854 ± 0.08
2.704AspPhe: 2.704 ± 0.071
3.384AspGly: 3.384 ± 0.089
1.029AspHis: 1.029 ± 0.037
5.321AspIle: 5.321 ± 0.1
4.488AspLys: 4.488 ± 0.096
5.388AspLeu: 5.388 ± 0.086
1.216AspMet: 1.216 ± 0.04
3.751AspAsn: 3.751 ± 0.088
2.284AspPro: 2.284 ± 0.064
1.614AspGln: 1.614 ± 0.049
2.103AspArg: 2.103 ± 0.055
4.603AspSer: 4.603 ± 0.099
2.801AspThr: 2.801 ± 0.079
3.356AspVal: 3.356 ± 0.066
0.534AspTrp: 0.534 ± 0.029
2.311AspTyr: 2.311 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
2.894GluAla: 2.894 ± 0.077
0.577GluCys: 0.577 ± 0.029
3.115GluAsp: 3.115 ± 0.067
4.078GluGlu: 4.078 ± 0.091
2.813GluPhe: 2.813 ± 0.073
3.23GluGly: 3.23 ± 0.088
1.03GluHis: 1.03 ± 0.037
6.711GluIle: 6.711 ± 0.109
5.922GluLys: 5.922 ± 0.103
5.537GluLeu: 5.537 ± 0.105
1.627GluMet: 1.627 ± 0.058
4.455GluAsn: 4.455 ± 0.086
1.75GluPro: 1.75 ± 0.062
1.911GluGln: 1.911 ± 0.058
2.447GluArg: 2.447 ± 0.065
4.614GluSer: 4.614 ± 0.082
3.061GluThr: 3.061 ± 0.079
3.398GluVal: 3.398 ± 0.08
0.594GluTrp: 0.594 ± 0.029
2.382GluTyr: 2.382 ± 0.057
0.0GluXaa: 0.0 ± 0.0
Phe
2.411PheAla: 2.411 ± 0.06
0.555PheCys: 0.555 ± 0.03
3.136PheAsp: 3.136 ± 0.071
2.856PheGlu: 2.856 ± 0.067
2.163PhePhe: 2.163 ± 0.072
2.931PheGly: 2.931 ± 0.076
0.878PheHis: 0.878 ± 0.04
3.917PheIle: 3.917 ± 0.083
2.848PheLys: 2.848 ± 0.061
4.282PheLeu: 4.282 ± 0.096
1.024PheMet: 1.024 ± 0.039
2.646PheAsn: 2.646 ± 0.057
1.584PhePro: 1.584 ± 0.047
1.288PheGln: 1.288 ± 0.043
1.579PheArg: 1.579 ± 0.046
4.01PheSer: 4.01 ± 0.083
2.387PheThr: 2.387 ± 0.057
2.949PheVal: 2.949 ± 0.066
0.452PheTrp: 0.452 ± 0.027
1.814PheTyr: 1.814 ± 0.061
0.0PheXaa: 0.0 ± 0.0
Gly
3.175GlyAla: 3.175 ± 0.076
0.615GlyCys: 0.615 ± 0.04
3.067GlyAsp: 3.067 ± 0.077
3.049GlyGlu: 3.049 ± 0.064
3.037GlyPhe: 3.037 ± 0.085
4.205GlyGly: 4.205 ± 0.119
1.159GlyHis: 1.159 ± 0.045
6.393GlyIle: 6.393 ± 0.104
4.636GlyLys: 4.636 ± 0.09
5.465GlyLeu: 5.465 ± 0.095
1.605GlyMet: 1.605 ± 0.049
3.864GlyAsn: 3.864 ± 0.102
1.826GlyPro: 1.826 ± 0.053
1.805GlyGln: 1.805 ± 0.064
2.213GlyArg: 2.213 ± 0.065
4.638GlySer: 4.638 ± 0.113
3.346GlyThr: 3.346 ± 0.082
3.483GlyVal: 3.483 ± 0.077
0.574GlyTrp: 0.574 ± 0.03
2.378GlyTyr: 2.378 ± 0.066
0.0GlyXaa: 0.0 ± 0.0
His
0.849HisAla: 0.849 ± 0.033
0.205HisCys: 0.205 ± 0.017
0.998HisAsp: 0.998 ± 0.039
1.134HisGlu: 1.134 ± 0.045
0.842HisPhe: 0.842 ± 0.037
1.149HisGly: 1.149 ± 0.039
0.465HisHis: 0.465 ± 0.029
1.62HisIle: 1.62 ± 0.051
1.114HisLys: 1.114 ± 0.04
1.756HisLeu: 1.756 ± 0.054
0.417HisMet: 0.417 ± 0.025
1.071HisAsn: 1.071 ± 0.042
0.926HisPro: 0.926 ± 0.039
0.529HisGln: 0.529 ± 0.028
0.683HisArg: 0.683 ± 0.035
1.367HisSer: 1.367 ± 0.045
0.966HisThr: 0.966 ± 0.042
1.08HisVal: 1.08 ± 0.034
0.188HisTrp: 0.188 ± 0.016
0.7HisTyr: 0.7 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
5.146IleAla: 5.146 ± 0.101
1.041IleCys: 1.041 ± 0.046
5.867IleAsp: 5.867 ± 0.101
6.124IleGlu: 6.124 ± 0.11
4.201IlePhe: 4.201 ± 0.083
5.756IleGly: 5.756 ± 0.103
1.727IleHis: 1.727 ± 0.054
9.017IleIle: 9.017 ± 0.141
7.057IleLys: 7.057 ± 0.111
9.103IleLeu: 9.103 ± 0.131
2.161IleMet: 2.161 ± 0.056
5.806IleAsn: 5.806 ± 0.106
4.104IlePro: 4.104 ± 0.082
2.845IleGln: 2.845 ± 0.063
3.684IleArg: 3.684 ± 0.083
7.765IleSer: 7.765 ± 0.129
5.225IleThr: 5.225 ± 0.085
5.949IleVal: 5.949 ± 0.098
0.698IleTrp: 0.698 ± 0.032
2.921IleTyr: 2.921 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
3.069LysAla: 3.069 ± 0.066
0.718LysCys: 0.718 ± 0.037
4.38LysAsp: 4.38 ± 0.078
5.768LysGlu: 5.768 ± 0.112
3.027LysPhe: 3.027 ± 0.063
3.885LysGly: 3.885 ± 0.087
1.252LysHis: 1.252 ± 0.041
8.4LysIle: 8.4 ± 0.13
6.721LysLys: 6.721 ± 0.121
6.373LysLeu: 6.373 ± 0.125
2.258LysMet: 2.258 ± 0.059
5.613LysAsn: 5.613 ± 0.112
2.341LysPro: 2.341 ± 0.057
2.115LysGln: 2.115 ± 0.055
2.972LysArg: 2.972 ± 0.065
5.415LysSer: 5.415 ± 0.081
3.911LysThr: 3.911 ± 0.086
4.113LysVal: 4.113 ± 0.081
0.637LysTrp: 0.637 ± 0.033
3.011LysTyr: 3.011 ± 0.074
0.0LysXaa: 0.0 ± 0.0
Leu
4.902LeuAla: 4.902 ± 0.108
0.827LeuCys: 0.827 ± 0.036
5.568LeuAsp: 5.568 ± 0.084
5.87LeuGlu: 5.87 ± 0.095
4.267LeuPhe: 4.267 ± 0.094
5.566LeuGly: 5.566 ± 0.096
1.464LeuHis: 1.464 ± 0.045
8.294LeuIle: 8.294 ± 0.137
6.887LeuLys: 6.887 ± 0.104
8.465LeuLeu: 8.465 ± 0.134
2.148LeuMet: 2.148 ± 0.057
5.453LeuAsn: 5.453 ± 0.089
3.341LeuPro: 3.341 ± 0.066
2.471LeuGln: 2.471 ± 0.07
3.763LeuArg: 3.763 ± 0.085
7.784LeuSer: 7.784 ± 0.129
4.76LeuThr: 4.76 ± 0.092
5.803LeuVal: 5.803 ± 0.096
0.686LeuTrp: 0.686 ± 0.031
2.895LeuTyr: 2.895 ± 0.068
0.0LeuXaa: 0.0 ± 0.0
Met
1.268MetAla: 1.268 ± 0.048
0.197MetCys: 0.197 ± 0.018
1.484MetAsp: 1.484 ± 0.041
1.44MetGlu: 1.44 ± 0.052
0.897MetPhe: 0.897 ± 0.038
1.451MetGly: 1.451 ± 0.055
0.452MetHis: 0.452 ± 0.027
2.342MetIle: 2.342 ± 0.055
2.031MetLys: 2.031 ± 0.053
2.1MetLeu: 2.1 ± 0.059
0.695MetMet: 0.695 ± 0.036
1.678MetAsn: 1.678 ± 0.051
0.912MetPro: 0.912 ± 0.04
0.646MetGln: 0.646 ± 0.031
0.9MetArg: 0.9 ± 0.039
2.046MetSer: 2.046 ± 0.053
1.469MetThr: 1.469 ± 0.05
1.709MetVal: 1.709 ± 0.059
0.163MetTrp: 0.163 ± 0.015
0.737MetTyr: 0.737 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
2.975AsnAla: 2.975 ± 0.078
0.67AsnCys: 0.67 ± 0.039
3.706AsnAsp: 3.706 ± 0.091
4.05AsnGlu: 4.05 ± 0.08
2.815AsnPhe: 2.815 ± 0.063
3.535AsnGly: 3.535 ± 0.092
1.156AsnHis: 1.156 ± 0.044
5.888AsnIle: 5.888 ± 0.101
5.041AsnLys: 5.041 ± 0.09
5.839AsnLeu: 5.839 ± 0.094
1.733AsnMet: 1.733 ± 0.047
5.625AsnAsn: 5.625 ± 0.134
2.809AsnPro: 2.809 ± 0.074
2.474AsnGln: 2.474 ± 0.087
2.315AsnArg: 2.315 ± 0.064
5.683AsnSer: 5.683 ± 0.109
3.588AsnThr: 3.588 ± 0.092
3.473AsnVal: 3.473 ± 0.084
0.541AsnTrp: 0.541 ± 0.029
2.432AsnTyr: 2.432 ± 0.065
0.0AsnXaa: 0.0 ± 0.0
Pro
1.703ProAla: 1.703 ± 0.054
0.334ProCys: 0.334 ± 0.026
2.213ProAsp: 2.213 ± 0.061
2.436ProGlu: 2.436 ± 0.075
1.799ProPhe: 1.799 ± 0.049
2.16ProGly: 2.16 ± 0.076
0.77ProHis: 0.77 ± 0.034
3.304ProIle: 3.304 ± 0.074
2.236ProLys: 2.236 ± 0.052
3.31ProLeu: 3.31 ± 0.069
0.794ProMet: 0.794 ± 0.03
2.077ProAsn: 2.077 ± 0.06
1.359ProPro: 1.359 ± 0.062
1.135ProGln: 1.135 ± 0.046
1.246ProArg: 1.246 ± 0.038
3.027ProSer: 3.027 ± 0.073
2.189ProThr: 2.189 ± 0.084
2.484ProVal: 2.484 ± 0.069
0.329ProTrp: 0.329 ± 0.027
1.336ProTyr: 1.336 ± 0.046
0.0ProXaa: 0.0 ± 0.0
Gln
1.412GlnAla: 1.412 ± 0.05
0.329GlnCys: 0.329 ± 0.023
1.552GlnAsp: 1.552 ± 0.056
1.82GlnGlu: 1.82 ± 0.058
1.3GlnPhe: 1.3 ± 0.043
1.57GlnGly: 1.57 ± 0.058
0.522GlnHis: 0.522 ± 0.026
2.9GlnIle: 2.9 ± 0.067
2.296GlnLys: 2.296 ± 0.07
2.474GlnLeu: 2.474 ± 0.063
0.826GlnMet: 0.826 ± 0.036
2.426GlnAsn: 2.426 ± 0.073
0.939GlnPro: 0.939 ± 0.043
1.14GlnGln: 1.14 ± 0.053
1.149GlnArg: 1.149 ± 0.037
2.61GlnSer: 2.61 ± 0.093
1.736GlnThr: 1.736 ± 0.061
1.75GlnVal: 1.75 ± 0.055
0.245GlnTrp: 0.245 ± 0.02
1.05GlnTyr: 1.05 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
1.645ArgAla: 1.645 ± 0.056
0.426ArgCys: 0.426 ± 0.025
2.001ArgAsp: 2.001 ± 0.058
2.538ArgGlu: 2.538 ± 0.079
1.751ArgPhe: 1.751 ± 0.055
2.101ArgGly: 2.101 ± 0.062
0.661ArgHis: 0.661 ± 0.034
3.992ArgIle: 3.992 ± 0.09
3.209ArgLys: 3.209 ± 0.063
3.561ArgLeu: 3.561 ± 0.085
1.044ArgMet: 1.044 ± 0.038
2.542ArgAsn: 2.542 ± 0.066
1.174ArgPro: 1.174 ± 0.049
1.044ArgGln: 1.044 ± 0.039
1.557ArgArg: 1.557 ± 0.054
2.43ArgSer: 2.43 ± 0.059
1.792ArgThr: 1.792 ± 0.049
2.125ArgVal: 2.125 ± 0.061
0.348ArgTrp: 0.348 ± 0.023
1.4ArgTyr: 1.4 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
3.684SerAla: 3.684 ± 0.084
0.818SerCys: 0.818 ± 0.038
4.413SerAsp: 4.413 ± 0.074
4.614SerGlu: 4.614 ± 0.077
3.748SerPhe: 3.748 ± 0.073
5.006SerGly: 5.006 ± 0.129
1.434SerHis: 1.434 ± 0.048
7.678SerIle: 7.678 ± 0.112
6.298SerLys: 6.298 ± 0.099
7.219SerLeu: 7.219 ± 0.108
2.007SerMet: 2.007 ± 0.054
5.994SerAsn: 5.994 ± 0.118
2.732SerPro: 2.732 ± 0.068
2.553SerGln: 2.553 ± 0.081
2.789SerArg: 2.789 ± 0.07
7.116SerSer: 7.116 ± 0.15
4.626SerThr: 4.626 ± 0.097
4.361SerVal: 4.361 ± 0.084
0.597SerTrp: 0.597 ± 0.03
2.613SerTyr: 2.613 ± 0.068
0.0SerXaa: 0.0 ± 0.0
Thr
2.87ThrAla: 2.87 ± 0.07
0.553ThrCys: 0.553 ± 0.034
2.91ThrAsp: 2.91 ± 0.082
3.021ThrGlu: 3.021 ± 0.064
2.553ThrPhe: 2.553 ± 0.062
3.826ThrGly: 3.826 ± 0.086
0.974ThrHis: 0.974 ± 0.038
5.237ThrIle: 5.237 ± 0.102
3.712ThrLys: 3.712 ± 0.076
4.847ThrLeu: 4.847 ± 0.082
1.315ThrMet: 1.315 ± 0.045
3.558ThrAsn: 3.558 ± 0.087
2.236ThrPro: 2.236 ± 0.086
1.593ThrGln: 1.593 ± 0.055
1.853ThrArg: 1.853 ± 0.058
4.363ThrSer: 4.363 ± 0.091
3.519ThrThr: 3.519 ± 0.152
3.332ThrVal: 3.332 ± 0.078
0.431ThrTrp: 0.431 ± 0.024
1.721ThrTyr: 1.721 ± 0.058
0.0ThrXaa: 0.0 ± 0.0
Val
3.183ValAla: 3.183 ± 0.072
0.633ValCys: 0.633 ± 0.032
3.597ValAsp: 3.597 ± 0.069
3.562ValGlu: 3.562 ± 0.078
2.777ValPhe: 2.777 ± 0.071
3.654ValGly: 3.654 ± 0.08
0.993ValHis: 0.993 ± 0.039
5.779ValIle: 5.779 ± 0.099
4.125ValLys: 4.125 ± 0.078
5.378ValLeu: 5.378 ± 0.1
1.436ValMet: 1.436 ± 0.045
3.622ValAsn: 3.622 ± 0.084
2.136ValPro: 2.136 ± 0.059
1.608ValGln: 1.608 ± 0.045
2.143ValArg: 2.143 ± 0.069
4.687ValSer: 4.687 ± 0.09
3.633ValThr: 3.633 ± 0.069
3.891ValVal: 3.891 ± 0.085
0.464ValTrp: 0.464 ± 0.027
2.104ValTyr: 2.104 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.378TrpAla: 0.378 ± 0.022
0.097TrpCys: 0.097 ± 0.012
0.456TrpAsp: 0.456 ± 0.026
0.447TrpGlu: 0.447 ± 0.03
0.431TrpPhe: 0.431 ± 0.03
0.498TrpGly: 0.498 ± 0.029
0.194TrpHis: 0.194 ± 0.019
0.948TrpIle: 0.948 ± 0.036
0.79TrpLys: 0.79 ± 0.036
0.752TrpLeu: 0.752 ± 0.036
0.235TrpMet: 0.235 ± 0.019
0.6TrpAsn: 0.6 ± 0.027
0.232TrpPro: 0.232 ± 0.02
0.245TrpGln: 0.245 ± 0.021
0.304TrpArg: 0.304 ± 0.02
0.517TrpSer: 0.517 ± 0.033
0.452TrpThr: 0.452 ± 0.027
0.459TrpVal: 0.459 ± 0.027
0.108TrpTrp: 0.108 ± 0.015
0.295TrpTyr: 0.295 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.668TyrAla: 1.668 ± 0.057
0.543TyrCys: 0.543 ± 0.032
2.408TyrAsp: 2.408 ± 0.078
2.097TyrGlu: 2.097 ± 0.061
1.911TyrPhe: 1.911 ± 0.048
2.305TyrGly: 2.305 ± 0.059
0.784TyrHis: 0.784 ± 0.03
2.495TyrIle: 2.495 ± 0.056
2.224TyrLys: 2.224 ± 0.057
3.579TyrLeu: 3.579 ± 0.075
0.682TyrMet: 0.682 ± 0.029
2.192TyrAsn: 2.192 ± 0.061
1.522TyrPro: 1.522 ± 0.048
1.137TyrGln: 1.137 ± 0.041
1.394TyrArg: 1.394 ± 0.047
3.081TyrSer: 3.081 ± 0.071
1.703TyrThr: 1.703 ± 0.051
2.07TyrVal: 2.07 ± 0.055
0.338TyrTrp: 0.338 ± 0.023
1.508TyrTyr: 1.508 ± 0.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3069 proteins (668652 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski