Amino acid dipeptide frequency for Armatimonadetes bacterium Uphvl-Ar1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
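
As an illustration of what such a calculation involves, here is a minimal Python sketch. It assumes one-letter amino acid codes (the table on this page uses three-letter codes), overlapping windows of two residues that never span two proteins, and normalization by the total number of dipeptides. The function names are hypothetical, and the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs; pairs never span two proteins."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies_per_mille(sequences):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): MA, AA, AL from the first sequence,
# AA, AG from the second, so five dipeptides in total.
toy_proteins = ["MAAL", "AAG"]
toy_freqs = dipeptide_frequencies_per_mille(toy_proteins)
```

In this toy example "AA" occurs twice among five dipeptides, so its frequency is 400‰.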

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
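
The uniform expectation and the over/under marking can be sketched in Python as follows. This is only an illustration: the function names are hypothetical, and the strict greater-than/less-than comparison is an assumption, since the page does not say whether any tolerance is applied before a cell is coloured. The three observed values are taken from the table below.

```python
STANDARD_AA = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids


def uniform_expectation_per_mille(include_xaa=False):
    """Expected frequency (in per mille) if every dipeptide were equally likely."""
    n = len(STANDARD_AA) + (1 if include_xaa else 0)
    return 1000.0 / (n * n)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed strict comparison)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille
observed = {"AlaAla": 8.9, "CysCys": 0.135, "LeuAla": 9.975}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
```

With 400 dipeptides the expectation is 2.5‰, so AlaAla and LeuAla come out as overrepresented and CysCys as underrepresented. With Xaa included (441 dipeptides) the expectation drops to about 2.27‰.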

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.9 ± 0.119
AlaCys: 0.848 ± 0.035
AlaAsp: 4.827 ± 0.07
AlaGlu: 6.296 ± 0.095
AlaPhe: 3.691 ± 0.059
AlaGly: 7.77 ± 0.11
AlaHis: 1.607 ± 0.049
AlaIle: 5.757 ± 0.096
AlaLys: 4.61 ± 0.086
AlaLeu: 8.763 ± 0.14
AlaMet: 2.612 ± 0.064
AlaAsn: 3.199 ± 0.056
AlaPro: 3.714 ± 0.069
AlaGln: 3.723 ± 0.071
AlaArg: 4.81 ± 0.061
AlaSer: 5.183 ± 0.08
AlaThr: 5.325 ± 0.078
AlaVal: 6.669 ± 0.091
AlaTrp: 1.255 ± 0.038
AlaTyr: 2.172 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.718 ± 0.03
CysCys: 0.135 ± 0.013
CysAsp: 0.522 ± 0.023
CysGlu: 0.537 ± 0.028
CysPhe: 0.338 ± 0.017
CysGly: 0.955 ± 0.034
CysHis: 0.231 ± 0.018
CysIle: 0.431 ± 0.024
CysLys: 0.259 ± 0.018
CysLeu: 0.909 ± 0.032
CysMet: 0.196 ± 0.016
CysAsn: 0.266 ± 0.017
CysPro: 0.55 ± 0.026
CysGln: 0.326 ± 0.021
CysArg: 0.559 ± 0.028
CysSer: 0.574 ± 0.028
CysThr: 0.47 ± 0.022
CysVal: 0.601 ± 0.026
CysTrp: 0.111 ± 0.01
CysTyr: 0.206 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.484 ± 0.076
AspCys: 0.49 ± 0.023
AspAsp: 2.771 ± 0.061
AspGlu: 3.613 ± 0.084
AspPhe: 2.526 ± 0.05
AspGly: 4.772 ± 0.09
AspHis: 1.191 ± 0.037
AspIle: 2.713 ± 0.055
AspLys: 1.961 ± 0.048
AspLeu: 5.779 ± 0.084
AspMet: 1.203 ± 0.037
AspAsn: 1.531 ± 0.042
AspPro: 3.305 ± 0.067
AspGln: 2.333 ± 0.052
AspArg: 3.543 ± 0.067
AspSer: 3.258 ± 0.056
AspThr: 2.198 ± 0.05
AspVal: 3.685 ± 0.066
AspTrp: 1.001 ± 0.038
AspTyr: 1.565 ± 0.042
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.932 ± 0.095
GluCys: 0.45 ± 0.021
GluAsp: 2.875 ± 0.06
GluGlu: 4.211 ± 0.083
GluPhe: 2.856 ± 0.055
GluGly: 4.239 ± 0.078
GluHis: 1.103 ± 0.037
GluIle: 4.229 ± 0.073
GluLys: 3.228 ± 0.077
GluLeu: 6.33 ± 0.086
GluMet: 1.803 ± 0.052
GluAsn: 2.249 ± 0.049
GluPro: 2.789 ± 0.056
GluGln: 2.332 ± 0.054
GluArg: 3.895 ± 0.075
GluSer: 4.243 ± 0.071
GluThr: 3.333 ± 0.064
GluVal: 4.895 ± 0.093
GluTrp: 0.988 ± 0.033
GluTyr: 1.475 ± 0.044
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.993 ± 0.069
PheCys: 0.457 ± 0.022
PheAsp: 2.749 ± 0.05
PheGlu: 2.714 ± 0.054
PhePhe: 1.61 ± 0.049
PheGly: 4.073 ± 0.075
PheHis: 0.779 ± 0.032
PheIle: 1.943 ± 0.05
PheLys: 1.456 ± 0.048
PheLeu: 3.721 ± 0.084
PheMet: 0.883 ± 0.033
PheAsn: 1.576 ± 0.047
PhePro: 1.903 ± 0.052
PheGln: 1.438 ± 0.042
PheArg: 2.365 ± 0.057
PheSer: 2.698 ± 0.054
PheThr: 2.401 ± 0.052
PheVal: 3.159 ± 0.057
PheTrp: 0.63 ± 0.028
PheTyr: 1.152 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.794 ± 0.108
GlyCys: 0.899 ± 0.032
GlyAsp: 4.467 ± 0.07
GlyGlu: 5.101 ± 0.082
GlyPhe: 4.03 ± 0.075
GlyGly: 7.282 ± 0.16
GlyHis: 1.627 ± 0.05
GlyIle: 4.84 ± 0.069
GlyLys: 4.12 ± 0.075
GlyLeu: 7.897 ± 0.114
GlyMet: 2.395 ± 0.068
GlyAsn: 3.089 ± 0.057
GlyPro: 2.705 ± 0.06
GlyGln: 3.228 ± 0.067
GlyArg: 4.501 ± 0.08
GlySer: 5.481 ± 0.073
GlyThr: 4.87 ± 0.091
GlyVal: 6.491 ± 0.098
GlyTrp: 1.475 ± 0.046
GlyTyr: 2.371 ± 0.055
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.543 ± 0.044
HisCys: 0.206 ± 0.018
HisAsp: 0.93 ± 0.036
HisGlu: 1.162 ± 0.039
HisPhe: 0.808 ± 0.029
HisGly: 1.479 ± 0.041
HisHis: 0.518 ± 0.031
HisIle: 1.037 ± 0.042
HisLys: 0.645 ± 0.026
HisLeu: 1.913 ± 0.052
HisMet: 0.391 ± 0.023
HisAsn: 0.614 ± 0.026
HisPro: 1.292 ± 0.044
HisGln: 0.733 ± 0.033
HisArg: 1.144 ± 0.038
HisSer: 1.147 ± 0.036
HisThr: 0.979 ± 0.031
HisVal: 1.179 ± 0.036
HisTrp: 0.264 ± 0.017
HisTyr: 0.543 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.976 ± 0.087
IleCys: 0.564 ± 0.024
IleAsp: 3.424 ± 0.063
IleGlu: 3.974 ± 0.076
IlePhe: 2.128 ± 0.055
IleGly: 4.901 ± 0.084
IleHis: 1.124 ± 0.039
IleIle: 2.769 ± 0.056
IleLys: 2.404 ± 0.062
IleLeu: 5.562 ± 0.088
IleMet: 0.986 ± 0.038
IleAsn: 2.141 ± 0.05
IlePro: 3.137 ± 0.072
IleGln: 2.189 ± 0.054
IleArg: 3.291 ± 0.054
IleSer: 3.6 ± 0.061
IleThr: 3.363 ± 0.07
IleVal: 3.951 ± 0.073
IleTrp: 0.77 ± 0.033
IleTyr: 1.393 ± 0.046
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.028 ± 0.079
LysCys: 0.292 ± 0.02
LysAsp: 2.18 ± 0.053
LysGlu: 2.746 ± 0.063
LysPhe: 1.898 ± 0.05
LysGly: 3.099 ± 0.063
LysHis: 0.708 ± 0.025
LysIle: 2.611 ± 0.061
LysLys: 2.479 ± 0.066
LysLeu: 4.17 ± 0.071
LysMet: 1.262 ± 0.043
LysAsn: 1.836 ± 0.043
LysPro: 2.422 ± 0.057
LysGln: 1.572 ± 0.044
LysArg: 2.471 ± 0.059
LysSer: 3.259 ± 0.069
LysThr: 2.847 ± 0.069
LysVal: 3.203 ± 0.062
LysTrp: 0.635 ± 0.026
LysTyr: 1.112 ± 0.036
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.975 ± 0.109
LeuCys: 0.885 ± 0.032
LeuAsp: 5.304 ± 0.09
LeuGlu: 5.678 ± 0.09
LeuPhe: 3.463 ± 0.069
LeuGly: 8.029 ± 0.111
LeuHis: 1.62 ± 0.044
LeuIle: 5.487 ± 0.082
LeuLys: 4.392 ± 0.084
LeuLeu: 8.069 ± 0.115
LeuMet: 2.27 ± 0.055
LeuAsn: 3.581 ± 0.064
LeuPro: 4.707 ± 0.067
LeuGln: 2.927 ± 0.06
LeuArg: 5.561 ± 0.074
LeuSer: 6.089 ± 0.09
LeuThr: 5.887 ± 0.091
LeuVal: 6.974 ± 0.099
LeuTrp: 1.141 ± 0.035
LeuTyr: 2.079 ± 0.054
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.501 ± 0.059
MetCys: 0.178 ± 0.014
MetAsp: 1.253 ± 0.038
MetGlu: 1.405 ± 0.041
MetPhe: 0.839 ± 0.034
MetGly: 2.136 ± 0.053
MetHis: 0.399 ± 0.023
MetIle: 1.602 ± 0.044
MetLys: 1.468 ± 0.045
MetLeu: 2.046 ± 0.055
MetMet: 0.796 ± 0.034
MetAsn: 1.021 ± 0.031
MetPro: 1.208 ± 0.035
MetGln: 0.731 ± 0.031
MetArg: 1.654 ± 0.048
MetSer: 1.791 ± 0.048
MetThr: 1.54 ± 0.045
MetVal: 1.983 ± 0.053
MetTrp: 0.266 ± 0.019
MetTyr: 0.395 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.086 ± 0.063
AsnCys: 0.328 ± 0.02
AsnAsp: 1.877 ± 0.048
AsnGlu: 1.94 ± 0.04
AsnPhe: 1.587 ± 0.043
AsnGly: 3.099 ± 0.068
AsnHis: 0.815 ± 0.037
AsnIle: 1.814 ± 0.05
AsnLys: 1.294 ± 0.037
AsnLeu: 3.83 ± 0.068
AsnMet: 0.821 ± 0.03
AsnAsn: 1.316 ± 0.042
AsnPro: 2.87 ± 0.068
AsnGln: 1.648 ± 0.045
AsnArg: 2.266 ± 0.052
AsnSer: 2.461 ± 0.059
AsnThr: 1.88 ± 0.052
AsnVal: 2.366 ± 0.052
AsnTrp: 0.726 ± 0.03
AsnTyr: 1.13 ± 0.037
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.229 ± 0.084
ProCys: 0.354 ± 0.018
ProAsp: 3.112 ± 0.066
ProGlu: 3.964 ± 0.065
ProPhe: 1.903 ± 0.044
ProGly: 4.226 ± 0.072
ProHis: 0.901 ± 0.03
ProIle: 2.88 ± 0.055
ProLys: 2.376 ± 0.057
ProLeu: 3.876 ± 0.067
ProMet: 1.216 ± 0.035
ProAsn: 2.276 ± 0.062
ProPro: 2.038 ± 0.071
ProGln: 1.724 ± 0.05
ProArg: 2.041 ± 0.049
ProSer: 3.003 ± 0.062
ProThr: 3.202 ± 0.058
ProVal: 3.75 ± 0.06
ProTrp: 0.637 ± 0.032
ProTyr: 1.247 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.7 ± 0.063
GlnCys: 0.23 ± 0.018
GlnAsp: 1.695 ± 0.042
GlnGlu: 2.04 ± 0.052
GlnPhe: 1.719 ± 0.044
GlnGly: 2.605 ± 0.066
GlnHis: 0.533 ± 0.024
GlnIle: 2.643 ± 0.061
GlnLys: 1.671 ± 0.046
GlnLeu: 3.472 ± 0.064
GlnMet: 1.055 ± 0.034
GlnAsn: 1.621 ± 0.043
GlnPro: 1.749 ± 0.047
GlnGln: 1.247 ± 0.041
GlnArg: 1.992 ± 0.047
GlnSer: 2.664 ± 0.056
GlnThr: 2.422 ± 0.056
GlnVal: 2.721 ± 0.051
GlnTrp: 0.468 ± 0.024
GlnTyr: 0.915 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.856 ± 0.085
ArgCys: 0.445 ± 0.022
ArgAsp: 3.101 ± 0.063
ArgGlu: 3.894 ± 0.061
ArgPhe: 2.439 ± 0.064
ArgGly: 4.077 ± 0.072
ArgHis: 1.039 ± 0.031
ArgIle: 3.685 ± 0.066
ArgLys: 2.71 ± 0.054
ArgLeu: 5.672 ± 0.084
ArgMet: 1.676 ± 0.041
ArgAsn: 2.098 ± 0.047
ArgPro: 2.472 ± 0.059
ArgGln: 2.187 ± 0.047
ArgArg: 3.66 ± 0.079
ArgSer: 3.477 ± 0.062
ArgThr: 3.086 ± 0.061
ArgVal: 4.35 ± 0.07
ArgTrp: 0.836 ± 0.036
ArgTyr: 1.538 ± 0.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.765 ± 0.097
SerCys: 0.492 ± 0.025
SerAsp: 3.269 ± 0.061
SerGlu: 3.905 ± 0.073
SerPhe: 2.665 ± 0.059
SerGly: 6.335 ± 0.088
SerHis: 1.153 ± 0.04
SerIle: 3.479 ± 0.06
SerLys: 2.592 ± 0.051
SerLeu: 6.02 ± 0.091
SerMet: 1.543 ± 0.04
SerAsn: 2.383 ± 0.054
SerPro: 3.372 ± 0.067
SerGln: 2.516 ± 0.055
SerArg: 3.52 ± 0.059
SerSer: 4.167 ± 0.084
SerThr: 3.647 ± 0.07
SerVal: 4.673 ± 0.073
SerTrp: 0.958 ± 0.029
SerTyr: 1.623 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.079 ± 0.075
ThrCys: 0.441 ± 0.024
ThrAsp: 3.015 ± 0.061
ThrGlu: 3.248 ± 0.064
ThrPhe: 2.429 ± 0.055
ThrGly: 5.196 ± 0.081
ThrHis: 1.159 ± 0.038
ThrIle: 3.527 ± 0.08
ThrLys: 2.471 ± 0.055
ThrLeu: 5.634 ± 0.09
ThrMet: 1.267 ± 0.042
ThrAsn: 2.162 ± 0.054
ThrPro: 3.465 ± 0.079
ThrGln: 2.198 ± 0.056
ThrArg: 2.675 ± 0.056
ThrSer: 3.509 ± 0.072
ThrThr: 3.631 ± 0.09
ThrVal: 4.285 ± 0.073
ThrTrp: 0.863 ± 0.033
ThrTyr: 1.592 ± 0.043
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.961 ± 0.092
ValCys: 0.749 ± 0.029
ValAsp: 4.208 ± 0.078
ValGlu: 4.668 ± 0.083
ValPhe: 3.006 ± 0.066
ValGly: 6.089 ± 0.101
ValHis: 1.194 ± 0.036
ValIle: 4.2 ± 0.067
ValLys: 3.229 ± 0.071
ValLeu: 6.633 ± 0.106
ValMet: 1.868 ± 0.053
ValAsn: 2.65 ± 0.064
ValPro: 3.435 ± 0.069
ValGln: 2.417 ± 0.051
ValArg: 4.489 ± 0.078
ValSer: 4.649 ± 0.083
ValThr: 4.624 ± 0.08
ValVal: 6.093 ± 0.103
ValTrp: 0.985 ± 0.035
ValTyr: 1.714 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.082 ± 0.035
TrpCys: 0.165 ± 0.015
TrpAsp: 0.78 ± 0.033
TrpGlu: 0.832 ± 0.032
TrpPhe: 0.657 ± 0.03
TrpGly: 1.133 ± 0.041
TrpHis: 0.31 ± 0.018
TrpIle: 0.887 ± 0.036
TrpLys: 0.574 ± 0.023
TrpLeu: 1.461 ± 0.041
TrpMet: 0.474 ± 0.024
TrpAsn: 0.621 ± 0.031
TrpPro: 0.608 ± 0.027
TrpGln: 0.64 ± 0.029
TrpArg: 1.049 ± 0.036
TrpSer: 0.974 ± 0.038
TrpThr: 0.77 ± 0.034
TrpVal: 1.055 ± 0.032
TrpTrp: 0.217 ± 0.016
TrpTyr: 0.362 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.13 ± 0.052
TyrCys: 0.288 ± 0.017
TyrAsp: 1.562 ± 0.054
TyrGlu: 1.527 ± 0.042
TyrPhe: 1.107 ± 0.034
TyrGly: 2.168 ± 0.046
TyrHis: 0.517 ± 0.028
TyrIle: 1.114 ± 0.038
TyrLys: 0.893 ± 0.029
TyrLeu: 2.359 ± 0.055
TyrMet: 0.496 ± 0.021
TyrAsn: 0.921 ± 0.036
TyrPro: 1.213 ± 0.046
TyrGln: 1.051 ± 0.031
TyrArg: 1.812 ± 0.046
TyrSer: 1.853 ± 0.052
TyrThr: 1.372 ± 0.044
TyrVal: 1.778 ± 0.05
TyrTrp: 0.424 ± 0.023
TyrTyr: 0.712 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3008 proteins (893138 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
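
A protein-level bootstrap of this kind could look roughly like the Python sketch below. It resamples whole proteins with replacement, recomputes the dipeptide frequency for each replicate, and reports the spread of the replicate values. Several details are assumptions rather than facts from this page: the use of the sample standard deviation as the error, one-letter residue codes, the fixed random seed, and all function names, which are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences):
    """Dipeptide frequencies in per mille; pairs never span two proteins."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy input (not real proteome data).
toy_proteins = ["MAAL", "AAG", "MKV", "GAAAL"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.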

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski