Amino acid dipeptide frequency for Boseongicola sp. HY14

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. the frequencies of pairs of adjacent residues.
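
As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the mapping of any non-standard letter to Xaa, and the choice of overlapping pairs are assumptions made for illustration; the page does not state the exact procedure used by the database.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    Assumption: pairs overlap (residues i and i+1 for every i), pairs never
    span two proteins, and any non-standard letter is mapped to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MAAL", "AAG"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The two toy sequences contain five adjacent pairs in total, two of which are AlaAla.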

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
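
The uniform expectation and the over/under comparison can be sketched as follows. The three example values are taken from the table on this page; the `classify` helper and the use of the uniform 0.25% value as the only threshold are assumptions for illustration, since the page does not say exactly how the colour cut-off is applied.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Expected frequency if all 400 dipeptides were equally likely, in per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 18.353, "CysCys": 0.085, "ProPro": 2.542}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_permille, labels)
```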

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
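
A short sketch of the unit conversion, using the AlaAla value from the table as an example. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 18.353  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala), permille_to_percent(ala_ala))
```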

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
18.353AlaAla: 18.353 ± 0.197
1.023AlaCys: 1.023 ± 0.034
7.525AlaAsp: 7.525 ± 0.093
8.837AlaGlu: 8.837 ± 0.122
4.427AlaPhe: 4.427 ± 0.082
11.689AlaGly: 11.689 ± 0.133
2.448AlaHis: 2.448 ± 0.054
6.017AlaIle: 6.017 ± 0.075
3.565AlaLys: 3.565 ± 0.086
14.392AlaLeu: 14.392 ± 0.165
4.078AlaMet: 4.078 ± 0.063
2.679AlaAsn: 2.679 ± 0.054
6.444AlaPro: 6.444 ± 0.117
3.772AlaGln: 3.772 ± 0.063
10.691AlaArg: 10.691 ± 0.135
5.395AlaSer: 5.395 ± 0.084
6.372AlaThr: 6.372 ± 0.09
8.857AlaVal: 8.857 ± 0.103
1.565AlaTrp: 1.565 ± 0.04
2.485AlaTyr: 2.485 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.968CysAla: 0.968 ± 0.032
0.085CysCys: 0.085 ± 0.009
0.586CysAsp: 0.586 ± 0.021
0.423CysGlu: 0.423 ± 0.018
0.289CysPhe: 0.289 ± 0.016
0.873CysGly: 0.873 ± 0.034
0.235CysHis: 0.235 ± 0.014
0.334CysIle: 0.334 ± 0.018
0.178CysLys: 0.178 ± 0.014
0.703CysLeu: 0.703 ± 0.028
0.14CysMet: 0.14 ± 0.012
0.21CysAsn: 0.21 ± 0.014
0.502CysPro: 0.502 ± 0.025
0.206CysGln: 0.206 ± 0.013
0.515CysArg: 0.515 ± 0.021
0.379CysSer: 0.379 ± 0.02
0.425CysThr: 0.425 ± 0.022
0.527CysVal: 0.527 ± 0.024
0.101CysTrp: 0.101 ± 0.009
0.193CysTyr: 0.193 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
7.582AspAla: 7.582 ± 0.091
0.491AspCys: 0.491 ± 0.023
3.935AspAsp: 3.935 ± 0.074
4.154AspGlu: 4.154 ± 0.074
2.363AspPhe: 2.363 ± 0.047
5.772AspGly: 5.772 ± 0.09
1.423AspHis: 1.423 ± 0.042
3.135AspIle: 3.135 ± 0.057
1.763AspLys: 1.763 ± 0.043
6.656AspLeu: 6.656 ± 0.091
1.732AspMet: 1.732 ± 0.04
1.298AspAsn: 1.298 ± 0.038
3.98AspPro: 3.98 ± 0.074
1.639AspGln: 1.639 ± 0.038
4.794AspArg: 4.794 ± 0.083
2.109AspSer: 2.109 ± 0.049
3.056AspThr: 3.056 ± 0.059
4.094AspVal: 4.094 ± 0.059
1.287AspTrp: 1.287 ± 0.033
1.546AspTyr: 1.546 ± 0.037
0.0AspXaa: 0.0 ± 0.0
Glu
9.111GluAla: 9.111 ± 0.132
0.35GluCys: 0.35 ± 0.02
3.56GluAsp: 3.56 ± 0.078
3.193GluGlu: 3.193 ± 0.07
1.822GluPhe: 1.822 ± 0.045
5.065GluGly: 5.065 ± 0.078
1.227GluHis: 1.227 ± 0.038
3.717GluIle: 3.717 ± 0.061
1.938GluLys: 1.938 ± 0.05
5.046GluLeu: 5.046 ± 0.072
1.815GluMet: 1.815 ± 0.039
1.531GluAsn: 1.531 ± 0.038
2.665GluPro: 2.665 ± 0.051
1.678GluGln: 1.678 ± 0.043
4.494GluArg: 4.494 ± 0.076
1.958GluSer: 1.958 ± 0.05
3.817GluThr: 3.817 ± 0.063
4.89GluVal: 4.89 ± 0.068
0.675GluTrp: 0.675 ± 0.023
0.993GluTyr: 0.993 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
4.656PheAla: 4.656 ± 0.072
0.379PheCys: 0.379 ± 0.02
2.914PheAsp: 2.914 ± 0.047
2.183PheGlu: 2.183 ± 0.04
1.412PhePhe: 1.412 ± 0.041
3.719PheGly: 3.719 ± 0.071
0.798PheHis: 0.798 ± 0.026
1.588PheIle: 1.588 ± 0.041
0.829PheLys: 0.829 ± 0.029
3.559PheLeu: 3.559 ± 0.069
0.833PheMet: 0.833 ± 0.032
0.981PheAsn: 0.981 ± 0.034
1.591PhePro: 1.591 ± 0.036
0.926PheGln: 0.926 ± 0.03
2.24PheArg: 2.24 ± 0.046
2.076PheSer: 2.076 ± 0.046
2.119PheThr: 2.119 ± 0.047
2.63PheVal: 2.63 ± 0.051
0.594PheTrp: 0.594 ± 0.027
0.867PheTyr: 0.867 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
10.797GlyAla: 10.797 ± 0.126
0.841GlyCys: 0.841 ± 0.033
5.17GlyAsp: 5.17 ± 0.085
4.991GlyGlu: 4.991 ± 0.072
3.824GlyPhe: 3.824 ± 0.064
7.904GlyGly: 7.904 ± 0.135
2.067GlyHis: 2.067 ± 0.052
4.444GlyIle: 4.444 ± 0.058
3.192GlyLys: 3.192 ± 0.07
9.486GlyLeu: 9.486 ± 0.112
2.636GlyMet: 2.636 ± 0.058
2.003GlyAsn: 2.003 ± 0.05
4.092GlyPro: 4.092 ± 0.068
3.018GlyGln: 3.018 ± 0.051
6.273GlyArg: 6.273 ± 0.07
3.958GlySer: 3.958 ± 0.071
4.566GlyThr: 4.566 ± 0.068
6.722GlyVal: 6.722 ± 0.089
1.654GlyTrp: 1.654 ± 0.047
2.281GlyTyr: 2.281 ± 0.048
0.0GlyXaa: 0.0 ± 0.0
His
2.402HisAla: 2.402 ± 0.055
0.194HisCys: 0.194 ± 0.014
1.417HisAsp: 1.417 ± 0.033
1.129HisGlu: 1.129 ± 0.038
0.82HisPhe: 0.82 ± 0.031
2.028HisGly: 2.028 ± 0.046
0.588HisHis: 0.588 ± 0.032
0.926HisIle: 0.926 ± 0.033
0.516HisLys: 0.516 ± 0.023
2.12HisLeu: 2.12 ± 0.045
0.519HisMet: 0.519 ± 0.022
0.409HisAsn: 0.409 ± 0.02
1.465HisPro: 1.465 ± 0.041
0.527HisGln: 0.527 ± 0.021
1.446HisArg: 1.446 ± 0.04
0.85HisSer: 0.85 ± 0.026
0.805HisThr: 0.805 ± 0.029
1.609HisVal: 1.609 ± 0.037
0.343HisTrp: 0.343 ± 0.018
0.549HisTyr: 0.549 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
7.393IleAla: 7.393 ± 0.095
0.551IleCys: 0.551 ± 0.021
3.565IleAsp: 3.565 ± 0.061
3.791IleGlu: 3.791 ± 0.058
1.601IlePhe: 1.601 ± 0.047
4.965IleGly: 4.965 ± 0.074
0.917IleHis: 0.917 ± 0.029
1.97IleIle: 1.97 ± 0.047
1.251IleLys: 1.251 ± 0.034
4.592IleLeu: 4.592 ± 0.079
1.053IleMet: 1.053 ± 0.034
1.203IleAsn: 1.203 ± 0.038
2.151IlePro: 2.151 ± 0.047
1.036IleGln: 1.036 ± 0.031
3.362IleArg: 3.362 ± 0.052
2.595IleSer: 2.595 ± 0.052
2.712IleThr: 2.712 ± 0.051
3.915IleVal: 3.915 ± 0.073
0.696IleTrp: 0.696 ± 0.024
1.104IleTyr: 1.104 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
3.699LysAla: 3.699 ± 0.068
0.159LysCys: 0.159 ± 0.014
1.567LysAsp: 1.567 ± 0.048
1.276LysGlu: 1.276 ± 0.046
0.804LysPhe: 0.804 ± 0.029
2.541LysGly: 2.541 ± 0.06
0.601LysHis: 0.601 ± 0.024
1.461LysIle: 1.461 ± 0.044
1.1LysLys: 1.1 ± 0.04
2.808LysLeu: 2.808 ± 0.054
0.754LysMet: 0.754 ± 0.027
0.675LysAsn: 0.675 ± 0.026
1.804LysPro: 1.804 ± 0.049
0.803LysGln: 0.803 ± 0.03
2.119LysArg: 2.119 ± 0.053
1.631LysSer: 1.631 ± 0.044
1.795LysThr: 1.795 ± 0.048
2.289LysVal: 2.289 ± 0.053
0.361LysTrp: 0.361 ± 0.019
0.598LysTyr: 0.598 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
14.249LeuAla: 14.249 ± 0.163
0.713LeuCys: 0.713 ± 0.027
6.401LeuAsp: 6.401 ± 0.083
5.231LeuGlu: 5.231 ± 0.074
3.637LeuPhe: 3.637 ± 0.081
8.849LeuGly: 8.849 ± 0.105
1.813LeuHis: 1.813 ± 0.045
5.312LeuIle: 5.312 ± 0.087
2.902LeuLys: 2.902 ± 0.054
7.846LeuLeu: 7.846 ± 0.126
2.75LeuMet: 2.75 ± 0.048
2.374LeuAsn: 2.374 ± 0.053
5.562LeuPro: 5.562 ± 0.081
2.068LeuGln: 2.068 ± 0.047
6.812LeuArg: 6.812 ± 0.096
5.865LeuSer: 5.865 ± 0.084
5.918LeuThr: 5.918 ± 0.086
7.543LeuVal: 7.543 ± 0.103
1.255LeuTrp: 1.255 ± 0.039
1.887LeuTyr: 1.887 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
3.618MetAla: 3.618 ± 0.059
0.171MetCys: 0.171 ± 0.013
1.421MetAsp: 1.421 ± 0.042
1.239MetGlu: 1.239 ± 0.033
0.878MetPhe: 0.878 ± 0.028
2.248MetGly: 2.248 ± 0.054
0.452MetHis: 0.452 ± 0.021
1.546MetIle: 1.546 ± 0.043
0.957MetLys: 0.957 ± 0.033
2.645MetLeu: 2.645 ± 0.053
0.759MetMet: 0.759 ± 0.026
0.798MetAsn: 0.798 ± 0.028
1.566MetPro: 1.566 ± 0.043
0.936MetGln: 0.936 ± 0.03
2.07MetArg: 2.07 ± 0.048
1.692MetSer: 1.692 ± 0.036
2.132MetThr: 2.132 ± 0.046
1.978MetVal: 1.978 ± 0.046
0.242MetTrp: 0.242 ± 0.015
0.314MetTyr: 0.314 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
2.934AsnAla: 2.934 ± 0.048
0.218AsnCys: 0.218 ± 0.014
1.284AsnAsp: 1.284 ± 0.042
1.1AsnGlu: 1.1 ± 0.032
0.881AsnPhe: 0.881 ± 0.029
2.033AsnGly: 2.033 ± 0.046
0.502AsnHis: 0.502 ± 0.022
1.197AsnIle: 1.197 ± 0.037
0.58AsnLys: 0.58 ± 0.025
2.376AsnLeu: 2.376 ± 0.053
0.666AsnMet: 0.666 ± 0.024
0.539AsnAsn: 0.539 ± 0.024
1.818AsnPro: 1.818 ± 0.04
0.587AsnGln: 0.587 ± 0.026
1.692AsnArg: 1.692 ± 0.042
1.004AsnSer: 1.004 ± 0.032
1.169AsnThr: 1.169 ± 0.036
1.691AsnVal: 1.691 ± 0.047
0.362AsnTrp: 0.362 ± 0.02
0.601AsnTyr: 0.601 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
6.342ProAla: 6.342 ± 0.094
0.35ProCys: 0.35 ± 0.02
4.543ProAsp: 4.543 ± 0.081
4.544ProGlu: 4.544 ± 0.069
1.885ProPhe: 1.885 ± 0.042
5.312ProGly: 5.312 ± 0.079
1.064ProHis: 1.064 ± 0.035
2.185ProIle: 2.185 ± 0.049
1.552ProLys: 1.552 ± 0.051
4.459ProLeu: 4.459 ± 0.075
1.346ProMet: 1.346 ± 0.04
1.173ProAsn: 1.173 ± 0.038
2.542ProPro: 2.542 ± 0.066
1.354ProGln: 1.354 ± 0.04
3.16ProArg: 3.16 ± 0.056
2.274ProSer: 2.274 ± 0.049
2.456ProThr: 2.456 ± 0.05
4.574ProVal: 4.574 ± 0.071
0.637ProTrp: 0.637 ± 0.025
1.154ProTyr: 1.154 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
3.608GlnAla: 3.608 ± 0.063
0.171GlnCys: 0.171 ± 0.013
1.473GlnAsp: 1.473 ± 0.039
1.304GlnGlu: 1.304 ± 0.032
0.974GlnPhe: 0.974 ± 0.029
2.324GlnGly: 2.324 ± 0.049
0.532GlnHis: 0.532 ± 0.022
1.782GlnIle: 1.782 ± 0.041
0.925GlnLys: 0.925 ± 0.031
2.279GlnLeu: 2.279 ± 0.052
0.918GlnMet: 0.918 ± 0.027
0.75GlnAsn: 0.75 ± 0.026
1.509GlnPro: 1.509 ± 0.036
0.799GlnGln: 0.799 ± 0.031
1.967GlnArg: 1.967 ± 0.043
1.439GlnSer: 1.439 ± 0.04
1.477GlnThr: 1.477 ± 0.04
2.204GlnVal: 2.204 ± 0.047
0.345GlnTrp: 0.345 ± 0.018
0.516GlnTyr: 0.516 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
9.598ArgAla: 9.598 ± 0.128
0.44ArgCys: 0.44 ± 0.022
4.778ArgAsp: 4.778 ± 0.064
4.079ArgGlu: 4.079 ± 0.068
2.817ArgPhe: 2.817 ± 0.055
5.118ArgGly: 5.118 ± 0.072
1.749ArgHis: 1.749 ± 0.042
4.128ArgIle: 4.128 ± 0.064
2.193ArgLys: 2.193 ± 0.05
7.988ArgLeu: 7.988 ± 0.108
2.095ArgMet: 2.095 ± 0.047
1.604ArgAsn: 1.604 ± 0.046
3.63ArgPro: 3.63 ± 0.061
2.189ArgGln: 2.189 ± 0.056
5.628ArgArg: 5.628 ± 0.089
2.969ArgSer: 2.969 ± 0.058
3.119ArgThr: 3.119 ± 0.056
5.18ArgVal: 5.18 ± 0.073
0.975ArgTrp: 0.975 ± 0.031
1.537ArgTyr: 1.537 ± 0.043
0.0ArgXaa: 0.0 ± 0.0
Ser
5.39SerAla: 5.39 ± 0.081
0.39SerCys: 0.39 ± 0.018
2.952SerAsp: 2.952 ± 0.053
2.484SerGlu: 2.484 ± 0.05
2.082SerPhe: 2.082 ± 0.046
5.046SerGly: 5.046 ± 0.064
0.999SerHis: 0.999 ± 0.028
2.15SerIle: 2.15 ± 0.052
1.233SerLys: 1.233 ± 0.041
4.608SerLeu: 4.608 ± 0.073
1.211SerMet: 1.211 ± 0.037
1.072SerAsn: 1.072 ± 0.033
2.487SerPro: 2.487 ± 0.05
1.332SerGln: 1.332 ± 0.034
3.229SerArg: 3.229 ± 0.06
2.167SerSer: 2.167 ± 0.053
2.28SerThr: 2.28 ± 0.051
3.59SerVal: 3.59 ± 0.071
0.627SerTrp: 0.627 ± 0.026
1.207SerTyr: 1.207 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
6.211ThrAla: 6.211 ± 0.081
0.472ThrCys: 0.472 ± 0.02
3.203ThrAsp: 3.203 ± 0.052
3.022ThrGlu: 3.022 ± 0.049
1.969ThrPhe: 1.969 ± 0.046
5.747ThrGly: 5.747 ± 0.074
1.022ThrHis: 1.022 ± 0.031
2.638ThrIle: 2.638 ± 0.054
1.251ThrLys: 1.251 ± 0.039
5.95ThrLeu: 5.95 ± 0.073
1.304ThrMet: 1.304 ± 0.039
1.102ThrAsn: 1.102 ± 0.033
3.514ThrPro: 3.514 ± 0.055
1.312ThrGln: 1.312 ± 0.034
3.926ThrArg: 3.926 ± 0.059
2.393ThrSer: 2.393 ± 0.052
2.822ThrThr: 2.822 ± 0.063
3.943ThrVal: 3.943 ± 0.067
0.713ThrTrp: 0.713 ± 0.03
1.175ThrTyr: 1.175 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
9.792ValAla: 9.792 ± 0.096
0.52ValCys: 0.52 ± 0.023
4.224ValAsp: 4.224 ± 0.071
4.714ValGlu: 4.714 ± 0.068
2.998ValPhe: 2.998 ± 0.051
5.575ValGly: 5.575 ± 0.084
1.392ValHis: 1.392 ± 0.042
4.368ValIle: 4.368 ± 0.067
2.01ValLys: 2.01 ± 0.053
7.596ValLeu: 7.596 ± 0.101
2.169ValMet: 2.169 ± 0.042
1.924ValAsn: 1.924 ± 0.05
3.813ValPro: 3.813 ± 0.065
1.872ValGln: 1.872 ± 0.039
4.589ValArg: 4.589 ± 0.073
3.976ValSer: 3.976 ± 0.064
4.84ValThr: 4.84 ± 0.067
6.34ValVal: 6.34 ± 0.098
0.89ValTrp: 0.89 ± 0.029
1.475ValTyr: 1.475 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
1.486TrpAla: 1.486 ± 0.037
0.13TrpCys: 0.13 ± 0.01
0.762TrpAsp: 0.762 ± 0.024
0.615TrpGlu: 0.615 ± 0.024
0.551TrpPhe: 0.551 ± 0.025
1.046TrpGly: 1.046 ± 0.034
0.335TrpHis: 0.335 ± 0.019
0.685TrpIle: 0.685 ± 0.025
0.39TrpLys: 0.39 ± 0.021
1.686TrpLeu: 1.686 ± 0.038
0.414TrpMet: 0.414 ± 0.02
0.384TrpAsn: 0.384 ± 0.017
0.721TrpPro: 0.721 ± 0.026
0.613TrpGln: 0.613 ± 0.025
1.172TrpArg: 1.172 ± 0.034
0.731TrpSer: 0.731 ± 0.029
0.754TrpThr: 0.754 ± 0.028
0.944TrpVal: 0.944 ± 0.032
0.237TrpTrp: 0.237 ± 0.015
0.265TrpTyr: 0.265 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.423TyrAla: 2.423 ± 0.052
0.225TyrCys: 0.225 ± 0.014
1.531TyrAsp: 1.531 ± 0.037
1.237TyrGlu: 1.237 ± 0.041
0.861TyrPhe: 0.861 ± 0.026
2.064TyrGly: 2.064 ± 0.048
0.5TyrHis: 0.5 ± 0.021
0.842TyrIle: 0.842 ± 0.025
0.516TyrLys: 0.516 ± 0.022
2.194TyrLeu: 2.194 ± 0.046
0.468TyrMet: 0.468 ± 0.02
0.524TyrAsn: 0.524 ± 0.024
1.051TyrPro: 1.051 ± 0.031
0.63TyrGln: 0.63 ± 0.028
1.637TyrArg: 1.637 ± 0.038
1.075TyrSer: 1.075 ± 0.029
1.053TyrThr: 1.053 ± 0.035
1.558TyrVal: 1.558 ± 0.04
0.36TyrTrp: 0.36 ± 0.02
0.548TyrTyr: 0.548 ± 0.023
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3383 proteins (1054027 amino acids)
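
Each table entry above has the form value, dipeptide name, colon, value ± error (for example `18.353AlaAla: 18.353 ± 0.197`), with a bare three-letter line starting each row. A minimal parsing sketch under that assumption is shown below; the regular expression and function names are illustrative and not part of the database's tooling.

```python
import re

# value + first residue + second residue + ": " + mean + " ± " + error
CELL = re.compile(
    r"^(\d+(?:\.\d+)?)([A-Z][a-z]{2})([A-Z][a-z]{2}): "
    r"(\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None
    _, first, second, mean, err = match.groups()
    return first, second, float(mean), float(err)


def parse_table(lines):
    """Build {(first, second): (mean, error)} from the flattened table."""
    table = {}
    for line in lines:
        cell = parse_cell(line)
        if cell is not None:
            first, second, mean, err = cell
            table[(first, second)] = (mean, err)
    return table


sample = [
    "Ala",
    "18.353AlaAla: 18.353 ± 0.197",
    "1.023AlaCys: 1.023 ± 0.034",
    "0.0AlaXaa: 0.0 ± 0.0",
]
table = parse_table(sample)
print(table[("Ala", "Ala")])
```

Row-label lines such as `Ala` do not match the pattern and are skipped, because every cell already carries both residue names.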

Note: The error was estimated by bootstrapping (×100) at the protein level.
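
A minimal sketch of what bootstrapping at the protein level could look like: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Using the standard deviation as the error, the function names, and the toy sequences are assumptions; the page only states that 100 bootstrap rounds were used at the protein level.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. 'AA')."""
    counts = Counter()
    for seq in proteins:
        counts.update(zip(seq, seq[1:]))  # overlapping adjacent pairs
    total = sum(counts.values())
    return 1000.0 * counts[tuple(pair)] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample proteins with replacement; return (mean, standard deviation)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return mean(estimates), stdev(estimates)


# Toy proteome (hypothetical sequences, not taken from the database).
toy = ["MAAL", "AAGA", "MKV", "AAAA"]
estimate, error = bootstrap_error(toy, "AA")
print(f"AlaAla: {estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.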

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski