Amino acid dipeptide frequency for Formosa sp. Hel1_31_208

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
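As an illustration, the counting step might be sketched as follows. This is a minimal sketch, not the database's actual pipeline: the function name dipeptide_frequencies is hypothetical, sequences are assumed to be given in one-letter code, and it is assumed that overlapping pairs are counted within each protein and normalised by the total number of pairs.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping windows of length 2 are counted, and a pair
    never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not real Formosa sequences.
freqs = dipeptide_frequencies(["MKLLA", "MKA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Pooling the counts over the whole proteome, rather than averaging per-protein frequencies, gives long proteins proportionally more weight; which of the two the database uses is not stated here.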

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
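The uniform baseline and the unit conversion can be checked with a few lines of Python. This is a sketch under stated assumptions: the classify helper is hypothetical, and a plain comparison against the uniform expectation is assumed, without taking the reported error into account. The three observed values are taken from the table for this proteome.

```python
N_STANDARD = 20

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
expected_permille = 1000.0 / N_STANDARD ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
observed = {"LeuLeu": 8.461, "CysCys": 0.097, "GluArg": 2.591}

labels = {pair: classify(value) for pair, value in observed.items()}
fractions = {pair: value * 1e-3 for pair, value in observed.items()}

for pair in observed:
    print(pair, labels[pair], fractions[pair])
```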

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.085AlaAla: 4.085 ± 0.096
0.611AlaCys: 0.611 ± 0.044
3.492AlaAsp: 3.492 ± 0.1
3.751AlaGlu: 3.751 ± 0.088
3.609AlaPhe: 3.609 ± 0.069
3.925AlaGly: 3.925 ± 0.09
1.237AlaHis: 1.237 ± 0.046
5.668AlaIle: 5.668 ± 0.09
4.486AlaLys: 4.486 ± 0.093
6.347AlaLeu: 6.347 ± 0.101
1.569AlaMet: 1.569 ± 0.051
3.518AlaAsn: 3.518 ± 0.064
1.951AlaPro: 1.951 ± 0.075
2.526AlaGln: 2.526 ± 0.053
1.937AlaArg: 1.937 ± 0.048
4.234AlaSer: 4.234 ± 0.096
3.915AlaThr: 3.915 ± 0.161
3.894AlaVal: 3.894 ± 0.077
0.603AlaTrp: 0.603 ± 0.028
2.462AlaTyr: 2.462 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.033
0.097CysCys: 0.097 ± 0.011
0.612CysAsp: 0.612 ± 0.08
0.513CysGlu: 0.513 ± 0.044
0.464CysPhe: 0.464 ± 0.026
0.685CysGly: 0.685 ± 0.05
0.188CysHis: 0.188 ± 0.016
0.657CysIle: 0.657 ± 0.024
0.431CysLys: 0.431 ± 0.022
0.627CysLeu: 0.627 ± 0.028
0.146CysMet: 0.146 ± 0.013
0.423CysAsn: 0.423 ± 0.022
0.338CysPro: 0.338 ± 0.026
0.25CysGln: 0.25 ± 0.018
0.179CysArg: 0.179 ± 0.014
0.596CysSer: 0.596 ± 0.036
0.394CysThr: 0.394 ± 0.021
0.457CysVal: 0.457 ± 0.025
0.073CysTrp: 0.073 ± 0.012
0.318CysTyr: 0.318 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
4.078AspAla: 4.078 ± 0.099
0.523AspCys: 0.523 ± 0.051
3.737AspAsp: 3.737 ± 0.087
3.898AspGlu: 3.898 ± 0.068
3.642AspPhe: 3.642 ± 0.06
3.943AspGly: 3.943 ± 0.095
0.971AspHis: 0.971 ± 0.032
5.286AspIle: 5.286 ± 0.087
3.998AspLys: 3.998 ± 0.075
5.492AspLeu: 5.492 ± 0.082
1.327AspMet: 1.327 ± 0.04
3.668AspAsn: 3.668 ± 0.108
1.866AspPro: 1.866 ± 0.071
1.628AspGln: 1.628 ± 0.058
1.821AspArg: 1.821 ± 0.045
3.304AspSer: 3.304 ± 0.064
3.206AspThr: 3.206 ± 0.089
4.071AspVal: 4.071 ± 0.089
0.699AspTrp: 0.699 ± 0.03
2.968AspTyr: 2.968 ± 0.068
0.0AspXaa: 0.0 ± 0.0
Glu
4.46GluAla: 4.46 ± 0.105
0.443GluCys: 0.443 ± 0.049
3.993GluAsp: 3.993 ± 0.079
4.105GluGlu: 4.105 ± 0.091
3.113GluPhe: 3.113 ± 0.065
3.591GluGly: 3.591 ± 0.062
1.292GluHis: 1.292 ± 0.042
5.096GluIle: 5.096 ± 0.076
4.638GluLys: 4.638 ± 0.085
6.155GluLeu: 6.155 ± 0.097
1.446GluMet: 1.446 ± 0.042
4.004GluAsn: 4.004 ± 0.076
1.598GluPro: 1.598 ± 0.041
2.302GluGln: 2.302 ± 0.056
2.591GluArg: 2.591 ± 0.055
3.41GluSer: 3.41 ± 0.078
3.895GluThr: 3.895 ± 0.066
4.127GluVal: 4.127 ± 0.072
0.576GluTrp: 0.576 ± 0.029
2.112GluTyr: 2.112 ± 0.057
0.0GluXaa: 0.0 ± 0.0
Phe
2.936PheAla: 2.936 ± 0.064
0.444PheCys: 0.444 ± 0.022
3.599PheAsp: 3.599 ± 0.077
3.503PheGlu: 3.503 ± 0.071
2.657PhePhe: 2.657 ± 0.072
3.891PheGly: 3.891 ± 0.067
0.847PheHis: 0.847 ± 0.031
4.083PheIle: 4.083 ± 0.073
3.871PheLys: 3.871 ± 0.08
4.544PheLeu: 4.544 ± 0.084
1.152PheMet: 1.152 ± 0.035
3.405PheAsn: 3.405 ± 0.075
1.688PhePro: 1.688 ± 0.042
1.598PheGln: 1.598 ± 0.045
1.611PheArg: 1.611 ± 0.041
4.069PheSer: 4.069 ± 0.062
3.16PheThr: 3.16 ± 0.071
3.143PheVal: 3.143 ± 0.06
0.536PheTrp: 0.536 ± 0.029
2.109PheTyr: 2.109 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
4.025GlyAla: 4.025 ± 0.075
0.624GlyCys: 0.624 ± 0.04
3.697GlyAsp: 3.697 ± 0.083
3.553GlyGlu: 3.553 ± 0.074
3.74GlyPhe: 3.74 ± 0.065
4.466GlyGly: 4.466 ± 0.112
1.198GlyHis: 1.198 ± 0.039
5.206GlyIle: 5.206 ± 0.084
4.299GlyLys: 4.299 ± 0.08
5.729GlyLeu: 5.729 ± 0.093
1.498GlyMet: 1.498 ± 0.051
3.655GlyAsn: 3.655 ± 0.081
1.403GlyPro: 1.403 ± 0.04
2.001GlyGln: 2.001 ± 0.047
2.072GlyArg: 2.072 ± 0.057
3.966GlySer: 3.966 ± 0.091
4.103GlyThr: 4.103 ± 0.12
4.16GlyVal: 4.16 ± 0.079
0.748GlyTrp: 0.748 ± 0.032
2.569GlyTyr: 2.569 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
0.94HisAla: 0.94 ± 0.034
0.207HisCys: 0.207 ± 0.015
0.94HisAsp: 0.94 ± 0.029
1.039HisGlu: 1.039 ± 0.038
1.106HisPhe: 1.106 ± 0.037
1.06HisGly: 1.06 ± 0.041
0.549HisHis: 0.549 ± 0.027
1.656HisIle: 1.656 ± 0.043
1.283HisLys: 1.283 ± 0.039
1.876HisLeu: 1.876 ± 0.048
0.388HisMet: 0.388 ± 0.019
1.017HisAsn: 1.017 ± 0.032
0.847HisPro: 0.847 ± 0.032
0.753HisGln: 0.753 ± 0.03
0.68HisArg: 0.68 ± 0.027
1.052HisSer: 1.052 ± 0.033
0.96HisThr: 0.96 ± 0.031
1.041HisVal: 1.041 ± 0.037
0.222HisTrp: 0.222 ± 0.016
0.903HisTyr: 0.903 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
5.695IleAla: 5.695 ± 0.093
0.654IleCys: 0.654 ± 0.027
5.309IleAsp: 5.309 ± 0.087
5.706IleGlu: 5.706 ± 0.09
3.553IlePhe: 3.553 ± 0.077
5.272IleGly: 5.272 ± 0.098
1.362IleHis: 1.362 ± 0.037
6.502IleIle: 6.502 ± 0.124
5.864IleLys: 5.864 ± 0.099
7.063IleLeu: 7.063 ± 0.116
1.45IleMet: 1.45 ± 0.051
4.997IleAsn: 4.997 ± 0.081
3.181IlePro: 3.181 ± 0.059
2.557IleGln: 2.557 ± 0.053
2.446IleArg: 2.446 ± 0.051
5.827IleSer: 5.827 ± 0.073
5.378IleThr: 5.378 ± 0.163
5.01IleVal: 5.01 ± 0.079
0.676IleTrp: 0.676 ± 0.031
2.844IleTyr: 2.844 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
5.004LysAla: 5.004 ± 0.098
0.319LysCys: 0.319 ± 0.02
4.502LysAsp: 4.502 ± 0.086
4.999LysGlu: 4.999 ± 0.099
2.835LysPhe: 2.835 ± 0.059
4.168LysGly: 4.168 ± 0.081
1.571LysHis: 1.571 ± 0.049
5.241LysIle: 5.241 ± 0.09
5.634LysLys: 5.634 ± 0.104
6.33LysLeu: 6.33 ± 0.1
1.751LysMet: 1.751 ± 0.04
4.384LysAsn: 4.384 ± 0.086
2.229LysPro: 2.229 ± 0.048
2.901LysGln: 2.901 ± 0.075
3.16LysArg: 3.16 ± 0.068
4.537LysSer: 4.537 ± 0.084
4.855LysThr: 4.855 ± 0.092
4.482LysVal: 4.482 ± 0.082
0.653LysTrp: 0.653 ± 0.027
2.742LysTyr: 2.742 ± 0.067
0.0LysXaa: 0.0 ± 0.0
Leu
5.419LeuAla: 5.419 ± 0.095
0.711LeuCys: 0.711 ± 0.027
5.339LeuAsp: 5.339 ± 0.087
5.958LeuGlu: 5.958 ± 0.085
4.859LeuPhe: 4.859 ± 0.087
5.584LeuGly: 5.584 ± 0.087
1.552LeuHis: 1.552 ± 0.045
7.175LeuIle: 7.175 ± 0.117
7.643LeuLys: 7.643 ± 0.124
8.461LeuLeu: 8.461 ± 0.13
2.141LeuMet: 2.141 ± 0.059
5.873LeuAsn: 5.873 ± 0.089
3.366LeuPro: 3.366 ± 0.064
3.166LeuGln: 3.166 ± 0.074
3.219LeuArg: 3.219 ± 0.066
6.574LeuSer: 6.574 ± 0.1
5.296LeuThr: 5.296 ± 0.1
5.571LeuVal: 5.571 ± 0.084
0.846LeuTrp: 0.846 ± 0.036
3.055LeuTyr: 3.055 ± 0.067
0.0LeuXaa: 0.0 ± 0.0
Met
1.649MetAla: 1.649 ± 0.047
0.143MetCys: 0.143 ± 0.011
1.146MetAsp: 1.146 ± 0.034
1.233MetGlu: 1.233 ± 0.043
0.942MetPhe: 0.942 ± 0.036
1.244MetGly: 1.244 ± 0.036
0.444MetHis: 0.444 ± 0.021
1.659MetIle: 1.659 ± 0.045
2.075MetLys: 2.075 ± 0.049
1.982MetLeu: 1.982 ± 0.055
0.612MetMet: 0.612 ± 0.027
1.313MetAsn: 1.313 ± 0.04
0.826MetPro: 0.826 ± 0.033
0.839MetGln: 0.839 ± 0.031
0.93MetArg: 0.93 ± 0.036
1.653MetSer: 1.653 ± 0.043
1.376MetThr: 1.376 ± 0.043
1.339MetVal: 1.339 ± 0.048
0.15MetTrp: 0.15 ± 0.013
0.76MetTyr: 0.76 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
4.055AsnAla: 4.055 ± 0.071
0.554AsnCys: 0.554 ± 0.051
3.711AsnAsp: 3.711 ± 0.074
3.598AsnGlu: 3.598 ± 0.078
3.014AsnPhe: 3.014 ± 0.062
3.974AsnGly: 3.974 ± 0.087
1.045AsnHis: 1.045 ± 0.036
4.753AsnIle: 4.753 ± 0.075
3.959AsnLys: 3.959 ± 0.075
5.155AsnLeu: 5.155 ± 0.084
1.39AsnMet: 1.39 ± 0.038
3.811AsnAsn: 3.811 ± 0.086
2.748AsnPro: 2.748 ± 0.06
2.254AsnGln: 2.254 ± 0.053
2.141AsnArg: 2.141 ± 0.049
3.908AsnSer: 3.908 ± 0.078
4.207AsnThr: 4.207 ± 0.123
3.628AsnVal: 3.628 ± 0.067
0.745AsnTrp: 0.745 ± 0.031
2.877AsnTyr: 2.877 ± 0.061
0.0AsnXaa: 0.0 ± 0.0
Pro
1.827ProAla: 1.827 ± 0.058
0.219ProCys: 0.219 ± 0.017
1.982ProAsp: 1.982 ± 0.05
2.647ProGlu: 2.647 ± 0.058
1.886ProPhe: 1.886 ± 0.045
1.787ProGly: 1.787 ± 0.047
0.583ProHis: 0.583 ± 0.029
2.738ProIle: 2.738 ± 0.052
2.526ProLys: 2.526 ± 0.057
2.826ProLeu: 2.826 ± 0.057
0.707ProMet: 0.707 ± 0.028
2.329ProAsn: 2.329 ± 0.049
0.844ProPro: 0.844 ± 0.043
1.174ProGln: 1.174 ± 0.036
0.935ProArg: 0.935 ± 0.03
2.232ProSer: 2.232 ± 0.058
2.077ProThr: 2.077 ± 0.061
2.194ProVal: 2.194 ± 0.054
0.326ProTrp: 0.326 ± 0.02
1.367ProTyr: 1.367 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
1.992GlnAla: 1.992 ± 0.052
0.193GlnCys: 0.193 ± 0.017
1.982GlnAsp: 1.982 ± 0.049
2.169GlnGlu: 2.169 ± 0.063
1.813GlnPhe: 1.813 ± 0.046
1.738GlnGly: 1.738 ± 0.049
0.712GlnHis: 0.712 ± 0.026
2.787GlnIle: 2.787 ± 0.054
2.539GlnLys: 2.539 ± 0.066
3.711GlnLeu: 3.711 ± 0.077
0.846GlnMet: 0.846 ± 0.033
2.273GlnAsn: 2.273 ± 0.057
1.188GlnPro: 1.188 ± 0.032
1.465GlnGln: 1.465 ± 0.042
1.308GlnArg: 1.308 ± 0.039
2.081GlnSer: 2.081 ± 0.048
2.124GlnThr: 2.124 ± 0.052
2.011GlnVal: 2.011 ± 0.052
0.383GlnTrp: 0.383 ± 0.021
1.325GlnTyr: 1.325 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
2.169ArgAla: 2.169 ± 0.051
0.182ArgCys: 0.182 ± 0.014
1.931ArgAsp: 1.931 ± 0.054
1.974ArgGlu: 1.974 ± 0.047
1.979ArgPhe: 1.979 ± 0.049
1.951ArgGly: 1.951 ± 0.052
0.688ArgHis: 0.688 ± 0.029
2.896ArgIle: 2.896 ± 0.062
2.437ArgLys: 2.437 ± 0.059
3.38ArgLeu: 3.38 ± 0.071
0.852ArgMet: 0.852 ± 0.03
2.0ArgAsn: 2.0 ± 0.052
1.088ArgPro: 1.088 ± 0.038
1.284ArgGln: 1.284 ± 0.041
1.41ArgArg: 1.41 ± 0.04
1.911ArgSer: 1.911 ± 0.051
1.956ArgThr: 1.956 ± 0.063
2.134ArgVal: 2.134 ± 0.051
0.371ArgTrp: 0.371 ± 0.02
1.547ArgTyr: 1.547 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
3.944SerAla: 3.944 ± 0.064
0.64SerCys: 0.64 ± 0.03
3.78SerAsp: 3.78 ± 0.077
4.242SerGlu: 4.242 ± 0.067
3.717SerPhe: 3.717 ± 0.066
4.607SerGly: 4.607 ± 0.089
1.102SerHis: 1.102 ± 0.034
5.352SerIle: 5.352 ± 0.106
4.879SerLys: 4.879 ± 0.093
5.797SerLeu: 5.797 ± 0.087
1.332SerMet: 1.332 ± 0.04
4.028SerAsn: 4.028 ± 0.076
2.011SerPro: 2.011 ± 0.055
2.261SerGln: 2.261 ± 0.049
2.041SerArg: 2.041 ± 0.049
4.175SerSer: 4.175 ± 0.082
3.679SerThr: 3.679 ± 0.084
4.163SerVal: 4.163 ± 0.079
0.727SerTrp: 0.727 ± 0.033
2.754SerTyr: 2.754 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
4.023ThrAla: 4.023 ± 0.165
0.465ThrCys: 0.465 ± 0.034
3.561ThrAsp: 3.561 ± 0.162
3.484ThrGlu: 3.484 ± 0.065
3.478ThrPhe: 3.478 ± 0.079
4.1ThrGly: 4.1 ± 0.107
1.129ThrHis: 1.129 ± 0.036
5.537ThrIle: 5.537 ± 0.12
3.872ThrLys: 3.872 ± 0.067
5.667ThrLeu: 5.667 ± 0.088
1.101ThrMet: 1.101 ± 0.04
3.742ThrAsn: 3.742 ± 0.113
2.343ThrPro: 2.343 ± 0.061
1.938ThrGln: 1.938 ± 0.048
1.721ThrArg: 1.721 ± 0.058
4.1ThrSer: 4.1 ± 0.07
3.974ThrThr: 3.974 ± 0.11
4.193ThrVal: 4.193 ± 0.097
0.713ThrTrp: 0.713 ± 0.054
2.593ThrTyr: 2.593 ± 0.07
0.0ThrXaa: 0.0 ± 0.0
Val
4.137ValAla: 4.137 ± 0.069
0.511ValCys: 0.511 ± 0.025
3.76ValAsp: 3.76 ± 0.071
3.76ValGlu: 3.76 ± 0.075
3.502ValPhe: 3.502 ± 0.067
3.722ValGly: 3.722 ± 0.068
0.984ValHis: 0.984 ± 0.034
5.4ValIle: 5.4 ± 0.073
4.243ValLys: 4.243 ± 0.082
6.082ValLeu: 6.082 ± 0.093
1.484ValMet: 1.484 ± 0.045
3.652ValAsn: 3.652 ± 0.069
2.126ValPro: 2.126 ± 0.047
1.809ValGln: 1.809 ± 0.053
1.962ValArg: 1.962 ± 0.058
4.413ValSer: 4.413 ± 0.076
4.03ValThr: 4.03 ± 0.1
4.16ValVal: 4.16 ± 0.079
0.575ValTrp: 0.575 ± 0.028
2.372ValTyr: 2.372 ± 0.048
0.0ValXaa: 0.0 ± 0.0
Trp
0.543TrpAla: 0.543 ± 0.024
0.089TrpCys: 0.089 ± 0.01
0.606TrpAsp: 0.606 ± 0.026
0.611TrpGlu: 0.611 ± 0.027
0.574TrpPhe: 0.574 ± 0.024
0.552TrpGly: 0.552 ± 0.029
0.22TrpHis: 0.22 ± 0.017
0.78TrpIle: 0.78 ± 0.033
0.703TrpLys: 0.703 ± 0.032
0.986TrpLeu: 0.986 ± 0.035
0.306TrpMet: 0.306 ± 0.018
0.699TrpAsn: 0.699 ± 0.028
0.206TrpPro: 0.206 ± 0.016
0.416TrpGln: 0.416 ± 0.024
0.38TrpArg: 0.38 ± 0.02
0.689TrpSer: 0.689 ± 0.031
0.662TrpThr: 0.662 ± 0.045
0.6TrpVal: 0.6 ± 0.029
0.154TrpTrp: 0.154 ± 0.013
0.442TrpTyr: 0.442 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.327TyrAla: 2.327 ± 0.057
0.333TyrCys: 0.333 ± 0.021
2.449TyrAsp: 2.449 ± 0.058
2.206TyrGlu: 2.206 ± 0.051
2.361TyrPhe: 2.361 ± 0.054
2.45TyrGly: 2.45 ± 0.057
0.844TyrHis: 0.844 ± 0.032
2.875TyrIle: 2.875 ± 0.063
2.915TyrLys: 2.915 ± 0.064
3.601TyrLeu: 3.601 ± 0.073
0.817TyrMet: 0.817 ± 0.034
2.737TyrAsn: 2.737 ± 0.064
1.361TyrPro: 1.361 ± 0.035
1.518TyrGln: 1.518 ± 0.051
1.592TyrArg: 1.592 ± 0.043
2.499TyrSer: 2.499 ± 0.055
2.486TyrThr: 2.486 ± 0.067
2.304TyrVal: 2.304 ± 0.055
0.442TyrTrp: 0.442 ± 0.023
1.799TyrTyr: 1.799 ± 0.055
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2771 proteins (943956 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
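A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on several assumptions that are not confirmed by this page: the function names are hypothetical, sequences are in one-letter code, whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the reported error is taken to be the standard deviation across the 100 replicates.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input for illustration only, not real Formosa sequences.
toy = ["MKLLA", "MKA", "LLLL", "AKMK"]
print(bootstrap_error(toy, "LL"))
```

Resampling whole proteins keeps each protein's internal composition intact, so the estimate reflects variation between proteins rather than between individual positions.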

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski