Amino acid dipeptide frequency for Microvirga vignae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
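The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names are hypothetical, dipeptides are assumed to be counted as overlapping adjacent pairs within each protein, and the over/under call is assumed to use the uniform 1/400 expectation mentioned in the text.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

# Uniform expectation over 400 dipeptides: 2.5 per mille, i.e. 0.25 %
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and any non-standard letter is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency as over- or underrepresented relative to `expected`."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


print(dipeptide_permille(["AAC"]))   # two pairs, AA and AC, 500.0 each
print(classify(14.95))               # AlaAla from the table -> over
print(classify(0.106))               # CysCys from the table -> under
```

With the uniform expectation, AlaAla (14.95‰) lands well above 2.5‰ and CysCys (0.106‰) well below it, which matches the red and blue marking described in the text.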

All values are given in per mille (‰); multiply a value by 10⁻³ to obtain the corresponding fraction.
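As a quick check of the unit, the conversion can be written out as below. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 14.95  # AlaAla value from the table, in per mille
print(f"{permille_to_fraction(ala_ala):.5f}")   # 0.01495
print(f"{permille_to_percent(ala_ala):.3f} %")  # 1.495 %
```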

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.95 ± 0.112
AlaCys: 1.066 ± 0.028
AlaAsp: 5.96 ± 0.061
AlaGlu: 7.322 ± 0.073
AlaPhe: 4.426 ± 0.053
AlaGly: 9.418 ± 0.09
AlaHis: 2.148 ± 0.04
AlaIle: 6.537 ± 0.064
AlaLys: 4.237 ± 0.069
AlaLeu: 12.923 ± 0.104
AlaMet: 3.282 ± 0.046
AlaAsn: 2.775 ± 0.046
AlaPro: 5.072 ± 0.063
AlaGln: 4.12 ± 0.057
AlaArg: 7.882 ± 0.078
AlaSer: 6.48 ± 0.067
AlaThr: 5.611 ± 0.065
AlaVal: 8.568 ± 0.076
AlaTrp: 1.454 ± 0.029
AlaTyr: 2.655 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.879 ± 0.025
CysCys: 0.106 ± 0.009
CysAsp: 0.485 ± 0.016
CysGlu: 0.462 ± 0.017
CysPhe: 0.315 ± 0.014
CysGly: 0.86 ± 0.022
CysHis: 0.266 ± 0.014
CysIle: 0.422 ± 0.017
CysLys: 0.187 ± 0.011
CysLeu: 0.838 ± 0.025
CysMet: 0.166 ± 0.011
CysAsn: 0.219 ± 0.011
CysPro: 0.433 ± 0.017
CysGln: 0.228 ± 0.012
CysArg: 0.661 ± 0.022
CysSer: 0.513 ± 0.018
CysThr: 0.426 ± 0.016
CysVal: 0.587 ± 0.018
CysTrp: 0.095 ± 0.007
CysTyr: 0.197 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.933 ± 0.067
AspCys: 0.423 ± 0.014
AspAsp: 2.686 ± 0.049
AspGlu: 3.654 ± 0.054
AspPhe: 1.981 ± 0.036
AspGly: 4.411 ± 0.065
AspHis: 1.196 ± 0.029
AspIle: 2.913 ± 0.041
AspLys: 1.733 ± 0.037
AspLeu: 6.112 ± 0.067
AspMet: 1.197 ± 0.03
AspAsn: 1.179 ± 0.026
AspPro: 3.514 ± 0.053
AspGln: 1.817 ± 0.033
AspArg: 4.016 ± 0.058
AspSer: 1.971 ± 0.031
AspThr: 2.53 ± 0.047
AspVal: 4.165 ± 0.055
AspTrp: 0.861 ± 0.023
AspTyr: 1.339 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.026 ± 0.089
GluCys: 0.402 ± 0.017
GluAsp: 2.817 ± 0.047
GluGlu: 3.652 ± 0.06
GluPhe: 1.841 ± 0.035
GluGly: 4.428 ± 0.053
GluHis: 1.354 ± 0.032
GluIle: 3.688 ± 0.048
GluLys: 2.382 ± 0.041
GluLeu: 5.506 ± 0.064
GluMet: 1.522 ± 0.028
GluAsn: 1.619 ± 0.035
GluPro: 2.939 ± 0.055
GluGln: 2.223 ± 0.042
GluArg: 5.623 ± 0.066
GluSer: 2.522 ± 0.041
GluThr: 3.573 ± 0.051
GluVal: 4.107 ± 0.051
GluTrp: 0.736 ± 0.023
GluTyr: 0.998 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.224 ± 0.057
PheCys: 0.384 ± 0.016
PheAsp: 2.423 ± 0.039
PheGlu: 2.23 ± 0.039
PhePhe: 1.454 ± 0.035
PheGly: 3.576 ± 0.056
PheHis: 0.752 ± 0.021
PheIle: 1.872 ± 0.036
PheLys: 1.174 ± 0.027
PheLeu: 3.482 ± 0.052
PheMet: 0.848 ± 0.025
PheAsn: 1.011 ± 0.026
PhePro: 1.61 ± 0.033
PheGln: 1.101 ± 0.029
PheArg: 2.36 ± 0.043
PheSer: 2.272 ± 0.044
PheThr: 2.063 ± 0.041
PheVal: 2.909 ± 0.042
PheTrp: 0.571 ± 0.023
PheTyr: 0.896 ± 0.024
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.418 ± 0.082
GlyCys: 0.847 ± 0.022
GlyAsp: 3.937 ± 0.059
GlyGlu: 4.632 ± 0.056
GlyPhe: 3.594 ± 0.049
GlyGly: 6.896 ± 0.081
GlyHis: 1.914 ± 0.035
GlyIle: 4.817 ± 0.065
GlyLys: 3.042 ± 0.049
GlyLeu: 8.971 ± 0.073
GlyMet: 2.149 ± 0.04
GlyAsn: 2.104 ± 0.048
GlyPro: 3.533 ± 0.048
GlyGln: 3.039 ± 0.047
GlyArg: 6.104 ± 0.066
GlySer: 4.961 ± 0.058
GlyThr: 4.574 ± 0.063
GlyVal: 5.917 ± 0.061
GlyTrp: 1.299 ± 0.029
GlyTyr: 2.235 ± 0.037
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.289 ± 0.04
HisCys: 0.228 ± 0.012
HisAsp: 1.176 ± 0.031
HisGlu: 1.217 ± 0.027
HisPhe: 0.862 ± 0.026
HisGly: 1.922 ± 0.035
HisHis: 0.626 ± 0.025
HisIle: 0.995 ± 0.024
HisLys: 0.531 ± 0.018
HisLeu: 2.2 ± 0.04
HisMet: 0.506 ± 0.017
HisAsn: 0.501 ± 0.019
HisPro: 1.427 ± 0.032
HisGln: 0.635 ± 0.02
HisArg: 1.582 ± 0.026
HisSer: 0.951 ± 0.023
HisThr: 0.869 ± 0.021
HisVal: 1.574 ± 0.036
HisTrp: 0.327 ± 0.014
HisTyr: 0.532 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.367 ± 0.071
IleCys: 0.526 ± 0.017
IleAsp: 3.44 ± 0.045
IleGlu: 3.817 ± 0.053
IlePhe: 1.719 ± 0.034
IleGly: 5.182 ± 0.057
IleHis: 1.025 ± 0.025
IleIle: 2.599 ± 0.048
IleLys: 1.644 ± 0.033
IleLeu: 5.119 ± 0.069
IleMet: 1.13 ± 0.025
IleAsn: 1.412 ± 0.032
IlePro: 2.631 ± 0.036
IleGln: 1.458 ± 0.036
IleArg: 3.595 ± 0.048
IleSer: 2.935 ± 0.044
IleThr: 2.89 ± 0.047
IleVal: 4.625 ± 0.055
IleTrp: 0.632 ± 0.021
IleTyr: 1.158 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.605 ± 0.064
LysCys: 0.155 ± 0.009
LysAsp: 1.939 ± 0.038
LysGlu: 1.953 ± 0.041
LysPhe: 0.874 ± 0.024
LysGly: 2.949 ± 0.049
LysHis: 0.644 ± 0.021
LysIle: 1.839 ± 0.035
LysLys: 1.381 ± 0.037
LysLeu: 3.29 ± 0.053
LysMet: 0.735 ± 0.018
LysAsn: 0.953 ± 0.026
LysPro: 2.322 ± 0.039
LysGln: 1.085 ± 0.028
LysArg: 2.587 ± 0.048
LysSer: 1.828 ± 0.04
LysThr: 2.024 ± 0.038
LysVal: 2.606 ± 0.045
LysTrp: 0.346 ± 0.016
LysTyr: 0.577 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.664 ± 0.116
LeuCys: 0.862 ± 0.028
LeuAsp: 5.809 ± 0.063
LeuGlu: 5.522 ± 0.059
LeuPhe: 3.653 ± 0.054
LeuGly: 8.465 ± 0.075
LeuHis: 1.937 ± 0.035
LeuIle: 5.47 ± 0.065
LeuLys: 4.052 ± 0.054
LeuLeu: 9.562 ± 0.103
LeuMet: 2.419 ± 0.038
LeuAsn: 2.731 ± 0.039
LeuPro: 5.547 ± 0.063
LeuGln: 3.084 ± 0.044
LeuArg: 6.923 ± 0.078
LeuSer: 6.875 ± 0.067
LeuThr: 5.69 ± 0.072
LeuVal: 7.81 ± 0.065
LeuTrp: 1.187 ± 0.031
LeuTyr: 2.08 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.016 ± 0.046
MetCys: 0.134 ± 0.009
MetAsp: 1.079 ± 0.028
MetGlu: 1.165 ± 0.031
MetPhe: 0.649 ± 0.023
MetGly: 1.796 ± 0.035
MetHis: 0.453 ± 0.015
MetIle: 1.416 ± 0.034
MetLys: 1.045 ± 0.023
MetLeu: 2.408 ± 0.042
MetMet: 0.651 ± 0.02
MetAsn: 0.813 ± 0.021
MetPro: 1.48 ± 0.032
MetGln: 0.822 ± 0.022
MetArg: 1.89 ± 0.03
MetSer: 1.656 ± 0.028
MetThr: 1.784 ± 0.029
MetVal: 1.623 ± 0.031
MetTrp: 0.179 ± 0.01
MetTyr: 0.32 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.013 ± 0.044
AsnCys: 0.231 ± 0.013
AsnAsp: 1.398 ± 0.04
AsnGlu: 1.471 ± 0.031
AsnPhe: 0.917 ± 0.026
AsnGly: 2.376 ± 0.046
AsnHis: 0.519 ± 0.018
AsnIle: 1.331 ± 0.029
AsnLys: 0.762 ± 0.021
AsnLeu: 2.765 ± 0.042
AsnMet: 0.572 ± 0.018
AsnAsn: 0.715 ± 0.023
AsnPro: 1.922 ± 0.036
AsnGln: 0.789 ± 0.022
AsnArg: 1.85 ± 0.032
AsnSer: 1.24 ± 0.033
AsnThr: 1.263 ± 0.027
AsnVal: 2.032 ± 0.038
AsnTrp: 0.389 ± 0.018
AsnTyr: 0.625 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.576 ± 0.068
ProCys: 0.333 ± 0.015
ProAsp: 3.445 ± 0.046
ProGlu: 3.858 ± 0.053
ProPhe: 2.072 ± 0.036
ProGly: 4.407 ± 0.06
ProHis: 1.207 ± 0.026
ProIle: 2.556 ± 0.04
ProLys: 1.922 ± 0.037
ProLeu: 4.871 ± 0.052
ProMet: 1.178 ± 0.029
ProAsn: 1.472 ± 0.027
ProPro: 2.46 ± 0.044
ProGln: 1.799 ± 0.036
ProArg: 3.096 ± 0.059
ProSer: 3.177 ± 0.043
ProThr: 2.63 ± 0.042
ProVal: 4.253 ± 0.054
ProTrp: 0.703 ± 0.02
ProTyr: 1.229 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.327 ± 0.053
GlnCys: 0.172 ± 0.01
GlnAsp: 1.678 ± 0.03
GlnGlu: 1.901 ± 0.041
GlnPhe: 1.008 ± 0.025
GlnGly: 2.574 ± 0.046
GlnHis: 0.681 ± 0.019
GlnIle: 1.898 ± 0.031
GlnLys: 1.179 ± 0.027
GlnLeu: 2.825 ± 0.049
GlnMet: 0.861 ± 0.023
GlnAsn: 0.925 ± 0.028
GlnPro: 1.849 ± 0.037
GlnGln: 1.31 ± 0.035
GlnArg: 2.602 ± 0.044
GlnSer: 1.827 ± 0.04
GlnThr: 1.798 ± 0.035
GlnVal: 2.524 ± 0.042
GlnTrp: 0.403 ± 0.016
GlnTyr: 0.639 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.414 ± 0.082
ArgCys: 0.548 ± 0.021
ArgAsp: 3.743 ± 0.052
ArgGlu: 4.634 ± 0.063
ArgPhe: 3.052 ± 0.048
ArgGly: 4.703 ± 0.052
ArgHis: 1.757 ± 0.036
ArgIle: 4.411 ± 0.055
ArgLys: 2.403 ± 0.041
ArgLeu: 7.947 ± 0.093
ArgMet: 1.94 ± 0.034
ArgAsn: 1.905 ± 0.031
ArgPro: 3.616 ± 0.062
ArgGln: 2.737 ± 0.041
ArgArg: 5.829 ± 0.075
ArgSer: 4.087 ± 0.055
ArgThr: 3.581 ± 0.046
ArgVal: 4.873 ± 0.055
ArgTrp: 1.037 ± 0.026
ArgTyr: 1.786 ± 0.036
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.015 ± 0.066
SerCys: 0.452 ± 0.016
SerAsp: 2.934 ± 0.047
SerGlu: 3.06 ± 0.046
SerPhe: 2.425 ± 0.047
SerGly: 5.441 ± 0.066
SerHis: 1.138 ± 0.027
SerIle: 3.009 ± 0.044
SerLys: 1.75 ± 0.037
SerLeu: 5.98 ± 0.067
SerMet: 1.387 ± 0.029
SerAsn: 1.438 ± 0.027
SerPro: 2.981 ± 0.044
SerGln: 1.696 ± 0.035
SerArg: 3.95 ± 0.053
SerSer: 3.389 ± 0.057
SerThr: 2.849 ± 0.043
SerVal: 4.059 ± 0.051
SerTrp: 0.781 ± 0.023
SerTyr: 1.383 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.755 ± 0.064
ThrCys: 0.462 ± 0.019
ThrAsp: 2.701 ± 0.045
ThrGlu: 2.779 ± 0.043
ThrPhe: 2.131 ± 0.034
ThrGly: 4.95 ± 0.06
ThrHis: 1.066 ± 0.027
ThrIle: 3.215 ± 0.047
ThrLys: 1.652 ± 0.033
ThrLeu: 5.908 ± 0.067
ThrMet: 1.227 ± 0.027
ThrAsn: 1.349 ± 0.032
ThrPro: 3.157 ± 0.046
ThrGln: 1.499 ± 0.03
ThrArg: 3.384 ± 0.052
ThrSer: 3.005 ± 0.048
ThrThr: 2.935 ± 0.054
ThrVal: 4.292 ± 0.066
ThrTrp: 0.713 ± 0.025
ThrTyr: 1.323 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.673 ± 0.089
ValCys: 0.645 ± 0.021
ValAsp: 3.96 ± 0.054
ValGlu: 4.737 ± 0.052
ValPhe: 2.838 ± 0.04
ValGly: 5.581 ± 0.061
ValHis: 1.462 ± 0.033
ValIle: 4.251 ± 0.054
ValLys: 2.508 ± 0.044
ValLeu: 7.751 ± 0.073
ValMet: 1.835 ± 0.034
ValAsn: 2.015 ± 0.04
ValPro: 3.913 ± 0.049
ValGln: 2.303 ± 0.04
ValArg: 5.099 ± 0.06
ValSer: 4.486 ± 0.059
ValThr: 4.503 ± 0.065
ValVal: 5.998 ± 0.065
ValTrp: 0.935 ± 0.023
ValTyr: 1.525 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.239 ± 0.029
TrpCys: 0.13 ± 0.009
TrpAsp: 0.647 ± 0.022
TrpGlu: 0.585 ± 0.018
TrpPhe: 0.516 ± 0.017
TrpGly: 0.907 ± 0.024
TrpHis: 0.334 ± 0.016
TrpIle: 0.755 ± 0.022
TrpLys: 0.454 ± 0.016
TrpLeu: 1.615 ± 0.036
TrpMet: 0.349 ± 0.014
TrpAsn: 0.45 ± 0.017
TrpPro: 0.68 ± 0.021
TrpGln: 0.522 ± 0.017
TrpArg: 1.144 ± 0.026
TrpSer: 0.818 ± 0.023
TrpThr: 0.753 ± 0.023
TrpVal: 0.842 ± 0.022
TrpTrp: 0.231 ± 0.012
TrpTyr: 0.284 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.502 ± 0.04
TyrCys: 0.242 ± 0.012
TyrAsp: 1.375 ± 0.031
TyrGlu: 1.305 ± 0.029
TyrPhe: 0.883 ± 0.025
TyrGly: 2.141 ± 0.04
TyrHis: 0.472 ± 0.016
TyrIle: 0.912 ± 0.025
TyrLys: 0.656 ± 0.022
TyrLeu: 2.269 ± 0.041
TyrMet: 0.424 ± 0.015
TyrAsn: 0.596 ± 0.017
TyrPro: 1.158 ± 0.03
TyrGln: 0.713 ± 0.021
TyrArg: 1.826 ± 0.033
TyrSer: 1.124 ± 0.028
TyrThr: 1.153 ± 0.031
TyrVal: 1.652 ± 0.031
TyrTrp: 0.379 ± 0.015
TyrTyr: 0.639 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5672 proteins (1617832 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
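A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure used for the table: whole proteins are resampled with replacement, the frequency is recomputed in each of 100 rounds, and the reported error is taken to be the standard deviation across rounds (the page does not say which spread measure is used). The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


mean, err = bootstrap_error(["AACAA", "CCAAC", "ACACA"], "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than single dipeptides keeps pairs from the same sequence together, so the error reflects variation between proteins.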

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski