Amino acid dipepetide frequency for Schistosoma margrebowiei

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.834AlaAla: 3.834 ± 0.041
0.987AlaCys: 0.987 ± 0.013
2.578AlaAsp: 2.578 ± 0.023
2.869AlaGlu: 2.869 ± 0.027
2.099AlaPhe: 2.099 ± 0.021
2.013AlaGly: 2.013 ± 0.024
0.971AlaHis: 0.971 ± 0.015
2.731AlaIle: 2.731 ± 0.022
2.349AlaLys: 2.349 ± 0.023
4.683AlaLeu: 4.683 ± 0.035
0.868AlaMet: 0.868 ± 0.012
2.341AlaAsn: 2.341 ± 0.019
1.852AlaPro: 1.852 ± 0.021
1.708AlaGln: 1.708 ± 0.019
2.612AlaArg: 2.612 ± 0.025
4.149AlaSer: 4.149 ± 0.032
2.694AlaThr: 2.694 ± 0.026
2.971AlaVal: 2.971 ± 0.023
0.424AlaTrp: 0.424 ± 0.009
1.445AlaTyr: 1.445 ± 0.016
0.002AlaXaa: 0.002 ± 0.001
Cys
0.864CysAla: 0.864 ± 0.014
0.489CysCys: 0.489 ± 0.012
0.997CysAsp: 0.997 ± 0.017
1.022CysGlu: 1.022 ± 0.016
0.841CysPhe: 0.841 ± 0.013
0.946CysGly: 0.946 ± 0.014
0.598CysHis: 0.598 ± 0.013
1.536CysIle: 1.536 ± 0.016
1.076CysLys: 1.076 ± 0.015
2.33CysLeu: 2.33 ± 0.022
0.402CysMet: 0.402 ± 0.008
1.153CysAsn: 1.153 ± 0.014
0.946CysPro: 0.946 ± 0.016
0.908CysGln: 0.908 ± 0.013
1.006CysArg: 1.006 ± 0.014
2.005CysSer: 2.005 ± 0.021
1.293CysThr: 1.293 ± 0.014
0.99CysVal: 0.99 ± 0.014
0.224CysTrp: 0.224 ± 0.006
0.698CysTyr: 0.698 ± 0.01
0.001CysXaa: 0.001 ± 0.0
Asp
2.234AspAla: 2.234 ± 0.022
0.92AspCys: 0.92 ± 0.012
4.1AspAsp: 4.1 ± 0.048
3.447AspGlu: 3.447 ± 0.028
2.054AspPhe: 2.054 ± 0.016
2.509AspGly: 2.509 ± 0.026
1.56AspHis: 1.56 ± 0.016
3.432AspIle: 3.432 ± 0.025
2.723AspLys: 2.723 ± 0.024
5.356AspLeu: 5.356 ± 0.033
1.051AspMet: 1.051 ± 0.014
3.567AspAsn: 3.567 ± 0.031
2.231AspPro: 2.231 ± 0.021
1.983AspGln: 1.983 ± 0.017
2.378AspArg: 2.378 ± 0.022
4.249AspSer: 4.249 ± 0.031
2.734AspThr: 2.734 ± 0.018
3.345AspVal: 3.345 ± 0.025
0.587AspTrp: 0.587 ± 0.01
1.724AspTyr: 1.724 ± 0.017
0.002AspXaa: 0.002 ± 0.0
Glu
2.926GluAla: 2.926 ± 0.026
0.941GluCys: 0.941 ± 0.015
3.138GluAsp: 3.138 ± 0.026
4.415GluGlu: 4.415 ± 0.047
1.943GluPhe: 1.943 ± 0.019
2.211GluGly: 2.211 ± 0.02
1.417GluHis: 1.417 ± 0.016
3.602GluIle: 3.602 ± 0.026
3.907GluLys: 3.907 ± 0.031
5.544GluLeu: 5.544 ± 0.038
1.356GluMet: 1.356 ± 0.016
3.839GluAsn: 3.839 ± 0.028
1.626GluPro: 1.626 ± 0.016
2.568GluGln: 2.568 ± 0.024
3.052GluArg: 3.052 ± 0.028
4.352GluSer: 4.352 ± 0.027
3.567GluThr: 3.567 ± 0.025
2.783GluVal: 2.783 ± 0.021
0.622GluTrp: 0.622 ± 0.009
1.703GluTyr: 1.703 ± 0.015
0.003GluXaa: 0.003 ± 0.001
Phe
1.709PheAla: 1.709 ± 0.017
0.77PheCys: 0.77 ± 0.012
2.109PheAsp: 2.109 ± 0.019
1.71PheGlu: 1.71 ± 0.019
1.306PhePhe: 1.306 ± 0.016
1.72PheGly: 1.72 ± 0.019
1.168PheHis: 1.168 ± 0.015
2.849PheIle: 2.849 ± 0.03
2.079PheLys: 2.079 ± 0.019
3.466PheLeu: 3.466 ± 0.028
0.798PheMet: 0.798 ± 0.011
2.758PheAsn: 2.758 ± 0.024
1.624PhePro: 1.624 ± 0.019
1.503PheGln: 1.503 ± 0.016
1.768PheArg: 1.768 ± 0.018
3.239PheSer: 3.239 ± 0.027
2.724PheThr: 2.724 ± 0.019
2.144PheVal: 2.144 ± 0.02
0.351PheTrp: 0.351 ± 0.007
1.341PheTyr: 1.341 ± 0.015
0.001PheXaa: 0.001 ± 0.0
Gly
1.882GlyAla: 1.882 ± 0.023
0.886GlyCys: 0.886 ± 0.013
2.086GlyAsp: 2.086 ± 0.021
2.489GlyGlu: 2.489 ± 0.028
2.063GlyPhe: 2.063 ± 0.02
3.042GlyGly: 3.042 ± 0.032
1.339GlyHis: 1.339 ± 0.014
3.073GlyIle: 3.073 ± 0.022
3.119GlyLys: 3.119 ± 0.026
4.298GlyLeu: 4.298 ± 0.034
0.869GlyMet: 0.869 ± 0.013
2.37GlyAsn: 2.37 ± 0.018
1.728GlyPro: 1.728 ± 0.044
1.783GlyGln: 1.783 ± 0.018
2.6GlyArg: 2.6 ± 0.021
4.402GlySer: 4.402 ± 0.031
2.511GlyThr: 2.511 ± 0.023
2.556GlyVal: 2.556 ± 0.022
0.66GlyTrp: 0.66 ± 0.01
1.464GlyTyr: 1.464 ± 0.019
0.001GlyXaa: 0.001 ± 0.0
His
0.901HisAla: 0.901 ± 0.011
0.631HisCys: 0.631 ± 0.011
1.261HisAsp: 1.261 ± 0.012
1.608HisGlu: 1.608 ± 0.017
1.056HisPhe: 1.056 ± 0.014
1.539HisGly: 1.539 ± 0.015
1.741HisHis: 1.741 ± 0.029
1.792HisIle: 1.792 ± 0.018
1.928HisLys: 1.928 ± 0.021
2.927HisLeu: 2.927 ± 0.025
0.578HisMet: 0.578 ± 0.009
1.751HisAsn: 1.751 ± 0.021
1.215HisPro: 1.215 ± 0.013
1.343HisGln: 1.343 ± 0.017
1.404HisArg: 1.404 ± 0.015
2.68HisSer: 2.68 ± 0.025
1.845HisThr: 1.845 ± 0.022
1.26HisVal: 1.26 ± 0.015
0.536HisTrp: 0.536 ± 0.01
1.013HisTyr: 1.013 ± 0.013
0.001HisXaa: 0.001 ± 0.0
Ile
2.704IleAla: 2.704 ± 0.022
1.403IleCys: 1.403 ± 0.017
3.979IleAsp: 3.979 ± 0.03
3.487IleGlu: 3.487 ± 0.029
2.414IlePhe: 2.414 ± 0.024
3.099IleGly: 3.099 ± 0.025
2.39IleHis: 2.39 ± 0.021
5.051IleIle: 5.051 ± 0.044
4.102IleLys: 4.102 ± 0.027
5.977IleLeu: 5.977 ± 0.039
1.427IleMet: 1.427 ± 0.015
4.987IleAsn: 4.987 ± 0.041
3.087IlePro: 3.087 ± 0.026
3.384IleGln: 3.384 ± 0.028
3.268IleArg: 3.268 ± 0.023
6.214IleSer: 6.214 ± 0.036
4.549IleThr: 4.549 ± 0.028
3.043IleVal: 3.043 ± 0.026
0.822IleTrp: 0.822 ± 0.012
2.004IleTyr: 2.004 ± 0.021
0.003IleXaa: 0.003 ± 0.001
Lys
3.534LysAla: 3.534 ± 0.03
1.306LysCys: 1.306 ± 0.016
2.542LysAsp: 2.542 ± 0.022
3.714LysGlu: 3.714 ± 0.034
2.122LysPhe: 2.122 ± 0.021
1.97LysGly: 1.97 ± 0.027
1.854LysHis: 1.854 ± 0.019
3.739LysIle: 3.739 ± 0.026
4.02LysLys: 4.02 ± 0.04
5.722LysLeu: 5.722 ± 0.036
1.367LysMet: 1.367 ± 0.014
3.497LysAsn: 3.497 ± 0.028
2.4LysPro: 2.4 ± 0.024
3.081LysGln: 3.081 ± 0.025
3.647LysArg: 3.647 ± 0.032
5.566LysSer: 5.566 ± 0.037
4.217LysThr: 4.217 ± 0.03
3.021LysVal: 3.021 ± 0.024
0.674LysTrp: 0.674 ± 0.011
2.019LysTyr: 2.019 ± 0.021
0.003LysXaa: 0.003 ± 0.001
Leu
4.488LeuAla: 4.488 ± 0.033
2.07LeuCys: 2.07 ± 0.022
5.131LeuAsp: 5.131 ± 0.032
4.677LeuGlu: 4.677 ± 0.04
3.678LeuPhe: 3.678 ± 0.03
4.005LeuGly: 4.005 ± 0.025
2.515LeuHis: 2.515 ± 0.021
5.822LeuIle: 5.822 ± 0.043
5.771LeuLys: 5.771 ± 0.035
10.059LeuLeu: 10.059 ± 0.102
2.12LeuMet: 2.12 ± 0.019
6.632LeuAsn: 6.632 ± 0.04
4.701LeuPro: 4.701 ± 0.033
4.102LeuGln: 4.102 ± 0.029
5.387LeuArg: 5.387 ± 0.037
9.451LeuSer: 9.451 ± 0.04
5.614LeuThr: 5.614 ± 0.031
4.906LeuVal: 4.906 ± 0.034
1.08LeuTrp: 1.08 ± 0.014
2.936LeuTyr: 2.936 ± 0.024
0.005LeuXaa: 0.005 ± 0.001
Met
0.98MetAla: 0.98 ± 0.014
0.347MetCys: 0.347 ± 0.007
1.055MetAsp: 1.055 ± 0.014
1.475MetGlu: 1.475 ± 0.018
0.739MetPhe: 0.739 ± 0.01
0.988MetGly: 0.988 ± 0.014
0.479MetHis: 0.479 ± 0.009
1.238MetIle: 1.238 ± 0.015
1.97MetLys: 1.97 ± 0.018
2.107MetLeu: 2.107 ± 0.019
0.658MetMet: 0.658 ± 0.022
1.964MetAsn: 1.964 ± 0.019
0.751MetPro: 0.751 ± 0.012
1.094MetGln: 1.094 ± 0.015
1.058MetArg: 1.058 ± 0.012
1.586MetSer: 1.586 ± 0.015
1.106MetThr: 1.106 ± 0.014
1.195MetVal: 1.195 ± 0.015
0.251MetTrp: 0.251 ± 0.006
0.604MetTyr: 0.604 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
2.574AsnAla: 2.574 ± 0.021
1.357AsnCys: 1.357 ± 0.016
3.991AsnAsp: 3.991 ± 0.035
3.935AsnGlu: 3.935 ± 0.035
2.069AsnPhe: 2.069 ± 0.019
2.702AsnGly: 2.702 ± 0.028
2.075AsnHis: 2.075 ± 0.023
5.179AsnIle: 5.179 ± 0.032
4.117AsnLys: 4.117 ± 0.035
5.472AsnLeu: 5.472 ± 0.036
1.327AsnMet: 1.327 ± 0.015
9.637AsnAsn: 9.637 ± 0.119
2.936AsnPro: 2.936 ± 0.02
3.302AsnGln: 3.302 ± 0.028
2.969AsnArg: 2.969 ± 0.022
7.288AsnSer: 7.288 ± 0.055
4.882AsnThr: 4.882 ± 0.045
3.354AsnVal: 3.354 ± 0.022
0.841AsnTrp: 0.841 ± 0.011
2.177AsnTyr: 2.177 ± 0.023
0.002AsnXaa: 0.002 ± 0.001
Pro
1.802ProAla: 1.802 ± 0.018
0.84ProCys: 0.84 ± 0.014
2.388ProAsp: 2.388 ± 0.02
2.236ProGlu: 2.236 ± 0.02
1.529ProPhe: 1.529 ± 0.017
2.268ProGly: 2.268 ± 0.046
1.181ProHis: 1.181 ± 0.015
3.151ProIle: 3.151 ± 0.024
2.162ProLys: 2.162 ± 0.022
3.551ProLeu: 3.551 ± 0.026
0.861ProMet: 0.861 ± 0.011
2.676ProAsn: 2.676 ± 0.022
2.657ProPro: 2.657 ± 0.038
1.457ProGln: 1.457 ± 0.02
1.863ProArg: 1.863 ± 0.019
4.455ProSer: 4.455 ± 0.038
3.042ProThr: 3.042 ± 0.021
2.906ProVal: 2.906 ± 0.024
0.354ProTrp: 0.354 ± 0.007
1.313ProTyr: 1.313 ± 0.016
0.001ProXaa: 0.001 ± 0.0
Gln
2.006GlnAla: 2.006 ± 0.022
0.986GlnCys: 0.986 ± 0.013
1.541GlnAsp: 1.541 ± 0.014
2.274GlnGlu: 2.274 ± 0.021
1.624GlnPhe: 1.624 ± 0.017
1.652GlnGly: 1.652 ± 0.018
1.289GlnHis: 1.289 ± 0.019
2.911GlnIle: 2.911 ± 0.024
2.193GlnLys: 2.193 ± 0.019
5.454GlnLeu: 5.454 ± 0.038
1.252GlnMet: 1.252 ± 0.015
2.602GlnAsn: 2.602 ± 0.026
1.744GlnPro: 1.744 ± 0.02
3.533GlnGln: 3.533 ± 0.051
2.14GlnArg: 2.14 ± 0.021
4.103GlnSer: 4.103 ± 0.03
2.508GlnThr: 2.508 ± 0.024
2.062GlnVal: 2.062 ± 0.02
0.629GlnTrp: 0.629 ± 0.011
1.358GlnTyr: 1.358 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
2.171ArgAla: 2.171 ± 0.018
0.955ArgCys: 0.955 ± 0.014
2.03ArgAsp: 2.03 ± 0.019
2.578ArgGlu: 2.578 ± 0.024
2.037ArgPhe: 2.037 ± 0.021
2.012ArgGly: 2.012 ± 0.025
1.576ArgHis: 1.576 ± 0.017
3.805ArgIle: 3.805 ± 0.027
3.848ArgLys: 3.848 ± 0.037
5.056ArgLeu: 5.056 ± 0.04
1.326ArgMet: 1.326 ± 0.015
3.056ArgAsn: 3.056 ± 0.026
2.231ArgPro: 2.231 ± 0.018
2.581ArgGln: 2.581 ± 0.023
3.909ArgArg: 3.909 ± 0.04
4.122ArgSer: 4.122 ± 0.033
3.406ArgThr: 3.406 ± 0.025
2.367ArgVal: 2.367 ± 0.023
0.757ArgTrp: 0.757 ± 0.013
1.676ArgTyr: 1.676 ± 0.016
0.001ArgXaa: 0.001 ± 0.001
Ser
3.846SerAla: 3.846 ± 0.029
2.062SerCys: 2.062 ± 0.021
4.927SerAsp: 4.927 ± 0.029
4.681SerGlu: 4.681 ± 0.031
3.622SerPhe: 3.622 ± 0.026
4.523SerGly: 4.523 ± 0.03
2.555SerHis: 2.555 ± 0.021
6.466SerIle: 6.466 ± 0.04
5.22SerLys: 5.22 ± 0.031
8.01SerLeu: 8.01 ± 0.043
1.983SerMet: 1.983 ± 0.015
7.453SerAsn: 7.453 ± 0.048
4.145SerPro: 4.145 ± 0.037
3.671SerGln: 3.671 ± 0.026
4.403SerArg: 4.403 ± 0.033
13.0SerSer: 13.0 ± 0.124
7.08SerThr: 7.08 ± 0.046
5.492SerVal: 5.492 ± 0.036
0.798SerTrp: 0.798 ± 0.013
2.568SerTyr: 2.568 ± 0.021
0.002SerXaa: 0.002 ± 0.001
Thr
3.199ThrAla: 3.199 ± 0.027
1.278ThrCys: 1.278 ± 0.015
3.262ThrAsp: 3.262 ± 0.023
3.379ThrGlu: 3.379 ± 0.026
2.103ThrPhe: 2.103 ± 0.016
2.929ThrGly: 2.929 ± 0.027
1.6ThrHis: 1.6 ± 0.02
4.564ThrIle: 4.564 ± 0.032
3.536ThrLys: 3.536 ± 0.025
5.583ThrLeu: 5.583 ± 0.029
1.43ThrMet: 1.43 ± 0.015
5.383ThrAsn: 5.383 ± 0.049
2.587ThrPro: 2.587 ± 0.028
2.311ThrGln: 2.311 ± 0.022
2.944ThrArg: 2.944 ± 0.024
7.087ThrSer: 7.087 ± 0.045
8.049ThrThr: 8.049 ± 0.178
3.645ThrVal: 3.645 ± 0.025
0.905ThrTrp: 0.905 ± 0.013
1.908ThrTyr: 1.908 ± 0.02
0.003ThrXaa: 0.003 ± 0.001
Val
2.819ValAla: 2.819 ± 0.025
1.182ValCys: 1.182 ± 0.014
2.92ValAsp: 2.92 ± 0.024
2.969ValGlu: 2.969 ± 0.026
2.033ValPhe: 2.033 ± 0.019
3.069ValGly: 3.069 ± 0.024
1.449ValHis: 1.449 ± 0.015
3.425ValIle: 3.425 ± 0.024
3.425ValLys: 3.425 ± 0.028
5.149ValLeu: 5.149 ± 0.03
1.007ValMet: 1.007 ± 0.013
3.292ValAsn: 3.292 ± 0.021
2.298ValPro: 2.298 ± 0.022
1.984ValGln: 1.984 ± 0.017
2.797ValArg: 2.797 ± 0.024
4.897ValSer: 4.897 ± 0.034
3.137ValThr: 3.137 ± 0.024
3.173ValVal: 3.173 ± 0.026
0.525ValTrp: 0.525 ± 0.009
1.628ValTyr: 1.628 ± 0.017
0.004ValXaa: 0.004 ± 0.001
Trp
0.35TrpAla: 0.35 ± 0.008
0.204TrpCys: 0.204 ± 0.006
0.501TrpAsp: 0.501 ± 0.009
0.735TrpGlu: 0.735 ± 0.012
0.426TrpPhe: 0.426 ± 0.009
0.324TrpGly: 0.324 ± 0.008
0.22TrpHis: 0.22 ± 0.006
1.116TrpIle: 1.116 ± 0.014
1.01TrpLys: 1.01 ± 0.014
0.956TrpLeu: 0.956 ± 0.015
0.314TrpMet: 0.314 ± 0.007
1.047TrpAsn: 1.047 ± 0.012
0.57TrpPro: 0.57 ± 0.009
0.285TrpGln: 0.285 ± 0.006
0.792TrpArg: 0.792 ± 0.013
0.849TrpSer: 0.849 ± 0.013
0.978TrpThr: 0.978 ± 0.013
0.44TrpVal: 0.44 ± 0.009
0.132TrpTrp: 0.132 ± 0.005
0.28TrpTyr: 0.28 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.355TyrAla: 1.355 ± 0.016
0.71TyrCys: 0.71 ± 0.013
1.652TyrAsp: 1.652 ± 0.017
1.813TyrGlu: 1.813 ± 0.016
1.376TyrPhe: 1.376 ± 0.017
1.697TyrGly: 1.697 ± 0.021
1.031TyrHis: 1.031 ± 0.013
1.905TyrIle: 1.905 ± 0.021
1.458TyrLys: 1.458 ± 0.016
3.448TyrLeu: 3.448 ± 0.025
0.648TyrMet: 0.648 ± 0.011
2.183TyrAsn: 2.183 ± 0.022
1.364TyrPro: 1.364 ± 0.017
1.277TyrGln: 1.277 ± 0.015
1.503TyrArg: 1.503 ± 0.017
2.813TyrSer: 2.813 ± 0.023
1.743TyrThr: 1.743 ± 0.017
1.524TyrVal: 1.524 ± 0.016
0.357TyrTrp: 0.357 ± 0.008
1.175TyrTyr: 1.175 ± 0.02
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.001
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.003XaaIle: 0.003 ± 0.001
0.003XaaLys: 0.003 ± 0.001
0.003XaaLeu: 0.003 ± 0.001
0.002XaaMet: 0.002 ± 0.001
0.003XaaAsn: 0.003 ± 0.001
0.002XaaPro: 0.002 ± 0.001
0.002XaaGln: 0.002 ± 0.001
0.002XaaArg: 0.002 ± 0.001
0.003XaaSer: 0.003 ± 0.001
0.003XaaThr: 0.003 ± 0.001
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.002XaaXaa: 0.002 ± 0.001
Statistics based on 25527 proteins (6596622 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski