Amino acid dipepetide frequency for Armigeres subalbatus virus SaX06-AK20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.589AlaAla: 7.589 ± 3.172
0.446AlaCys: 0.446 ± 0.226
2.232AlaAsp: 2.232 ± 1.128
4.464AlaGlu: 4.464 ± 0.402
2.679AlaPhe: 2.679 ± 0.689
6.696AlaGly: 6.696 ± 2.056
1.339AlaHis: 1.339 ± 0.677
5.357AlaIle: 5.357 ± 0.049
4.018AlaLys: 4.018 ± 0.628
6.696AlaLeu: 6.696 ± 1.391
0.893AlaMet: 0.893 ± 0.451
4.911AlaAsn: 4.911 ± 0.488
4.018AlaPro: 4.018 ± 2.031
3.571AlaGln: 3.571 ± 0.476
2.679AlaArg: 2.679 ± 0.64
6.696AlaSer: 6.696 ± 1.391
9.821AlaThr: 9.821 ± 2.306
4.018AlaVal: 4.018 ± 0.628
1.786AlaTrp: 1.786 ± 0.238
2.679AlaTyr: 2.679 ± 0.64
0.0AlaXaa: 0.0 ± 0.0
Cys
0.446CysAla: 0.446 ± 0.226
0.0CysCys: 0.0 ± 0.0
0.446CysAsp: 0.446 ± 0.226
0.893CysGlu: 0.893 ± 0.878
0.0CysPhe: 0.0 ± 0.0
0.446CysGly: 0.446 ± 0.226
0.0CysHis: 0.0 ± 0.0
0.446CysIle: 0.446 ± 0.226
0.446CysLys: 0.446 ± 0.439
1.786CysLeu: 1.786 ± 0.427
0.446CysMet: 0.446 ± 0.226
0.893CysAsn: 0.893 ± 0.451
0.446CysPro: 0.446 ± 0.226
0.893CysGln: 0.893 ± 0.213
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.446CysThr: 0.446 ± 0.226
0.893CysVal: 0.893 ± 0.213
0.0CysTrp: 0.0 ± 0.0
0.446CysTyr: 0.446 ± 0.226
0.0CysXaa: 0.0 ± 0.0
Asp
1.786AspAla: 1.786 ± 0.238
0.446AspCys: 0.446 ± 0.226
3.571AspAsp: 3.571 ± 0.189
4.911AspGlu: 4.911 ± 0.841
4.018AspPhe: 4.018 ± 2.622
1.786AspGly: 1.786 ± 0.427
0.893AspHis: 0.893 ± 0.878
4.018AspIle: 4.018 ± 0.702
2.232AspLys: 2.232 ± 0.866
3.571AspLeu: 3.571 ± 1.141
1.339AspMet: 1.339 ± 0.012
2.232AspAsn: 2.232 ± 0.464
1.786AspPro: 1.786 ± 0.427
2.232AspGln: 2.232 ± 1.128
2.679AspArg: 2.679 ± 0.689
1.786AspSer: 1.786 ± 0.238
2.679AspThr: 2.679 ± 0.689
4.911AspVal: 4.911 ± 2.482
0.446AspTrp: 0.446 ± 0.439
2.232AspTyr: 2.232 ± 1.128
0.0AspXaa: 0.0 ± 0.0
Glu
3.571GluAla: 3.571 ± 0.189
0.0GluCys: 0.0 ± 0.0
3.125GluAsp: 3.125 ± 1.079
3.125GluGlu: 3.125 ± 0.25
2.232GluPhe: 2.232 ± 0.866
0.893GluGly: 0.893 ± 0.878
1.339GluHis: 1.339 ± 0.652
3.571GluIle: 3.571 ± 2.847
3.125GluLys: 3.125 ± 0.915
6.25GluLeu: 6.25 ± 1.493
1.339GluMet: 1.339 ± 0.012
1.339GluAsn: 1.339 ± 0.677
3.125GluPro: 3.125 ± 1.079
1.786GluGln: 1.786 ± 0.238
3.125GluArg: 3.125 ± 0.25
2.679GluSer: 2.679 ± 0.64
4.018GluThr: 4.018 ± 0.702
2.232GluVal: 2.232 ± 0.866
0.893GluTrp: 0.893 ± 0.213
3.571GluTyr: 3.571 ± 0.189
0.0GluXaa: 0.0 ± 0.0
Phe
4.464PheAla: 4.464 ± 1.592
1.339PheCys: 1.339 ± 0.012
4.464PheAsp: 4.464 ± 0.927
3.125PheGlu: 3.125 ± 1.744
0.446PhePhe: 0.446 ± 0.226
2.679PheGly: 2.679 ± 0.64
0.893PheHis: 0.893 ± 0.451
2.232PheIle: 2.232 ± 1.53
3.125PheLys: 3.125 ± 1.744
1.786PheLeu: 1.786 ± 0.238
0.0PheMet: 0.0 ± 0.0
3.125PheAsn: 3.125 ± 0.414
1.786PhePro: 1.786 ± 0.238
0.0PheGln: 0.0 ± 0.0
0.893PheArg: 0.893 ± 0.878
2.232PheSer: 2.232 ± 0.464
2.679PheThr: 2.679 ± 0.025
4.464PheVal: 4.464 ± 1.067
0.446PheTrp: 0.446 ± 0.226
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.357GlyAla: 5.357 ± 2.708
0.446GlyCys: 0.446 ± 0.439
2.679GlyAsp: 2.679 ± 0.025
2.232GlyGlu: 2.232 ± 0.201
2.232GlyPhe: 2.232 ± 0.866
5.357GlyGly: 5.357 ± 0.049
1.339GlyHis: 1.339 ± 1.317
3.571GlyIle: 3.571 ± 0.853
2.232GlyLys: 2.232 ± 2.195
5.804GlyLeu: 5.804 ± 0.275
0.893GlyMet: 0.893 ± 0.213
5.804GlyAsn: 5.804 ± 1.604
2.679GlyPro: 2.679 ± 0.025
1.339GlyGln: 1.339 ± 0.677
3.571GlyArg: 3.571 ± 0.476
1.786GlySer: 1.786 ± 0.427
1.339GlyThr: 1.339 ± 0.677
3.571GlyVal: 3.571 ± 1.141
1.786GlyTrp: 1.786 ± 1.756
1.786GlyTyr: 1.786 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
0.893HisAla: 0.893 ± 0.451
0.0HisCys: 0.0 ± 0.0
2.232HisAsp: 2.232 ± 1.128
1.339HisGlu: 1.339 ± 0.652
0.893HisPhe: 0.893 ± 0.213
1.786HisGly: 1.786 ± 1.091
0.893HisHis: 0.893 ± 0.451
0.893HisIle: 0.893 ± 0.213
0.446HisLys: 0.446 ± 0.226
3.571HisLeu: 3.571 ± 1.518
0.0HisMet: 0.0 ± 0.0
1.339HisAsn: 1.339 ± 0.677
0.893HisPro: 0.893 ± 0.213
1.339HisGln: 1.339 ± 1.317
1.339HisArg: 1.339 ± 0.012
1.786HisSer: 1.786 ± 0.903
1.786HisThr: 1.786 ± 0.903
1.786HisVal: 1.786 ± 0.427
0.446HisTrp: 0.446 ± 0.439
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.696IleAla: 6.696 ± 0.726
1.339IleCys: 1.339 ± 0.652
1.786IleAsp: 1.786 ± 0.903
2.679IleGlu: 2.679 ± 0.64
2.679IlePhe: 2.679 ± 0.64
4.464IleGly: 4.464 ± 1.731
1.339IleHis: 1.339 ± 0.012
3.125IleIle: 3.125 ± 0.414
2.232IleLys: 2.232 ± 1.53
7.143IleLeu: 7.143 ± 1.707
2.232IleMet: 2.232 ± 0.385
4.018IleAsn: 4.018 ± 0.628
5.357IlePro: 5.357 ± 0.049
2.232IleGln: 2.232 ± 0.866
2.679IleArg: 2.679 ± 1.305
4.018IleSer: 4.018 ± 1.292
4.911IleThr: 4.911 ± 0.176
3.125IleVal: 3.125 ± 0.25
0.446IleTrp: 0.446 ± 0.226
1.339IleTyr: 1.339 ± 0.652
0.0IleXaa: 0.0 ± 0.0
Lys
2.679LysAla: 2.679 ± 0.025
0.446LysCys: 0.446 ± 0.226
2.679LysAsp: 2.679 ± 1.305
3.571LysGlu: 3.571 ± 1.518
1.786LysPhe: 1.786 ± 0.427
4.018LysGly: 4.018 ± 1.292
0.893LysHis: 0.893 ± 0.213
4.464LysIle: 4.464 ± 1.731
1.339LysLys: 1.339 ± 0.012
2.679LysLeu: 2.679 ± 1.969
1.339LysMet: 1.339 ± 0.677
2.679LysAsn: 2.679 ± 0.64
2.679LysPro: 2.679 ± 0.025
1.339LysGln: 1.339 ± 0.012
4.911LysArg: 4.911 ± 1.506
1.786LysSer: 1.786 ± 0.238
4.464LysThr: 4.464 ± 0.927
4.911LysVal: 4.911 ± 0.841
1.339LysTrp: 1.339 ± 0.652
2.679LysTyr: 2.679 ± 0.64
0.0LysXaa: 0.0 ± 0.0
Leu
8.036LeuAla: 8.036 ± 0.591
2.232LeuCys: 2.232 ± 0.201
4.464LeuAsp: 4.464 ± 0.402
1.786LeuGlu: 1.786 ± 0.238
0.893LeuPhe: 0.893 ± 0.878
4.018LeuGly: 4.018 ± 0.702
3.125LeuHis: 3.125 ± 1.079
4.464LeuIle: 4.464 ± 1.067
4.018LeuLys: 4.018 ± 0.628
4.911LeuLeu: 4.911 ± 1.153
1.339LeuMet: 1.339 ± 0.012
5.357LeuAsn: 5.357 ± 1.379
5.804LeuPro: 5.804 ± 0.275
2.679LeuGln: 2.679 ± 0.025
4.911LeuArg: 4.911 ± 0.841
5.357LeuSer: 5.357 ± 0.714
7.143LeuThr: 7.143 ± 0.287
4.464LeuVal: 4.464 ± 0.927
1.786LeuTrp: 1.786 ± 0.427
4.911LeuTyr: 4.911 ± 0.841
0.0LeuXaa: 0.0 ± 0.0
Met
1.786MetAla: 1.786 ± 0.238
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.786MetGlu: 1.786 ± 0.238
1.339MetPhe: 1.339 ± 0.012
0.446MetGly: 0.446 ± 0.226
0.893MetHis: 0.893 ± 0.451
0.893MetIle: 0.893 ± 0.213
1.786MetLys: 1.786 ± 0.427
1.339MetLeu: 1.339 ± 0.012
0.446MetMet: 0.446 ± 0.439
2.232MetAsn: 2.232 ± 0.464
1.339MetPro: 1.339 ± 0.652
0.0MetGln: 0.0 ± 0.0
0.893MetArg: 0.893 ± 0.213
1.339MetSer: 1.339 ± 0.677
1.786MetThr: 1.786 ± 1.091
1.339MetVal: 1.339 ± 0.012
0.0MetTrp: 0.0 ± 0.0
0.446MetTyr: 0.446 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
6.25AsnAla: 6.25 ± 1.83
0.446AsnCys: 0.446 ± 0.226
3.571AsnAsp: 3.571 ± 0.189
2.232AsnGlu: 2.232 ± 0.464
3.125AsnPhe: 3.125 ± 0.915
4.911AsnGly: 4.911 ± 0.841
1.339AsnHis: 1.339 ± 0.677
6.696AsnIle: 6.696 ± 3.262
3.125AsnLys: 3.125 ± 0.915
4.464AsnLeu: 4.464 ± 0.402
0.893AsnMet: 0.893 ± 0.213
7.589AsnAsn: 7.589 ± 1.178
4.018AsnPro: 4.018 ± 0.702
4.464AsnGln: 4.464 ± 0.927
2.679AsnArg: 2.679 ± 0.689
3.571AsnSer: 3.571 ± 0.476
2.679AsnThr: 2.679 ± 1.354
3.571AsnVal: 3.571 ± 1.141
1.339AsnTrp: 1.339 ± 0.012
2.232AsnTyr: 2.232 ± 0.464
0.0AsnXaa: 0.0 ± 0.0
Pro
4.018ProAla: 4.018 ± 0.702
0.446ProCys: 0.446 ± 0.226
3.125ProAsp: 3.125 ± 0.915
2.679ProGlu: 2.679 ± 0.689
3.571ProPhe: 3.571 ± 0.189
2.679ProGly: 2.679 ± 0.025
0.893ProHis: 0.893 ± 0.213
3.571ProIle: 3.571 ± 0.476
4.018ProLys: 4.018 ± 0.037
4.911ProLeu: 4.911 ± 1.153
1.786ProMet: 1.786 ± 0.427
4.018ProAsn: 4.018 ± 1.366
4.464ProPro: 4.464 ± 0.263
3.125ProGln: 3.125 ± 0.414
3.125ProArg: 3.125 ± 0.915
4.464ProSer: 4.464 ± 1.067
5.357ProThr: 5.357 ± 0.615
4.911ProVal: 4.911 ± 0.488
1.786ProTrp: 1.786 ± 1.091
1.339ProTyr: 1.339 ± 0.677
0.0ProXaa: 0.0 ± 0.0
Gln
3.571GlnAla: 3.571 ± 0.476
0.446GlnCys: 0.446 ± 0.439
0.446GlnAsp: 0.446 ± 0.226
1.339GlnGlu: 1.339 ± 0.652
0.893GlnPhe: 0.893 ± 0.451
1.339GlnGly: 1.339 ± 0.012
1.339GlnHis: 1.339 ± 1.317
3.571GlnIle: 3.571 ± 0.476
2.232GlnLys: 2.232 ± 0.866
1.786GlnLeu: 1.786 ± 0.238
0.0GlnMet: 0.0 ± 0.0
3.125GlnAsn: 3.125 ± 0.414
2.679GlnPro: 2.679 ± 0.64
1.339GlnGln: 1.339 ± 0.677
3.571GlnArg: 3.571 ± 0.189
1.786GlnSer: 1.786 ± 0.238
1.786GlnThr: 1.786 ± 0.238
3.125GlnVal: 3.125 ± 0.915
0.893GlnTrp: 0.893 ± 0.451
2.232GlnTyr: 2.232 ± 1.128
0.0GlnXaa: 0.0 ± 0.0
Arg
2.679ArgAla: 2.679 ± 0.025
0.0ArgCys: 0.0 ± 0.0
4.464ArgAsp: 4.464 ± 0.402
2.679ArgGlu: 2.679 ± 1.305
2.679ArgPhe: 2.679 ± 0.025
2.232ArgGly: 2.232 ± 1.128
0.893ArgHis: 0.893 ± 0.451
3.571ArgIle: 3.571 ± 1.141
2.232ArgLys: 2.232 ± 0.201
5.357ArgLeu: 5.357 ± 0.615
1.786ArgMet: 1.786 ± 0.427
3.125ArgAsn: 3.125 ± 1.079
3.571ArgPro: 3.571 ± 0.476
2.679ArgGln: 2.679 ± 0.64
2.679ArgArg: 2.679 ± 1.305
2.679ArgSer: 2.679 ± 0.025
2.679ArgThr: 2.679 ± 1.305
1.339ArgVal: 1.339 ± 0.012
0.446ArgTrp: 0.446 ± 0.226
1.339ArgTyr: 1.339 ± 0.652
0.0ArgXaa: 0.0 ± 0.0
Ser
3.571SerAla: 3.571 ± 1.805
0.0SerCys: 0.0 ± 0.0
1.339SerAsp: 1.339 ± 0.677
1.339SerGlu: 1.339 ± 0.652
1.339SerPhe: 1.339 ± 0.652
3.571SerGly: 3.571 ± 0.189
1.339SerHis: 1.339 ± 0.677
2.679SerIle: 2.679 ± 0.64
4.018SerLys: 4.018 ± 1.957
4.911SerLeu: 4.911 ± 0.841
0.893SerMet: 0.893 ± 0.451
5.357SerAsn: 5.357 ± 1.379
3.571SerPro: 3.571 ± 0.476
2.232SerGln: 2.232 ± 0.866
2.232SerArg: 2.232 ± 0.866
4.018SerSer: 4.018 ± 0.628
4.464SerThr: 4.464 ± 0.927
6.25SerVal: 6.25 ± 1.83
1.786SerTrp: 1.786 ± 0.238
3.571SerTyr: 3.571 ± 0.476
0.0SerXaa: 0.0 ± 0.0
Thr
5.357ThrAla: 5.357 ± 1.379
0.446ThrCys: 0.446 ± 0.226
4.018ThrAsp: 4.018 ± 0.628
2.679ThrGlu: 2.679 ± 0.025
3.125ThrPhe: 3.125 ± 0.414
3.571ThrGly: 3.571 ± 0.476
1.786ThrHis: 1.786 ± 0.903
5.357ThrIle: 5.357 ± 0.714
6.25ThrLys: 6.25 ± 2.158
7.589ThrLeu: 7.589 ± 1.842
1.786ThrMet: 1.786 ± 1.091
5.357ThrAsn: 5.357 ± 1.379
4.018ThrPro: 4.018 ± 0.702
2.232ThrGln: 2.232 ± 1.128
3.125ThrArg: 3.125 ± 0.915
5.357ThrSer: 5.357 ± 1.379
5.357ThrThr: 5.357 ± 1.379
2.232ThrVal: 2.232 ± 0.201
1.339ThrTrp: 1.339 ± 0.677
1.786ThrTyr: 1.786 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
8.482ValAla: 8.482 ± 0.3
0.446ValCys: 0.446 ± 0.226
2.679ValAsp: 2.679 ± 0.689
5.357ValGlu: 5.357 ± 0.049
3.571ValPhe: 3.571 ± 1.141
2.679ValGly: 2.679 ± 0.689
0.446ValHis: 0.446 ± 0.439
2.232ValIle: 2.232 ± 0.464
3.571ValLys: 3.571 ± 0.476
2.679ValLeu: 2.679 ± 0.689
1.786ValMet: 1.786 ± 0.238
3.125ValAsn: 3.125 ± 1.744
6.25ValPro: 6.25 ± 1.83
2.679ValGln: 2.679 ± 0.025
2.679ValArg: 2.679 ± 1.305
3.571ValSer: 3.571 ± 0.476
4.018ValThr: 4.018 ± 1.366
1.339ValVal: 1.339 ± 0.012
1.786ValTrp: 1.786 ± 0.427
1.786ValTyr: 1.786 ± 0.903
0.0ValXaa: 0.0 ± 0.0
Trp
2.679TrpAla: 2.679 ± 0.689
0.0TrpCys: 0.0 ± 0.0
0.893TrpAsp: 0.893 ± 0.878
0.893TrpGlu: 0.893 ± 0.213
0.446TrpPhe: 0.446 ± 0.226
0.446TrpGly: 0.446 ± 0.226
0.446TrpHis: 0.446 ± 0.439
1.339TrpIle: 1.339 ± 0.652
0.446TrpLys: 0.446 ± 0.439
1.786TrpLeu: 1.786 ± 0.238
0.0TrpMet: 0.0 ± 0.0
2.679TrpAsn: 2.679 ± 0.025
1.339TrpPro: 1.339 ± 0.652
0.446TrpGln: 0.446 ± 0.226
0.446TrpArg: 0.446 ± 0.226
1.786TrpSer: 1.786 ± 1.756
1.339TrpThr: 1.339 ± 0.652
0.446TrpVal: 0.446 ± 0.226
0.446TrpTrp: 0.446 ± 0.439
1.339TrpTyr: 1.339 ± 0.012
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.679TyrAla: 2.679 ± 1.969
0.446TyrCys: 0.446 ± 0.226
1.339TyrAsp: 1.339 ± 0.677
2.232TyrGlu: 2.232 ± 0.201
2.232TyrPhe: 2.232 ± 0.464
1.786TyrGly: 1.786 ± 0.238
2.232TyrHis: 2.232 ± 0.464
1.786TyrIle: 1.786 ± 0.427
1.786TyrLys: 1.786 ± 0.903
2.679TyrLeu: 2.679 ± 0.64
0.446TyrMet: 0.446 ± 0.165
0.893TyrAsn: 0.893 ± 0.451
4.464TyrPro: 4.464 ± 0.263
0.893TyrGln: 0.893 ± 0.213
0.893TyrArg: 0.893 ± 0.451
1.786TyrSer: 1.786 ± 0.238
4.018TyrThr: 4.018 ± 0.037
2.232TyrVal: 2.232 ± 0.464
0.446TyrTrp: 0.446 ± 0.439
1.786TyrTyr: 1.786 ± 0.427
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2241 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski