Amino acid dipeptide frequency for Penicillium aurantiogriseum bipartite virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
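The calculation described above can be illustrated with a minimal Python sketch. It is not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify`, the use of one-letter sequences, and the choice to count overlapping pairs within each protein only are assumptions made for illustration.

```python
from collections import Counter

# Expected frequency of each of the 400 standard dipeptides under the
# uniform (random) assumption: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for one-letter sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy)

# Converting a tabulated per mille value back to a fraction.
fraction = 4.541 * 1e-3
```

With real data the comparison against 2.5‰ would be applied to each cell of the table; for example, a value of 0.908‰ falls below the uniform expectation and 9.083‰ falls above it.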

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.541 ± 1.494
AlaCys: 0.0 ± 0.0
AlaAsp: 9.083 ± 1.812
AlaGlu: 9.083 ± 2.101
AlaPhe: 0.908 ± 0.519
AlaGly: 9.083 ± 0.736
AlaHis: 0.0 ± 0.0
AlaIle: 2.725 ± 1.277
AlaLys: 3.633 ± 0.978
AlaLeu: 5.45 ± 3.019
AlaMet: 0.908 ± 0.843
AlaAsn: 5.45 ± 1.708
AlaPro: 3.633 ± 0.978
AlaGln: 5.45 ± 2.01
AlaArg: 4.541 ± 3.517
AlaSer: 3.633 ± 2.722
AlaThr: 5.45 ± 0.408
AlaVal: 5.45 ± 2.612
AlaTrp: 1.817 ± 1.039
AlaTyr: 4.541 ± 1.77
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.908 ± 0.519
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.908 ± 0.519
CysLeu: 0.908 ± 0.519
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.908 ± 0.519
CysSer: 0.0 ± 0.0
CysThr: 0.908 ± 0.519
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 11.807 ± 2.316
AspCys: 0.0 ± 0.0
AspAsp: 4.541 ± 1.494
AspGlu: 2.725 ± 1.827
AspPhe: 1.817 ± 1.687
AspGly: 4.541 ± 1.807
AspHis: 0.0 ± 0.0
AspIle: 2.725 ± 1.558
AspLys: 1.817 ± 0.489
AspLeu: 6.358 ± 1.325
AspMet: 2.725 ± 1.277
AspAsn: 1.817 ± 1.039
AspPro: 8.174 ± 1.66
AspGln: 1.817 ± 1.039
AspArg: 1.817 ± 1.039
AspSer: 1.817 ± 0.489
AspThr: 3.633 ± 1.446
AspVal: 9.991 ± 2.526
AspTrp: 2.725 ± 1.005
AspTyr: 1.817 ± 0.489
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.633 ± 0.978
GluCys: 0.0 ± 0.0
GluAsp: 2.725 ± 1.558
GluGlu: 6.358 ± 2.443
GluPhe: 0.908 ± 0.519
GluGly: 0.908 ± 0.519
GluHis: 2.725 ± 2.206
GluIle: 3.633 ± 0.955
GluLys: 2.725 ± 0.845
GluLeu: 6.358 ± 3.532
GluMet: 3.633 ± 1.446
GluAsn: 1.817 ± 0.871
GluPro: 2.725 ± 0.553
GluGln: 8.174 ± 1.854
GluArg: 4.541 ± 1.74
GluSer: 3.633 ± 0.473
GluThr: 5.45 ± 0.928
GluVal: 6.358 ± 2.214
GluTrp: 0.908 ± 0.519
GluTyr: 4.541 ± 2.597
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.541 ± 1.77
PheCys: 0.0 ± 0.0
PheAsp: 0.908 ± 0.519
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 3.633 ± 0.955
PheHis: 0.908 ± 0.519
PheIle: 0.908 ± 0.519
PheLys: 1.817 ± 1.039
PheLeu: 4.541 ± 0.906
PheMet: 0.908 ± 1.023
PheAsn: 0.0 ± 0.0
PhePro: 0.908 ± 0.519
PheGln: 1.817 ± 1.039
PheArg: 2.725 ± 0.553
PheSer: 2.725 ± 1.277
PheThr: 0.908 ± 0.843
PheVal: 2.725 ± 1.558
PheTrp: 0.0 ± 0.0
PheTyr: 0.908 ± 0.519
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.817 ± 1.687
GlyCys: 0.0 ± 0.0
GlyAsp: 5.45 ± 0.928
GlyGlu: 5.45 ± 2.517
GlyPhe: 0.908 ± 0.519
GlyGly: 6.358 ± 3.842
GlyHis: 2.725 ± 0.553
GlyIle: 4.541 ± 0.906
GlyLys: 6.358 ± 0.115
GlyLeu: 1.817 ± 0.871
GlyMet: 2.725 ± 0.553
GlyAsn: 2.725 ± 1.005
GlyPro: 1.817 ± 0.871
GlyGln: 1.817 ± 0.489
GlyArg: 3.633 ± 0.955
GlySer: 4.541 ± 3.241
GlyThr: 6.358 ± 1.347
GlyVal: 8.174 ± 2.18
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.817 ± 0.871
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.633 ± 1.811
HisCys: 0.0 ± 0.0
HisAsp: 0.908 ± 0.519
HisGlu: 0.908 ± 0.519
HisPhe: 0.908 ± 1.023
HisGly: 0.908 ± 0.519
HisHis: 1.817 ± 0.871
HisIle: 0.908 ± 0.519
HisLys: 0.908 ± 0.519
HisLeu: 0.908 ± 0.519
HisMet: 0.908 ± 0.519
HisAsn: 0.908 ± 0.519
HisPro: 0.908 ± 0.843
HisGln: 0.908 ± 0.843
HisArg: 0.908 ± 0.843
HisSer: 1.817 ± 0.871
HisThr: 0.908 ± 0.519
HisVal: 1.817 ± 0.489
HisTrp: 1.817 ± 1.687
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.541 ± 0.522
IleCys: 1.817 ± 1.039
IleAsp: 2.725 ± 1.005
IleGlu: 0.0 ± 0.0
IlePhe: 0.908 ± 0.519
IleGly: 1.817 ± 0.489
IleHis: 0.0 ± 0.0
IleIle: 3.633 ± 0.473
IleLys: 1.817 ± 1.039
IleLeu: 1.817 ± 0.489
IleMet: 2.725 ± 0.641
IleAsn: 3.633 ± 2.109
IlePro: 3.633 ± 2.109
IleGln: 3.633 ± 0.978
IleArg: 1.817 ± 1.039
IleSer: 5.45 ± 2.236
IleThr: 0.908 ± 0.519
IleVal: 2.725 ± 1.558
IleTrp: 0.0 ± 0.0
IleTyr: 0.908 ± 0.519
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.541 ± 0.906
LysCys: 0.0 ± 0.0
LysAsp: 1.817 ± 0.489
LysGlu: 2.725 ± 0.553
LysPhe: 2.725 ± 0.553
LysGly: 2.725 ± 1.558
LysHis: 0.908 ± 1.023
LysIle: 0.908 ± 0.519
LysLys: 2.725 ± 0.553
LysLeu: 1.817 ± 1.039
LysMet: 1.817 ± 1.687
LysAsn: 2.725 ± 0.553
LysPro: 0.0 ± 0.0
LysGln: 1.817 ± 1.687
LysArg: 4.541 ± 1.435
LysSer: 3.633 ± 1.342
LysThr: 2.725 ± 1.277
LysVal: 3.633 ± 2.109
LysTrp: 0.908 ± 0.519
LysTyr: 2.725 ± 0.553
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.541 ± 2.212
LeuCys: 0.908 ± 0.519
LeuAsp: 7.266 ± 0.632
LeuGlu: 6.358 ± 2.295
LeuPhe: 0.908 ± 1.023
LeuGly: 9.083 ± 1.854
LeuHis: 0.908 ± 0.519
LeuIle: 3.633 ± 2.077
LeuLys: 4.541 ± 1.435
LeuLeu: 10.899 ± 0.954
LeuMet: 1.817 ± 0.871
LeuAsn: 1.817 ± 0.489
LeuPro: 4.541 ± 0.906
LeuGln: 7.266 ± 0.767
LeuArg: 8.174 ± 3.015
LeuSer: 2.725 ± 0.553
LeuThr: 3.633 ± 1.342
LeuVal: 3.633 ± 0.473
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.725 ± 0.553
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.633 ± 2.641
MetCys: 0.0 ± 0.0
MetAsp: 1.817 ± 1.32
MetGlu: 2.725 ± 1.005
MetPhe: 0.908 ± 0.843
MetGly: 3.633 ± 1.811
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.908 ± 0.519
MetLeu: 0.908 ± 0.519
MetMet: 0.908 ± 0.843
MetAsn: 0.908 ± 0.519
MetPro: 1.817 ± 0.489
MetGln: 0.0 ± 0.0
MetArg: 2.725 ± 1.277
MetSer: 1.817 ± 1.039
MetThr: 2.725 ± 2.53
MetVal: 1.817 ± 0.871
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.908 ± 0.843
AsnCys: 0.0 ± 0.0
AsnAsp: 1.817 ± 1.32
AsnGlu: 1.817 ± 0.489
AsnPhe: 1.817 ± 1.039
AsnGly: 0.908 ± 0.843
AsnHis: 0.908 ± 0.519
AsnIle: 2.725 ± 1.558
AsnLys: 0.0 ± 0.0
AsnLeu: 2.725 ± 1.966
AsnMet: 0.0 ± 0.807
AsnAsn: 0.0 ± 0.0
AsnPro: 3.633 ± 0.955
AsnGln: 0.908 ± 0.843
AsnArg: 1.817 ± 1.687
AsnSer: 0.0 ± 0.0
AsnThr: 1.817 ± 1.039
AsnVal: 1.817 ± 0.489
AsnTrp: 1.817 ± 0.489
AsnTyr: 2.725 ± 1.558
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.633 ± 0.955
ProCys: 0.0 ± 0.0
ProAsp: 3.633 ± 1.446
ProGlu: 6.358 ± 2.443
ProPhe: 1.817 ± 1.039
ProGly: 0.0 ± 0.0
ProHis: 1.817 ± 0.489
ProIle: 4.541 ± 1.74
ProLys: 0.908 ± 0.843
ProLeu: 7.266 ± 0.945
ProMet: 0.908 ± 1.023
ProAsn: 0.0 ± 0.0
ProPro: 2.725 ± 0.553
ProGln: 2.725 ± 0.553
ProArg: 2.725 ± 0.553
ProSer: 6.358 ± 2.53
ProThr: 2.725 ± 0.553
ProVal: 4.541 ± 2.597
ProTrp: 0.908 ± 0.519
ProTyr: 2.725 ± 1.005
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.725 ± 1.966
GlnCys: 0.0 ± 0.0
GlnAsp: 2.725 ± 0.553
GlnGlu: 1.817 ± 1.687
GlnPhe: 2.725 ± 1.558
GlnGly: 5.45 ± 1.69
GlnHis: 1.817 ± 1.039
GlnIle: 2.725 ± 0.553
GlnLys: 0.908 ± 0.519
GlnLeu: 3.633 ± 0.955
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.908 ± 0.519
GlnGln: 2.725 ± 0.553
GlnArg: 4.541 ± 0.906
GlnSer: 5.45 ± 1.107
GlnThr: 2.725 ± 1.277
GlnVal: 0.908 ± 0.519
GlnTrp: 3.633 ± 1.741
GlnTyr: 0.908 ± 0.519
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.541 ± 1.74
ArgCys: 0.0 ± 0.0
ArgAsp: 6.358 ± 0.115
ArgGlu: 3.633 ± 0.955
ArgPhe: 3.633 ± 0.955
ArgGly: 3.633 ± 1.446
ArgHis: 0.0 ± 0.0
ArgIle: 0.908 ± 0.519
ArgLys: 3.633 ± 0.955
ArgLeu: 9.083 ± 1.812
ArgMet: 3.633 ± 0.473
ArgAsn: 3.633 ± 0.978
ArgPro: 4.541 ± 1.435
ArgGln: 0.908 ± 0.843
ArgArg: 9.083 ± 2.445
ArgSer: 4.541 ± 0.522
ArgThr: 2.725 ± 1.827
ArgVal: 5.45 ± 1.69
ArgTrp: 0.908 ± 0.519
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.083 ± 1.771
SerCys: 0.0 ± 0.0
SerAsp: 5.45 ± 2.01
SerGlu: 2.725 ± 1.277
SerPhe: 2.725 ± 0.553
SerGly: 1.817 ± 0.489
SerHis: 1.817 ± 0.489
SerIle: 2.725 ± 1.277
SerLys: 1.817 ± 1.687
SerLeu: 3.633 ± 0.955
SerMet: 0.908 ± 0.843
SerAsn: 0.908 ± 0.519
SerPro: 4.541 ± 0.522
SerGln: 1.817 ± 1.039
SerArg: 3.633 ± 1.446
SerSer: 0.908 ± 0.843
SerThr: 2.725 ± 0.845
SerVal: 4.541 ± 0.906
SerTrp: 0.908 ± 0.519
SerTyr: 3.633 ± 1.741
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.266 ± 2.055
ThrCys: 0.0 ± 0.0
ThrAsp: 5.45 ± 3.019
ThrGlu: 4.541 ± 1.435
ThrPhe: 3.633 ± 0.955
ThrGly: 2.725 ± 0.845
ThrHis: 1.817 ± 1.039
ThrIle: 0.908 ± 0.843
ThrLys: 2.725 ± 1.277
ThrLeu: 6.358 ± 1.41
ThrMet: 0.0 ± 0.0
ThrAsn: 0.908 ± 0.519
ThrPro: 3.633 ± 1.811
ThrGln: 2.725 ± 0.553
ThrArg: 5.45 ± 0.928
ThrSer: 3.633 ± 2.077
ThrThr: 4.541 ± 0.522
ThrVal: 0.0 ± 0.0
ThrTrp: 1.817 ± 0.871
ThrTyr: 0.908 ± 0.843
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.358 ± 3.539
ValCys: 0.0 ± 0.0
ValAsp: 5.45 ± 1.708
ValGlu: 10.899 ± 1.856
ValPhe: 0.908 ± 0.519
ValGly: 8.174 ± 1.854
ValHis: 3.633 ± 2.109
ValIle: 4.541 ± 1.807
ValLys: 1.817 ± 0.489
ValLeu: 4.541 ± 1.494
ValMet: 1.817 ± 0.871
ValAsn: 1.817 ± 1.687
ValPro: 3.633 ± 0.473
ValGln: 1.817 ± 0.489
ValArg: 2.725 ± 0.553
ValSer: 2.725 ± 0.553
ValThr: 3.633 ± 1.342
ValVal: 6.358 ± 1.218
ValTrp: 0.908 ± 1.023
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.908 ± 0.519
TrpCys: 0.0 ± 0.0
TrpAsp: 0.908 ± 0.519
TrpGlu: 0.908 ± 0.843
TrpPhe: 1.817 ± 1.039
TrpGly: 0.908 ± 0.843
TrpHis: 0.0 ± 0.0
TrpIle: 0.908 ± 1.023
TrpLys: 1.817 ± 0.489
TrpLeu: 4.541 ± 1.77
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.817 ± 1.039
TrpGln: 0.0 ± 0.0
TrpArg: 2.725 ± 1.005
TrpSer: 0.0 ± 0.0
TrpThr: 0.908 ± 0.519
TrpVal: 1.817 ± 2.045
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.725 ± 0.553
TyrCys: 1.817 ± 1.039
TyrAsp: 3.633 ± 2.077
TyrGlu: 2.725 ± 1.005
TyrPhe: 0.908 ± 0.519
TyrGly: 1.817 ± 1.039
TyrHis: 0.908 ± 1.023
TyrIle: 0.908 ± 0.519
TyrLys: 3.633 ± 0.978
TyrLeu: 1.817 ± 1.039
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 2.725 ± 1.005
TyrGln: 0.0 ± 0.0
TyrArg: 1.817 ± 0.871
TyrSer: 0.908 ± 1.023
TyrThr: 3.633 ± 0.955
TyrVal: 0.0 ± 0.0
TyrTrp: 0.908 ± 0.519
TyrTyr: 0.908 ± 0.519
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1102 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
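The note above gives only the method name, so the following Python sketch is one plausible reading rather than the database's actual procedure. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported ± value is the standard deviation across replicates. The function names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one one-letter sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumptions: proteins are the resampling unit, each replicate draws
    len(proteins) proteins with replacement, and the error is the sample
    standard deviation of the replicate frequencies.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MAAKAA", "AAGLLK", "MKLVAA"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # pair absent from every protein
```

With only three proteins, as in this proteome, each replicate is drawn from a very small pool, which is consistent with the relatively large errors seen for many cells in the table.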

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski