Amino acid dipeptide frequency for Heterosigma akashiwo virus 01 (HaV01)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
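The counting and colour-coding described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, dipeptides are counted within each protein only (an assumption, the page does not say how protein boundaries are handled), and the expected value is the uniform 1/400 = 2.5‰.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: pairs are counted inside each sequence only, so no
    dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the standard 20 to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide relative to the uniform expectation (2.5 per mille)."""
    labels = {}
    for dipeptide, value in freqs.items():
        if value > expected:
            labels[dipeptide] = "over"      # shown in red in the table
        elif value < expected:
            labels[dipeptide] = "under"     # shown in blue in the table
        else:
            labels[dipeptide] = "expected"
    return labels
```

For example, `classify(dipeptide_permille(["MKKLI", "MDEL"]))` returns a label for each of the 441 dipeptides. Dividing a per mille value by 1000 gives the plain fraction.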

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.341 ± 1.255
AlaCys: 0.0 ± 0.0
AlaAsp: 0.936 ± 1.041
AlaGlu: 3.745 ± 1.556
AlaPhe: 1.873 ± 1.242
AlaGly: 1.404 ± 0.879
AlaHis: 0.468 ± 0.521
AlaIle: 3.745 ± 1.616
AlaLys: 1.873 ± 0.581
AlaLeu: 2.341 ± 1.857
AlaMet: 0.936 ± 0.534
AlaAsn: 1.873 ± 0.961
AlaPro: 1.404 ± 0.569
AlaGln: 0.936 ± 1.041
AlaArg: 0.936 ± 0.514
AlaSer: 3.745 ± 0.932
AlaThr: 2.341 ± 0.791
AlaVal: 0.936 ± 1.041
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.809 ± 1.165
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.936 ± 0.534
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.936 ± 0.534
CysHis: 0.0 ± 0.0
CysIle: 1.873 ± 0.662
CysLys: 0.936 ± 0.514
CysLeu: 1.404 ± 0.637
CysMet: 0.936 ± 0.621
CysAsn: 1.404 ± 0.645
CysPro: 0.468 ± 0.712
CysGln: 0.468 ± 0.267
CysArg: 0.936 ± 0.388
CysSer: 0.936 ± 0.534
CysThr: 0.468 ± 0.521
CysVal: 1.873 ± 1.068
CysTrp: 0.0 ± 0.0
CysTyr: 0.936 ± 0.534
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.341 ± 0.419
AspCys: 0.936 ± 0.621
AspAsp: 6.554 ± 5.008
AspGlu: 4.213 ± 2.241
AspPhe: 4.213 ± 1.139
AspGly: 1.873 ± 1.029
AspHis: 1.404 ± 0.623
AspIle: 7.959 ± 2.424
AspLys: 5.15 ± 1.15
AspLeu: 3.745 ± 1.377
AspMet: 1.873 ± 1.029
AspAsn: 4.682 ± 0.846
AspPro: 2.341 ± 0.951
AspGln: 0.936 ± 0.534
AspArg: 1.873 ± 0.777
AspSer: 3.277 ± 1.627
AspThr: 3.745 ± 1.068
AspVal: 3.277 ± 0.283
AspTrp: 0.936 ± 0.388
AspTyr: 1.404 ± 0.645
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.809 ± 1.27
GluCys: 1.873 ± 0.495
GluAsp: 5.15 ± 1.515
GluGlu: 3.745 ± 1.432
GluPhe: 3.277 ± 1.007
GluGly: 1.873 ± 0.809
GluHis: 2.809 ± 1.046
GluIle: 6.554 ± 0.958
GluLys: 5.618 ± 1.121
GluLeu: 9.363 ± 1.003
GluMet: 1.404 ± 0.801
GluAsn: 7.959 ± 2.348
GluPro: 0.468 ± 0.267
GluGln: 1.873 ± 1.461
GluArg: 5.618 ± 1.744
GluSer: 3.277 ± 1.118
GluThr: 4.213 ± 1.868
GluVal: 2.809 ± 0.832
GluTrp: 1.404 ± 0.623
GluTyr: 3.277 ± 0.952
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.277 ± 1.764
PheCys: 2.341 ± 1.998
PheAsp: 5.15 ± 1.442
PheGlu: 1.873 ± 0.809
PhePhe: 2.809 ± 1.853
PheGly: 3.277 ± 0.573
PheHis: 0.936 ± 0.534
PheIle: 2.809 ± 1.15
PheLys: 2.809 ± 0.739
PheLeu: 4.213 ± 1.309
PheMet: 1.873 ± 1.068
PheAsn: 2.809 ± 1.245
PhePro: 3.277 ± 1.37
PheGln: 0.0 ± 0.0
PheArg: 1.873 ± 1.708
PheSer: 4.682 ± 1.014
PheThr: 4.213 ± 1.252
PheVal: 1.873 ± 1.019
PheTrp: 1.404 ± 0.645
PheTyr: 1.873 ± 0.754
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.404 ± 0.879
GlyCys: 0.468 ± 0.267
GlyAsp: 3.277 ± 1.764
GlyGlu: 3.745 ± 1.773
GlyPhe: 1.873 ± 1.068
GlyGly: 1.873 ± 1.631
GlyHis: 0.936 ± 1.041
GlyIle: 5.618 ± 1.754
GlyLys: 3.745 ± 1.763
GlyLeu: 4.213 ± 2.111
GlyMet: 1.404 ± 0.879
GlyAsn: 2.809 ± 1.053
GlyPro: 0.0 ± 0.0
GlyGln: 3.277 ± 0.757
GlyArg: 3.277 ± 1.37
GlySer: 1.404 ± 0.645
GlyThr: 5.15 ± 3.515
GlyVal: 1.873 ± 1.029
GlyTrp: 0.468 ± 0.267
GlyTyr: 1.404 ± 0.416
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.873 ± 0.961
HisCys: 0.0 ± 0.0
HisAsp: 1.404 ± 0.654
HisGlu: 3.277 ± 1.637
HisPhe: 0.936 ± 0.534
HisGly: 0.0 ± 0.0
HisHis: 0.468 ± 0.521
HisIle: 0.936 ± 0.731
HisLys: 1.404 ± 0.801
HisLeu: 0.0 ± 0.0
HisMet: 0.936 ± 0.534
HisAsn: 0.0 ± 0.0
HisPro: 0.936 ± 0.854
HisGln: 1.404 ± 0.416
HisArg: 0.468 ± 0.267
HisSer: 0.936 ± 0.621
HisThr: 1.404 ± 0.879
HisVal: 1.873 ± 0.505
HisTrp: 0.0 ± 0.0
HisTyr: 0.936 ± 0.621
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.873 ± 1.068
IleCys: 0.0 ± 0.0
IleAsp: 7.022 ± 0.82
IleGlu: 8.895 ± 3.39
IlePhe: 5.618 ± 3.725
IleGly: 7.022 ± 2.632
IleHis: 0.0 ± 0.0
IleIle: 10.3 ± 2.228
IleLys: 8.895 ± 3.356
IleLeu: 6.086 ± 1.808
IleMet: 0.936 ± 0.621
IleAsn: 3.277 ± 0.682
IlePro: 3.277 ± 0.475
IleGln: 0.936 ± 0.534
IleArg: 4.682 ± 0.386
IleSer: 6.086 ± 0.864
IleThr: 3.745 ± 0.24
IleVal: 6.086 ± 0.872
IleTrp: 1.404 ± 0.416
IleTyr: 3.745 ± 1.161
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.404 ± 0.645
LysCys: 1.404 ± 0.801
LysAsp: 1.873 ± 1.029
LysGlu: 7.022 ± 1.98
LysPhe: 2.341 ± 1.229
LysGly: 4.213 ± 1.288
LysHis: 2.809 ± 0.821
LysIle: 7.022 ± 1.701
LysLys: 11.236 ± 2.798
LysLeu: 7.491 ± 1.625
LysMet: 3.745 ± 1.423
LysAsn: 6.086 ± 1.484
LysPro: 2.341 ± 1.335
LysGln: 2.341 ± 1.031
LysArg: 4.213 ± 1.911
LysSer: 4.682 ± 1.473
LysThr: 4.682 ± 3.104
LysVal: 4.213 ± 1.025
LysTrp: 0.468 ± 0.267
LysTyr: 1.873 ± 0.495
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.341 ± 0.759
LeuCys: 0.468 ± 0.267
LeuAsp: 3.277 ± 0.894
LeuGlu: 4.213 ± 1.911
LeuPhe: 2.341 ± 0.803
LeuGly: 2.809 ± 0.739
LeuHis: 0.468 ± 0.267
LeuIle: 7.491 ± 1.601
LeuLys: 6.554 ± 1.623
LeuLeu: 6.554 ± 2.084
LeuMet: 3.745 ± 1.187
LeuAsn: 5.618 ± 0.974
LeuPro: 4.213 ± 0.747
LeuGln: 5.15 ± 2.576
LeuArg: 5.618 ± 0.573
LeuSer: 3.745 ± 1.698
LeuThr: 5.15 ± 1.109
LeuVal: 4.213 ± 2.779
LeuTrp: 1.404 ± 0.645
LeuTyr: 5.15 ± 0.515
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.936 ± 0.388
MetCys: 0.468 ± 0.267
MetAsp: 2.809 ± 0.821
MetGlu: 1.873 ± 0.754
MetPhe: 0.936 ± 0.8
MetGly: 1.404 ± 0.416
MetHis: 0.936 ± 0.621
MetIle: 2.341 ± 0.803
MetLys: 3.277 ± 1.869
MetLeu: 2.341 ± 1.031
MetMet: 0.0 ± 0.0
MetAsn: 3.277 ± 1.615
MetPro: 0.936 ± 0.534
MetGln: 0.936 ± 0.388
MetArg: 1.404 ± 0.801
MetSer: 1.404 ± 0.801
MetThr: 0.936 ± 0.388
MetVal: 0.468 ± 0.267
MetTrp: 0.0 ± 0.0
MetTyr: 1.873 ± 0.754
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.809 ± 1.937
AsnCys: 1.404 ± 0.801
AsnAsp: 4.213 ± 2.361
AsnGlu: 5.15 ± 1.716
AsnPhe: 5.15 ± 1.376
AsnGly: 5.15 ± 2.076
AsnHis: 2.341 ± 0.759
AsnIle: 6.086 ± 1.808
AsnLys: 3.277 ± 1.115
AsnLeu: 2.341 ± 0.656
AsnMet: 2.341 ± 0.833
AsnAsn: 3.745 ± 1.626
AsnPro: 0.936 ± 0.388
AsnGln: 2.341 ± 0.995
AsnArg: 2.341 ± 1.229
AsnSer: 4.213 ± 0.922
AsnThr: 5.618 ± 1.606
AsnVal: 4.682 ± 0.488
AsnTrp: 0.468 ± 0.267
AsnTyr: 3.277 ± 0.766
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 1.404 ± 0.801
ProAsp: 2.809 ± 0.739
ProGlu: 3.277 ± 1.042
ProPhe: 1.404 ± 0.801
ProGly: 2.341 ± 0.791
ProHis: 0.468 ± 0.267
ProIle: 1.404 ± 0.637
ProLys: 1.404 ± 0.801
ProLeu: 2.341 ± 0.791
ProMet: 1.404 ± 0.801
ProAsn: 1.404 ± 0.654
ProPro: 0.468 ± 0.533
ProGln: 0.936 ± 0.388
ProArg: 1.404 ± 0.801
ProSer: 5.618 ± 1.86
ProThr: 3.277 ± 0.475
ProVal: 0.936 ± 0.854
ProTrp: 0.0 ± 0.0
ProTyr: 1.404 ± 0.416
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.468 ± 0.712
GlnCys: 0.468 ± 0.267
GlnAsp: 0.936 ± 0.388
GlnGlu: 0.936 ± 0.514
GlnPhe: 1.404 ± 1.562
GlnGly: 0.936 ± 0.534
GlnHis: 0.936 ± 0.731
GlnIle: 2.809 ± 1.053
GlnLys: 5.15 ± 1.15
GlnLeu: 4.213 ± 2.111
GlnMet: 1.404 ± 0.416
GlnAsn: 3.277 ± 0.283
GlnPro: 1.873 ± 0.777
GlnGln: 1.404 ± 0.416
GlnArg: 1.404 ± 0.879
GlnSer: 0.936 ± 0.534
GlnThr: 1.404 ± 0.879
GlnVal: 2.341 ± 0.574
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.404 ± 1.484
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.468 ± 0.712
ArgCys: 0.0 ± 0.0
ArgAsp: 2.809 ± 0.238
ArgGlu: 6.086 ± 2.529
ArgPhe: 1.873 ± 0.581
ArgGly: 0.936 ± 0.534
ArgHis: 1.873 ± 0.495
ArgIle: 6.086 ± 1.674
ArgLys: 4.213 ± 0.745
ArgLeu: 4.213 ± 1.377
ArgMet: 0.0 ± 0.0
ArgAsn: 5.15 ± 1.207
ArgPro: 1.873 ± 0.777
ArgGln: 2.341 ± 0.419
ArgArg: 1.404 ± 0.645
ArgSer: 2.341 ± 1.949
ArgThr: 0.468 ± 0.267
ArgVal: 3.277 ± 1.299
ArgTrp: 0.468 ± 0.267
ArgTyr: 0.468 ± 0.267
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.873 ± 0.777
SerCys: 0.0 ± 0.0
SerAsp: 2.809 ± 2.027
SerGlu: 2.341 ± 2.122
SerPhe: 4.682 ± 0.582
SerGly: 3.745 ± 1.39
SerHis: 0.936 ± 1.041
SerIle: 4.682 ± 1.003
SerLys: 4.213 ± 2.456
SerLeu: 7.022 ± 1.181
SerMet: 0.936 ± 0.534
SerAsn: 5.15 ± 2.635
SerPro: 2.341 ± 1.49
SerGln: 2.809 ± 1.055
SerArg: 3.277 ± 0.573
SerSer: 2.809 ± 1.15
SerThr: 3.277 ± 1.637
SerVal: 3.745 ± 0.992
SerTrp: 0.936 ± 0.731
SerTyr: 2.809 ± 1.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.873 ± 1.392
ThrCys: 1.404 ± 0.879
ThrAsp: 4.213 ± 1.868
ThrGlu: 6.086 ± 1.086
ThrPhe: 3.745 ± 1.377
ThrGly: 4.682 ± 2.566
ThrHis: 0.0 ± 0.0
ThrIle: 5.618 ± 0.477
ThrLys: 5.618 ± 1.606
ThrLeu: 3.745 ± 1.133
ThrMet: 1.873 ± 0.581
ThrAsn: 4.213 ± 1.397
ThrPro: 1.873 ± 0.777
ThrGln: 1.873 ± 1.068
ThrArg: 2.341 ± 0.574
ThrSer: 2.341 ± 2.044
ThrThr: 4.213 ± 1.501
ThrVal: 3.745 ± 0.861
ThrTrp: 0.936 ± 0.388
ThrTyr: 1.404 ± 0.879
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.682 ± 1.976
ValCys: 0.936 ± 0.514
ValAsp: 3.745 ± 0.24
ValGlu: 4.213 ± 1.142
ValPhe: 5.618 ± 1.291
ValGly: 0.468 ± 0.267
ValHis: 0.468 ± 0.712
ValIle: 2.809 ± 1.15
ValLys: 3.277 ± 0.757
ValLeu: 3.745 ± 0.627
ValMet: 2.341 ± 0.827
ValAsn: 1.873 ± 0.505
ValPro: 2.341 ± 1.031
ValGln: 2.809 ± 1.271
ValArg: 2.809 ± 0.739
ValSer: 4.682 ± 0.839
ValThr: 2.809 ± 1.046
ValVal: 2.341 ± 1.252
ValTrp: 0.0 ± 0.0
ValTyr: 3.745 ± 1.554
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.468 ± 0.533
TrpAsp: 0.468 ± 0.521
TrpGlu: 0.936 ± 0.534
TrpPhe: 2.341 ± 0.791
TrpGly: 0.468 ± 0.267
TrpHis: 0.0 ± 0.0
TrpIle: 0.468 ± 0.267
TrpLys: 0.468 ± 0.712
TrpLeu: 0.936 ± 0.731
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.936 ± 0.534
TrpThr: 1.873 ± 0.581
TrpVal: 0.936 ± 0.388
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.873 ± 0.777
TyrCys: 0.468 ± 0.267
TyrAsp: 2.341 ± 1.154
TyrGlu: 3.745 ± 0.496
TyrPhe: 0.936 ± 0.388
TyrGly: 2.809 ± 0.574
TyrHis: 0.936 ± 0.388
TyrIle: 3.277 ± 0.975
TyrLys: 2.341 ± 0.935
TyrLeu: 4.213 ± 2.022
TyrMet: 0.0 ± 0.0
TyrAsn: 3.277 ± 1.132
TyrPro: 2.341 ± 1.335
TyrGln: 0.936 ± 0.854
TyrArg: 0.468 ± 0.521
TyrSer: 2.341 ± 0.419
TyrThr: 2.809 ± 1.602
TyrVal: 4.682 ± 0.582
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.873 ± 0.994
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2137 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
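A protein-level bootstrap of this kind can be sketched as follows. The page states only that 100 bootstrap rounds were run at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies is an assumption here, as are the function names `dipeptide_permille` and `bootstrap_error`.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes, e.g. "KK") in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation over rounds.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only 4 proteins, each resample often omits one or more of them entirely, which may explain why many of the errors above are comparable in size to the frequencies themselves.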

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski