Amino acid dipeptide frequency for Culex mononega-like virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
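As a rough illustration of what such a calculation involves, here is a minimal Python sketch. The function name `dipeptide_permille`, the one-letter input sequences and the choice to count overlapping pairs within each protein separately are assumptions made for this example; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Overlapping adjacent pairs are counted inside each protein only
    (no pair spans two proteins). Any non-standard or unknown residue
    is mapped to 'X' (Xaa). Both choices are assumptions of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKLLA", "AKLB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy input contains seven dipeptides in total, so the pair KL, which occurs twice, gets a frequency of 2/7, or about 285.7‰.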

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
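The uniform expectation and the over/under comparison described above can be sketched as follows. The helper names `uniform_expected_permille` and `classify` are hypothetical, and the sketch assumes the reference value is the uniform one (1/400, or 1/441 with Xaa); the page does not state precisely which expected value the colouring uses.

```python
def uniform_expected_permille(alphabet_size=20):
    """Expected dipeptide frequency (per mille) if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expected_permille()         # 20 amino acids: 400 pairs
expected_xaa = uniform_expected_permille(21)   # with Xaa: 441 pairs

# Two values taken from the table: LeuLeu (8.563) and CysLys (0.259).
print(expected, classify(8.563, expected), classify(0.259, expected))

# Per mille to fraction: multiply by 10**-3.
leuleu_fraction = 8.563 * 1e-3
```

With 20 amino acids the expected value is 2.5‰, so under this assumption LeuLeu counts as overrepresented and CysLys as underrepresented.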

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue; each gives frequency (‰) ± error.
Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.114 ± 0.902
AlaCys: 0.778 ± 0.692
AlaAsp: 1.816 ± 0.239
AlaGlu: 5.189 ± 0.929
AlaPhe: 2.076 ± 1.038
AlaGly: 4.93 ± 1.784
AlaHis: 0.519 ± 0.473
AlaIle: 4.67 ± 0.499
AlaLys: 3.892 ± 1.293
AlaLeu: 5.189 ± 1.512
AlaMet: 2.076 ± 0.829
AlaAsn: 2.854 ± 0.901
AlaPro: 0.778 ± 0.465
AlaGln: 2.335 ± 1.255
AlaArg: 2.595 ± 1.664
AlaSer: 3.114 ± 1.157
AlaThr: 4.152 ± 2.103
AlaVal: 5.449 ± 1.027
AlaTrp: 0.519 ± 0.512
AlaTyr: 2.076 ± 1.347
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.076 ± 0.697
CysCys: 0.519 ± 0.231
CysAsp: 1.038 ± 0.62
CysGlu: 1.816 ± 0.672
CysPhe: 0.778 ± 0.284
CysGly: 2.076 ± 0.64
CysHis: 0.778 ± 0.322
CysIle: 1.297 ± 1.015
CysLys: 0.259 ± 0.155
CysLeu: 2.854 ± 0.765
CysMet: 1.038 ± 2.117
CysAsn: 0.778 ± 0.479
CysPro: 0.778 ± 0.465
CysGln: 0.519 ± 0.231
CysArg: 1.557 ± 0.702
CysSer: 1.038 ± 0.431
CysThr: 2.076 ± 0.339
CysVal: 1.557 ± 0.964
CysTrp: 0.259 ± 0.378
CysTyr: 0.778 ± 0.364
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.335 ± 0.795
AspCys: 1.557 ± 0.416
AspAsp: 2.595 ± 1.02
AspGlu: 3.373 ± 1.114
AspPhe: 2.854 ± 0.887
AspGly: 2.076 ± 0.552
AspHis: 1.816 ± 0.739
AspIle: 3.373 ± 0.544
AspLys: 2.595 ± 0.585
AspLeu: 5.189 ± 1.876
AspMet: 1.557 ± 1.254
AspAsn: 2.076 ± 0.932
AspPro: 4.152 ± 0.806
AspGln: 2.335 ± 1.121
AspArg: 1.816 ± 0.418
AspSer: 2.595 ± 0.948
AspThr: 1.557 ± 0.922
AspVal: 2.854 ± 0.516
AspTrp: 0.259 ± 0.378
AspTyr: 2.335 ± 0.538
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.93 ± 2.197
GluCys: 1.557 ± 0.692
GluAsp: 2.854 ± 0.516
GluGlu: 6.227 ± 1.39
GluPhe: 2.335 ± 0.924
GluGly: 4.67 ± 0.997
GluHis: 1.816 ± 0.984
GluIle: 4.411 ± 1.294
GluLys: 3.373 ± 0.78
GluLeu: 7.265 ± 0.999
GluMet: 2.595 ± 0.889
GluAsn: 2.335 ± 1.179
GluPro: 1.816 ± 0.401
GluGln: 2.595 ± 1.042
GluArg: 4.152 ± 1.571
GluSer: 5.189 ± 0.9
GluThr: 2.595 ± 1.161
GluVal: 6.227 ± 1.21
GluTrp: 0.778 ± 0.402
GluTyr: 2.076 ± 1.214
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.076 ± 0.497
PheCys: 1.557 ± 0.643
PheAsp: 2.076 ± 0.921
PheGlu: 2.076 ± 0.873
PhePhe: 2.854 ± 1.201
PheGly: 1.816 ± 1.041
PheHis: 0.519 ± 0.33
PheIle: 2.335 ± 0.653
PheLys: 2.854 ± 0.944
PheLeu: 4.67 ± 1.006
PheMet: 1.038 ± 0.458
PheAsn: 1.297 ± 0.53
PhePro: 2.076 ± 0.659
PheGln: 2.595 ± 0.893
PheArg: 1.557 ± 0.416
PheSer: 3.114 ± 1.068
PheThr: 2.854 ± 1.465
PheVal: 1.816 ± 0.401
PheTrp: 0.259 ± 0.271
PheTyr: 1.557 ± 1.354
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.297 ± 0.614
GlyCys: 1.038 ± 0.62
GlyAsp: 3.892 ± 0.769
GlyGlu: 3.633 ± 0.953
GlyPhe: 1.816 ± 0.75
GlyGly: 4.67 ± 1.449
GlyHis: 1.816 ± 0.527
GlyIle: 3.633 ± 0.694
GlyLys: 4.93 ± 1.644
GlyLeu: 3.892 ± 1.127
GlyMet: 2.595 ± 0.965
GlyAsn: 2.595 ± 0.93
GlyPro: 1.816 ± 0.717
GlyGln: 2.854 ± 0.926
GlyArg: 2.854 ± 1.065
GlySer: 3.373 ± 0.568
GlyThr: 4.93 ± 0.713
GlyVal: 3.633 ± 0.81
GlyTrp: 1.297 ± 0.342
GlyTyr: 1.297 ± 0.629
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.778 ± 0.504
HisCys: 0.0 ± 0.0
HisAsp: 1.297 ± 0.507
HisGlu: 1.557 ± 0.643
HisPhe: 2.335 ± 0.528
HisGly: 1.557 ± 1.443
HisHis: 1.038 ± 0.55
HisIle: 2.076 ± 1.031
HisLys: 1.038 ± 0.32
HisLeu: 2.595 ± 0.831
HisMet: 0.519 ± 1.03
HisAsn: 1.038 ± 0.396
HisPro: 0.778 ± 0.825
HisGln: 0.778 ± 0.322
HisArg: 1.038 ± 0.329
HisSer: 1.816 ± 0.813
HisThr: 2.335 ± 0.4
HisVal: 0.778 ± 0.351
HisTrp: 0.259 ± 0.378
HisTyr: 1.038 ± 0.395
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.227 ± 1.15
IleCys: 1.557 ± 0.988
IleAsp: 3.892 ± 0.888
IleGlu: 2.595 ± 0.468
IlePhe: 2.335 ± 3.022
IleGly: 3.114 ± 0.894
IleHis: 1.038 ± 0.705
IleIle: 2.335 ± 1.034
IleLys: 3.114 ± 0.507
IleLeu: 6.227 ± 1.126
IleMet: 1.557 ± 0.673
IleAsn: 3.114 ± 1.757
IlePro: 3.114 ± 1.123
IleGln: 2.335 ± 0.678
IleArg: 4.152 ± 0.946
IleSer: 3.892 ± 0.802
IleThr: 5.968 ± 1.098
IleVal: 3.114 ± 1.166
IleTrp: 0.778 ± 0.364
IleTyr: 2.595 ± 3.02
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.595 ± 1.116
LysCys: 1.038 ± 0.329
LysAsp: 2.854 ± 1.112
LysGlu: 4.152 ± 0.784
LysPhe: 1.557 ± 0.416
LysGly: 2.595 ± 0.783
LysHis: 0.519 ± 0.31
LysIle: 4.93 ± 0.917
LysLys: 5.708 ± 1.231
LysLeu: 5.708 ± 1.324
LysMet: 1.816 ± 0.622
LysAsn: 2.854 ± 0.683
LysPro: 2.595 ± 0.749
LysGln: 1.816 ± 0.527
LysArg: 1.816 ± 1.085
LysSer: 3.633 ± 0.606
LysThr: 4.152 ± 1.167
LysVal: 3.633 ± 1.446
LysTrp: 1.297 ± 0.616
LysTyr: 2.076 ± 0.68
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.93 ± 1.038
LeuCys: 2.335 ± 0.464
LeuAsp: 4.411 ± 1.756
LeuGlu: 5.968 ± 1.055
LeuPhe: 4.411 ± 1.168
LeuGly: 2.854 ± 0.901
LeuHis: 2.595 ± 0.705
LeuIle: 4.93 ± 2.098
LeuLys: 4.93 ± 2.426
LeuLeu: 8.563 ± 1.319
LeuMet: 3.633 ± 1.627
LeuAsn: 3.892 ± 1.327
LeuPro: 2.854 ± 0.799
LeuGln: 3.892 ± 1.582
LeuArg: 4.93 ± 1.767
LeuSer: 8.044 ± 2.462
LeuThr: 5.968 ± 1.546
LeuVal: 7.006 ± 0.865
LeuTrp: 0.519 ± 0.31
LeuTyr: 5.189 ± 0.883
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.038 ± 0.395
MetCys: 1.038 ± 2.117
MetAsp: 1.557 ± 1.007
MetGlu: 2.854 ± 0.615
MetPhe: 0.778 ± 0.465
MetGly: 2.076 ± 0.751
MetHis: 1.038 ± 2.059
MetIle: 1.816 ± 0.668
MetLys: 2.335 ± 0.6
MetLeu: 1.816 ± 0.239
MetMet: 1.297 ± 0.342
MetAsn: 2.076 ± 0.579
MetPro: 1.038 ± 0.32
MetGln: 0.778 ± 0.479
MetArg: 2.595 ± 0.441
MetSer: 2.595 ± 0.883
MetThr: 2.076 ± 0.813
MetVal: 1.557 ± 0.242
MetTrp: 0.519 ± 1.03
MetTyr: 2.335 ± 1.395
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.297 ± 0.614
AsnCys: 0.519 ± 0.231
AsnAsp: 1.816 ± 0.64
AsnGlu: 2.595 ± 1.051
AsnPhe: 1.297 ± 0.493
AsnGly: 1.038 ± 0.586
AsnHis: 1.038 ± 0.32
AsnIle: 2.335 ± 1.121
AsnLys: 2.335 ± 0.343
AsnLeu: 3.633 ± 1.923
AsnMet: 0.778 ± 0.384
AsnAsn: 1.557 ± 0.673
AsnPro: 1.816 ± 0.527
AsnGln: 1.557 ± 0.6
AsnArg: 3.373 ± 0.604
AsnSer: 2.076 ± 0.634
AsnThr: 3.373 ± 0.947
AsnVal: 3.633 ± 1.238
AsnTrp: 1.038 ± 1.314
AsnTyr: 3.373 ± 1.181
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.854 ± 1.648
ProCys: 1.038 ± 0.62
ProAsp: 2.854 ± 0.456
ProGlu: 3.373 ± 1.081
ProPhe: 0.778 ± 0.284
ProGly: 3.373 ± 1.012
ProHis: 0.0 ± 0.0
ProIle: 2.335 ± 0.653
ProLys: 1.557 ± 0.569
ProLeu: 4.152 ± 0.944
ProMet: 0.778 ± 0.284
ProAsn: 1.038 ± 0.329
ProPro: 2.854 ± 1.138
ProGln: 1.557 ± 0.242
ProArg: 3.633 ± 0.606
ProSer: 3.373 ± 0.976
ProThr: 1.816 ± 0.64
ProVal: 2.595 ± 1.141
ProTrp: 0.259 ± 0.155
ProTyr: 1.816 ± 0.487
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.038 ± 0.395
GlnCys: 1.038 ± 0.32
GlnAsp: 1.816 ± 0.487
GlnGlu: 1.297 ± 0.965
GlnPhe: 1.557 ± 0.6
GlnGly: 0.778 ± 0.364
GlnHis: 0.778 ± 0.364
GlnIle: 3.114 ± 1.052
GlnLys: 2.854 ± 1.112
GlnLeu: 3.633 ± 1.342
GlnMet: 1.038 ± 0.807
GlnAsn: 1.297 ± 0.829
GlnPro: 1.297 ± 0.749
GlnGln: 0.778 ± 0.284
GlnArg: 2.076 ± 0.684
GlnSer: 2.854 ± 0.465
GlnThr: 1.816 ± 1.145
GlnVal: 3.633 ± 1.128
GlnTrp: 0.519 ± 0.314
GlnTyr: 0.778 ± 0.284
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.93 ± 1.176
ArgCys: 1.816 ± 1.197
ArgAsp: 2.335 ± 0.746
ArgGlu: 4.152 ± 0.645
ArgPhe: 2.595 ± 0.89
ArgGly: 3.114 ± 0.686
ArgHis: 2.335 ± 0.825
ArgIle: 3.114 ± 1.327
ArgLys: 2.076 ± 0.965
ArgLeu: 4.411 ± 0.809
ArgMet: 1.816 ± 0.901
ArgAsn: 3.114 ± 0.647
ArgPro: 1.297 ± 0.322
ArgGln: 1.816 ± 0.82
ArgArg: 4.152 ± 1.063
ArgSer: 5.449 ± 1.737
ArgThr: 4.67 ± 1.037
ArgVal: 3.633 ± 0.539
ArgTrp: 0.519 ± 0.231
ArgTyr: 1.816 ± 0.889
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.93 ± 0.893
SerCys: 2.595 ± 0.691
SerAsp: 2.595 ± 0.705
SerGlu: 4.411 ± 1.765
SerPhe: 3.892 ± 1.016
SerGly: 5.708 ± 1.049
SerHis: 1.297 ± 0.478
SerIle: 5.189 ± 2.202
SerLys: 4.152 ± 1.485
SerLeu: 8.044 ± 1.409
SerMet: 2.076 ± 0.97
SerAsn: 2.076 ± 0.792
SerPro: 2.335 ± 1.1
SerGln: 1.297 ± 0.631
SerArg: 2.854 ± 1.022
SerSer: 4.411 ± 1.499
SerThr: 2.595 ± 1.117
SerVal: 3.892 ± 0.849
SerTrp: 1.038 ± 0.458
SerTyr: 3.114 ± 1.154
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.854 ± 1.314
ThrCys: 1.557 ± 0.416
ThrAsp: 3.633 ± 0.755
ThrGlu: 5.449 ± 2.085
ThrPhe: 1.816 ± 0.534
ThrGly: 4.411 ± 1.087
ThrHis: 2.076 ± 0.921
ThrIle: 3.892 ± 1.637
ThrLys: 3.633 ± 0.779
ThrLeu: 7.265 ± 1.776
ThrMet: 1.816 ± 0.668
ThrAsn: 1.557 ± 0.976
ThrPro: 3.373 ± 1.154
ThrGln: 1.557 ± 0.473
ThrArg: 4.411 ± 1.027
ThrSer: 5.189 ± 1.876
ThrThr: 4.411 ± 2.187
ThrVal: 4.93 ± 0.838
ThrTrp: 0.519 ± 0.31
ThrTyr: 2.335 ± 0.528
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.708 ± 1.593
ValCys: 0.259 ± 0.155
ValAsp: 2.076 ± 0.791
ValGlu: 5.189 ± 0.862
ValPhe: 2.854 ± 0.636
ValGly: 4.411 ± 1.087
ValHis: 1.557 ± 1.204
ValIle: 3.373 ± 1.455
ValLys: 2.854 ± 0.765
ValLeu: 4.67 ± 1.134
ValMet: 3.114 ± 0.476
ValAsn: 3.373 ± 1.081
ValPro: 3.892 ± 0.37
ValGln: 1.557 ± 1.184
ValArg: 4.67 ± 1.139
ValSer: 4.67 ± 0.785
ValThr: 5.449 ± 1.8
ValVal: 4.411 ± 1.322
ValTrp: 0.519 ± 0.33
ValTyr: 2.335 ± 0.343
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.778 ± 1.112
TrpCys: 1.038 ± 1.047
TrpAsp: 0.259 ± 0.377
TrpGlu: 1.297 ± 0.614
TrpPhe: 1.038 ± 0.458
TrpGly: 0.519 ± 0.231
TrpHis: 0.519 ± 0.33
TrpIle: 0.519 ± 0.485
TrpLys: 1.038 ± 0.32
TrpLeu: 0.519 ± 0.31
TrpMet: 0.259 ± 0.155
TrpAsn: 0.0 ± 0.0
TrpPro: 0.259 ± 0.378
TrpGln: 0.259 ± 0.155
TrpArg: 1.816 ± 0.724
TrpSer: 0.519 ± 0.33
TrpThr: 0.259 ± 0.155
TrpVal: 0.778 ± 0.465
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.335 ± 1.272
TyrCys: 1.038 ± 1.047
TyrAsp: 3.633 ± 0.924
TyrGlu: 2.595 ± 1.227
TyrPhe: 1.038 ± 0.62
TyrGly: 2.076 ± 0.658
TyrHis: 1.557 ± 0.45
TyrIle: 3.373 ± 2.09
TyrLys: 2.076 ± 0.936
TyrLeu: 1.557 ± 1.132
TyrMet: 1.557 ± 0.987
TyrAsn: 1.557 ± 0.627
TyrPro: 3.114 ± 0.643
TyrGln: 0.778 ± 0.69
TyrArg: 3.114 ± 0.823
TyrSer: 1.557 ± 0.504
TyrThr: 3.633 ± 1.001
TyrVal: 1.816 ± 1.286
TyrTrp: 0.519 ± 0.314
TyrTyr: 1.557 ± 0.473
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (3855 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
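One possible reading of protein-level bootstrapping is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the use of the sample standard deviation as the error, and the toy sequences are assumptions for illustration; the page itself only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues within a protein are never resampled individually.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences in one-letter code (hypothetical, not from this proteome).
proteins = ["MKLLA", "AKLLL", "GGGG"]
err = bootstrap_error(proteins, "LL")
print(f"LL: {pair_permille(proteins, 'LL'):.3f} ± {err:.3f}")
```

A dipeptide that never occurs yields 0.0 in every resample and therefore an error of 0.0, which matches the `0.0 ± 0.0` entries for the Xaa pairs above.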

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski