Amino acid dipepetide frequency for Coleus vein necrosis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.042AlaAla: 7.042 ± 3.836
0.704AlaCys: 0.704 ± 0.589
2.817AlaAsp: 2.817 ± 1.1
5.986AlaGlu: 5.986 ± 1.215
4.225AlaPhe: 4.225 ± 1.906
4.577AlaGly: 4.577 ± 0.839
1.408AlaHis: 1.408 ± 0.561
4.93AlaIle: 4.93 ± 0.869
5.986AlaLys: 5.986 ± 1.753
7.746AlaLeu: 7.746 ± 1.757
1.408AlaMet: 1.408 ± 1.078
3.521AlaAsn: 3.521 ± 1.066
5.634AlaPro: 5.634 ± 1.737
1.056AlaGln: 1.056 ± 0.978
3.169AlaArg: 3.169 ± 2.108
3.521AlaSer: 3.521 ± 0.963
3.873AlaThr: 3.873 ± 0.948
7.394AlaVal: 7.394 ± 2.054
0.352AlaTrp: 0.352 ± 0.19
1.408AlaTyr: 1.408 ± 0.76
0.0AlaXaa: 0.0 ± 0.0
Cys
1.408CysAla: 1.408 ± 0.561
0.352CysCys: 0.352 ± 0.19
1.056CysAsp: 1.056 ± 0.57
1.761CysGlu: 1.761 ± 1.235
1.056CysPhe: 1.056 ± 0.57
2.465CysGly: 2.465 ± 1.253
0.704CysHis: 0.704 ± 0.775
2.465CysIle: 2.465 ± 1.253
1.056CysLys: 1.056 ± 0.692
2.817CysLeu: 2.817 ± 1.876
0.352CysMet: 0.352 ± 0.19
0.704CysAsn: 0.704 ± 1.143
0.704CysPro: 0.704 ± 0.775
1.056CysGln: 1.056 ± 0.558
1.408CysArg: 1.408 ± 0.645
1.761CysSer: 1.761 ± 0.733
1.761CysThr: 1.761 ± 0.811
2.113CysVal: 2.113 ± 1.725
0.352CysTrp: 0.352 ± 0.622
2.113CysTyr: 2.113 ± 1.14
0.0CysXaa: 0.0 ± 0.0
Asp
5.282AspAla: 5.282 ± 1.859
1.056AspCys: 1.056 ± 0.57
1.056AspAsp: 1.056 ± 0.57
4.225AspGlu: 4.225 ± 1.148
2.465AspPhe: 2.465 ± 1.605
3.169AspGly: 3.169 ± 1.252
0.0AspHis: 0.0 ± 0.0
3.169AspIle: 3.169 ± 1.21
1.408AspLys: 1.408 ± 0.76
4.93AspLeu: 4.93 ± 1.461
1.408AspMet: 1.408 ± 0.786
2.113AspAsn: 2.113 ± 1.35
2.465AspPro: 2.465 ± 2.959
0.704AspGln: 0.704 ± 0.589
2.465AspArg: 2.465 ± 1.406
2.465AspSer: 2.465 ± 1.133
1.408AspThr: 1.408 ± 1.078
3.873AspVal: 3.873 ± 1.618
0.704AspTrp: 0.704 ± 0.38
1.761AspTyr: 1.761 ± 0.95
0.0AspXaa: 0.0 ± 0.0
Glu
5.282GluAla: 5.282 ± 1.855
0.704GluCys: 0.704 ± 0.38
2.465GluAsp: 2.465 ± 0.705
4.225GluGlu: 4.225 ± 1.906
3.873GluPhe: 3.873 ± 1.193
4.225GluGly: 4.225 ± 1.148
1.056GluHis: 1.056 ± 0.57
3.169GluIle: 3.169 ± 0.792
5.634GluLys: 5.634 ± 1.278
7.746GluLeu: 7.746 ± 2.315
2.113GluMet: 2.113 ± 1.133
1.408GluAsn: 1.408 ± 0.561
2.113GluPro: 2.113 ± 1.14
2.465GluGln: 2.465 ± 0.945
2.113GluArg: 2.113 ± 1.176
4.225GluSer: 4.225 ± 1.444
3.169GluThr: 3.169 ± 1.549
7.746GluVal: 7.746 ± 2.7
0.352GluTrp: 0.352 ± 0.19
1.761GluTyr: 1.761 ± 0.566
0.0GluXaa: 0.0 ± 0.0
Phe
4.93PheAla: 4.93 ± 1.494
1.761PheCys: 1.761 ± 0.66
3.873PheAsp: 3.873 ± 1.125
5.282PheGlu: 5.282 ± 1.859
1.056PhePhe: 1.056 ± 0.57
3.873PheGly: 3.873 ± 3.598
1.408PheHis: 1.408 ± 0.561
3.169PheIle: 3.169 ± 1.188
1.761PheLys: 1.761 ± 0.66
4.225PheLeu: 4.225 ± 1.623
1.408PheMet: 1.408 ± 0.76
3.873PheAsn: 3.873 ± 0.973
2.465PhePro: 2.465 ± 0.953
2.817PheGln: 2.817 ± 1.109
1.761PheArg: 1.761 ± 0.95
5.282PheSer: 5.282 ± 1.63
4.225PheThr: 4.225 ± 1.144
2.817PheVal: 2.817 ± 1.19
0.0PheTrp: 0.0 ± 0.0
1.056PheTyr: 1.056 ± 0.675
0.0PheXaa: 0.0 ± 0.0
Gly
2.817GlyAla: 2.817 ± 0.785
2.113GlyCys: 2.113 ± 1.091
3.873GlyAsp: 3.873 ± 0.948
2.817GlyGlu: 2.817 ± 1.539
4.577GlyPhe: 4.577 ± 0.93
5.634GlyGly: 5.634 ± 1.982
1.408GlyHis: 1.408 ± 0.76
2.817GlyIle: 2.817 ± 1.52
5.634GlyLys: 5.634 ± 1.099
8.803GlyLeu: 8.803 ± 1.229
1.761GlyMet: 1.761 ± 0.66
1.056GlyAsn: 1.056 ± 0.675
1.761GlyPro: 1.761 ± 0.823
1.056GlyGln: 1.056 ± 0.558
5.282GlyArg: 5.282 ± 1.528
8.451GlySer: 8.451 ± 1.084
2.817GlyThr: 2.817 ± 1.68
3.169GlyVal: 3.169 ± 0.718
1.056GlyTrp: 1.056 ± 1.081
2.817GlyTyr: 2.817 ± 1.528
0.0GlyXaa: 0.0 ± 0.0
His
2.817HisAla: 2.817 ± 1.109
1.056HisCys: 1.056 ± 1.251
0.704HisAsp: 0.704 ± 0.38
2.465HisGlu: 2.465 ± 0.945
0.352HisPhe: 0.352 ± 0.19
2.465HisGly: 2.465 ± 0.953
0.704HisHis: 0.704 ± 0.38
1.761HisIle: 1.761 ± 1.054
1.408HisLys: 1.408 ± 0.76
1.761HisLeu: 1.761 ± 0.673
0.352HisMet: 0.352 ± 0.615
1.056HisAsn: 1.056 ± 0.516
1.056HisPro: 1.056 ± 0.57
0.352HisGln: 0.352 ± 0.19
1.056HisArg: 1.056 ± 0.558
3.169HisSer: 3.169 ± 1.4
0.352HisThr: 0.352 ± 0.892
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
2.465HisTyr: 2.465 ± 0.845
0.0HisXaa: 0.0 ± 0.0
Ile
4.577IleAla: 4.577 ± 1.013
0.704IleCys: 0.704 ± 0.38
1.761IleAsp: 1.761 ± 0.823
2.817IleGlu: 2.817 ± 1.18
3.169IlePhe: 3.169 ± 1.271
2.817IleGly: 2.817 ± 1.251
1.056IleHis: 1.056 ± 0.675
3.521IleIle: 3.521 ± 2.225
5.634IleLys: 5.634 ± 1.614
4.577IleLeu: 4.577 ± 3.246
2.465IleMet: 2.465 ± 1.055
2.113IleAsn: 2.113 ± 1.116
2.113IlePro: 2.113 ± 1.116
1.761IleGln: 1.761 ± 0.566
1.761IleArg: 1.761 ± 0.566
3.873IleSer: 3.873 ± 1.646
3.169IleThr: 3.169 ± 1.581
1.761IleVal: 1.761 ± 0.823
0.0IleTrp: 0.0 ± 0.0
1.761IleTyr: 1.761 ± 0.673
0.0IleXaa: 0.0 ± 0.0
Lys
5.282LysAla: 5.282 ± 1.506
1.056LysCys: 1.056 ± 0.692
2.113LysAsp: 2.113 ± 1.325
2.465LysGlu: 2.465 ± 1.204
3.169LysPhe: 3.169 ± 1.081
4.225LysGly: 4.225 ± 2.066
2.113LysHis: 2.113 ± 1.14
2.465LysIle: 2.465 ± 0.845
2.113LysLys: 2.113 ± 0.799
7.746LysLeu: 7.746 ± 1.756
0.704LysMet: 0.704 ± 0.38
2.817LysAsn: 2.817 ± 1.109
3.169LysPro: 3.169 ± 0.718
1.761LysGln: 1.761 ± 0.721
3.521LysArg: 3.521 ± 1.455
3.521LysSer: 3.521 ± 0.62
5.282LysThr: 5.282 ± 1.317
3.873LysVal: 3.873 ± 2.09
0.704LysTrp: 0.704 ± 0.38
2.113LysTyr: 2.113 ± 1.536
0.0LysXaa: 0.0 ± 0.0
Leu
7.746LeuAla: 7.746 ± 3.599
2.113LeuCys: 2.113 ± 0.641
5.634LeuAsp: 5.634 ± 1.311
6.338LeuGlu: 6.338 ± 1.734
3.521LeuPhe: 3.521 ± 1.9
8.099LeuGly: 8.099 ± 1.538
2.817LeuHis: 2.817 ± 1.109
4.93LeuIle: 4.93 ± 1.773
4.577LeuLys: 4.577 ± 1.734
8.803LeuLeu: 8.803 ± 2.006
1.408LeuMet: 1.408 ± 0.617
2.113LeuAsn: 2.113 ± 0.793
4.93LeuPro: 4.93 ± 2.202
4.225LeuGln: 4.225 ± 0.919
7.746LeuArg: 7.746 ± 1.227
5.986LeuSer: 5.986 ± 1.522
8.099LeuThr: 8.099 ± 1.834
8.803LeuVal: 8.803 ± 1.459
0.352LeuTrp: 0.352 ± 0.814
1.408LeuTyr: 1.408 ± 0.893
0.0LeuXaa: 0.0 ± 0.0
Met
2.113MetAla: 2.113 ± 0.793
0.352MetCys: 0.352 ± 0.19
1.056MetAsp: 1.056 ± 0.558
1.408MetGlu: 1.408 ± 0.838
0.704MetPhe: 0.704 ± 0.38
1.408MetGly: 1.408 ± 0.76
0.0MetHis: 0.0 ± 0.0
0.704MetIle: 0.704 ± 1.244
1.408MetLys: 1.408 ± 0.76
2.817MetLeu: 2.817 ± 1.311
0.0MetMet: 0.0 ± 0.0
0.704MetAsn: 0.704 ± 0.539
1.408MetPro: 1.408 ± 0.678
1.056MetGln: 1.056 ± 0.57
2.817MetArg: 2.817 ± 0.785
1.408MetSer: 1.408 ± 0.561
0.0MetThr: 0.0 ± 0.0
0.704MetVal: 0.704 ± 0.775
0.352MetTrp: 0.352 ± 0.19
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.521AsnAla: 3.521 ± 2.606
1.056AsnCys: 1.056 ± 1.42
1.056AsnAsp: 1.056 ± 0.516
1.761AsnGlu: 1.761 ± 0.95
3.873AsnPhe: 3.873 ± 1.635
3.521AsnGly: 3.521 ± 2.48
1.408AsnHis: 1.408 ± 0.76
0.704AsnIle: 0.704 ± 0.723
2.113AsnLys: 2.113 ± 0.952
4.93AsnLeu: 4.93 ± 1.014
0.352AsnMet: 0.352 ± 0.19
1.761AsnAsn: 1.761 ± 1.016
1.761AsnPro: 1.761 ± 0.66
1.056AsnGln: 1.056 ± 0.978
2.113AsnArg: 2.113 ± 1.202
2.113AsnSer: 2.113 ± 0.793
1.761AsnThr: 1.761 ± 0.811
3.169AsnVal: 3.169 ± 1.271
0.704AsnTrp: 0.704 ± 0.539
0.704AsnTyr: 0.704 ± 0.38
0.0AsnXaa: 0.0 ± 0.0
Pro
3.169ProAla: 3.169 ± 0.904
1.761ProCys: 1.761 ± 1.473
2.113ProAsp: 2.113 ± 0.829
4.577ProGlu: 4.577 ± 1.259
1.408ProPhe: 1.408 ± 1.214
1.761ProGly: 1.761 ± 0.811
1.408ProHis: 1.408 ± 0.678
2.465ProIle: 2.465 ± 1.748
2.465ProLys: 2.465 ± 1.062
3.521ProLeu: 3.521 ± 1.183
0.0ProMet: 0.0 ± 0.0
0.0ProAsn: 0.0 ± 0.0
3.169ProPro: 3.169 ± 2.285
2.113ProGln: 2.113 ± 2.477
2.817ProArg: 2.817 ± 0.977
3.169ProSer: 3.169 ± 1.59
2.817ProThr: 2.817 ± 0.639
2.113ProVal: 2.113 ± 1.14
0.704ProTrp: 0.704 ± 0.38
1.761ProTyr: 1.761 ± 0.676
0.0ProXaa: 0.0 ± 0.0
Gln
1.408GlnAla: 1.408 ± 1.078
0.0GlnCys: 0.0 ± 0.0
0.704GlnAsp: 0.704 ± 0.38
4.225GlnGlu: 4.225 ± 0.792
1.408GlnPhe: 1.408 ± 0.59
2.465GlnGly: 2.465 ± 0.589
1.408GlnHis: 1.408 ± 0.59
1.408GlnIle: 1.408 ± 0.656
1.408GlnLys: 1.408 ± 0.59
3.873GlnLeu: 3.873 ± 1.338
0.704GlnMet: 0.704 ± 1.132
1.761GlnAsn: 1.761 ± 0.721
1.408GlnPro: 1.408 ± 0.561
0.0GlnGln: 0.0 ± 0.0
1.761GlnArg: 1.761 ± 0.721
2.465GlnSer: 2.465 ± 0.589
1.761GlnThr: 1.761 ± 2.021
1.056GlnVal: 1.056 ± 0.945
0.0GlnTrp: 0.0 ± 0.0
0.704GlnTyr: 0.704 ± 0.775
0.0GlnXaa: 0.0 ± 0.0
Arg
4.225ArgAla: 4.225 ± 2.63
1.761ArgCys: 1.761 ± 1.948
2.465ArgAsp: 2.465 ± 0.589
1.761ArgGlu: 1.761 ± 0.66
4.93ArgPhe: 4.93 ± 0.792
2.465ArgGly: 2.465 ± 0.705
0.704ArgHis: 0.704 ± 0.589
2.817ArgIle: 2.817 ± 1.154
3.169ArgLys: 3.169 ± 0.605
5.282ArgLeu: 5.282 ± 1.344
1.761ArgMet: 1.761 ± 0.95
2.465ArgAsn: 2.465 ± 1.069
1.408ArgPro: 1.408 ± 1.97
1.761ArgGln: 1.761 ± 1.651
4.577ArgArg: 4.577 ± 2.197
3.873ArgSer: 3.873 ± 1.561
2.465ArgThr: 2.465 ± 0.713
2.817ArgVal: 2.817 ± 1.154
0.704ArgTrp: 0.704 ± 0.38
2.465ArgTyr: 2.465 ± 0.945
0.0ArgXaa: 0.0 ± 0.0
Ser
4.93SerAla: 4.93 ± 0.644
2.113SerCys: 2.113 ± 1.14
6.69SerAsp: 6.69 ± 2.057
5.282SerGlu: 5.282 ± 0.732
3.521SerPhe: 3.521 ± 0.833
5.986SerGly: 5.986 ± 1.565
2.817SerHis: 2.817 ± 0.729
3.521SerIle: 3.521 ± 1.261
4.93SerLys: 4.93 ± 1.631
4.577SerLeu: 4.577 ± 1.006
1.408SerMet: 1.408 ± 0.596
4.225SerAsn: 4.225 ± 1.933
1.761SerPro: 1.761 ± 0.752
2.465SerGln: 2.465 ± 0.945
2.817SerArg: 2.817 ± 0.729
6.69SerSer: 6.69 ± 1.467
5.282SerThr: 5.282 ± 2.582
3.521SerVal: 3.521 ± 1.793
0.352SerTrp: 0.352 ± 0.674
1.056SerTyr: 1.056 ± 0.675
0.0SerXaa: 0.0 ± 0.0
Thr
2.817ThrAla: 2.817 ± 0.91
2.465ThrCys: 2.465 ± 2.633
2.113ThrAsp: 2.113 ± 1.179
2.817ThrGlu: 2.817 ± 1.122
7.394ThrPhe: 7.394 ± 1.484
3.873ThrGly: 3.873 ± 2.144
2.113ThrHis: 2.113 ± 0.829
3.169ThrIle: 3.169 ± 1.613
3.873ThrLys: 3.873 ± 1.707
5.634ThrLeu: 5.634 ± 2.33
1.408ThrMet: 1.408 ± 0.76
2.817ThrAsn: 2.817 ± 1.068
2.113ThrPro: 2.113 ± 1.14
1.056ThrGln: 1.056 ± 0.692
2.465ThrArg: 2.465 ± 2.738
4.577ThrSer: 4.577 ± 2.578
2.113ThrThr: 2.113 ± 1.385
2.113ThrVal: 2.113 ± 0.799
0.704ThrTrp: 0.704 ± 0.907
1.408ThrTyr: 1.408 ± 0.76
0.0ThrXaa: 0.0 ± 0.0
Val
5.282ValAla: 5.282 ± 1.459
3.873ValCys: 3.873 ± 1.251
3.169ValAsp: 3.169 ± 1.28
3.873ValGlu: 3.873 ± 1.06
3.873ValPhe: 3.873 ± 1.446
4.93ValGly: 4.93 ± 1.085
2.113ValHis: 2.113 ± 1.725
2.465ValIle: 2.465 ± 0.589
3.521ValLys: 3.521 ± 0.963
6.338ValLeu: 6.338 ± 1.353
0.704ValMet: 0.704 ± 0.38
2.465ValAsn: 2.465 ± 0.589
2.113ValPro: 2.113 ± 0.545
2.113ValGln: 2.113 ± 0.74
2.465ValArg: 2.465 ± 1.615
4.93ValSer: 4.93 ± 0.865
3.521ValThr: 3.521 ± 1.97
5.634ValVal: 5.634 ± 2.186
0.0ValTrp: 0.0 ± 0.0
1.761ValTyr: 1.761 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
0.352TrpAla: 0.352 ± 0.622
0.352TrpCys: 0.352 ± 0.19
0.704TrpAsp: 0.704 ± 0.38
0.704TrpGlu: 0.704 ± 0.589
0.704TrpPhe: 0.704 ± 0.38
0.704TrpGly: 0.704 ± 1.143
0.352TrpHis: 0.352 ± 0.19
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.408TrpLeu: 1.408 ± 0.76
0.0TrpMet: 0.0 ± 0.0
1.056TrpAsn: 1.056 ± 0.516
0.352TrpPro: 0.352 ± 0.814
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.352TrpSer: 0.352 ± 0.622
0.0TrpThr: 0.0 ± 0.0
0.352TrpVal: 0.352 ± 0.19
0.0TrpTrp: 0.0 ± 0.0
0.352TrpTyr: 0.352 ± 0.674
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.761TyrAla: 1.761 ± 1.436
2.113TyrCys: 2.113 ± 0.864
0.704TyrAsp: 0.704 ± 0.38
1.056TyrGlu: 1.056 ± 0.57
1.761TyrPhe: 1.761 ± 0.95
1.056TyrGly: 1.056 ± 0.558
0.352TyrHis: 0.352 ± 0.19
1.761TyrIle: 1.761 ± 0.823
2.113TyrLys: 2.113 ± 0.769
2.465TyrLeu: 2.465 ± 0.845
0.352TyrMet: 0.352 ± 0.19
1.408TyrAsn: 1.408 ± 0.933
1.408TyrPro: 1.408 ± 0.59
1.056TyrGln: 1.056 ± 0.558
1.761TyrArg: 1.761 ± 0.673
2.113TyrSer: 2.113 ± 1.14
3.169TyrThr: 3.169 ± 0.605
2.113TyrVal: 2.113 ± 0.793
0.352TyrTrp: 0.352 ± 0.19
0.704TyrTyr: 0.704 ± 0.38
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2841 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski