Acessibilidade / Reportar erro

Automatic smoothing by optimal splines

Abstracts

We propose a method that is capable to filter out noise as well as suppress outliers of sampled real functions under fairly general conditions. From an a priori selection of the number of knots that define the adjusting spline, but not their location in that curve, the method automatically determines the adjusting cubic spline in a least-squares optimal sense. The method is fast and easily allows for selection of various possible number of knots, adding a desirable flexibility to the procedure. As an illustration, we apply the method to some typical situations found in geophysical problems.

Smoothing; cubic splines


Propomos um método capaz de filtrar ruído bem como suprimir outliers de uma função real amostrada, impondo praticamente nenhuma condição. A partir de uma seleção a priori da quantidade de nós que definirão a spline, porém não suas localizações, o método automaticamente determina a spline cúbica que melhor ajusta os dados, no sentido de quadrados mínimos. Dado que o método é rápido, torna-se viável a realização de vários ajustes com quantidades distintas de nós, conferindo assim a flexibilidade desejada ao processo. Como exemplo, aplicamos o método a alguns problemas em Geofísica.

suavização; spline cúbica


Automatic smoothing by optimal splines

Ricardo BilotiI; Lúcio T. SantosII; Martin TygelIII

IUniversidade Federal do Paraná - Departamento de Matemática Centro Politécnico, Jd. das Américas - Cx.P. 19081 - CEP 81531-980, Curitiba, PR - Fone/fax: (41) 361 3400 - e-mail: biloti@mat.ufpr.br

IIDepartamento de Matemática Aplicada - IMECC - UNICAMP - Caixa Postal 6065 - 13081-970 Campinas (SP) - Fax. (19) 3289-5766 - Tel. (19) 3788-5975 - e-mail: lucio@ime.unicamp.br

IIIUNICAMP – IMECC - Praça Sergio Buarque de Holanda, 651 - Cidade Universitária - Barão Geraldo - Caixa Postal: 6065 - Campinas - São Paulo - 13083-859 - e-mail: tygel@unicamp.br

ABSTRACT

We propose a method that is capable to filter out noise as well as suppress outliers of sampled real functions under fairly general conditions. From an a priori selection of the number of knots that define the adjusting spline, but not their location in that curve, the method automatically determines the adjusting cubic spline in a least-squares optimal sense. The method is fast and easily allows for selection of various possible number of knots, adding a desirable flexibility to the procedure. As an illustration, we apply the method to some typical situations found in geophysical problems.

Keywords: Smoothing, cubic splines

RESUMO

Propomos um método capaz de filtrar ruído bem como suprimir outliers de uma função real amostrada, impondo praticamente nenhuma condição. A partir de uma seleção a priori da quantidade de nós que definirão a spline, porém não suas localizações, o método automaticamente determina a spline cúbica que melhor ajusta os dados, no sentido de quadrados mínimos. Dado que o método é rápido, torna-se viável a realização de vários ajustes com quantidades distintas de nós, conferindo assim a flexibilidade desejada ao processo. Como exemplo, aplicamos o método a alguns problemas em Geofísica.

Palavras-chave: suavização, spline cúbica

INTRODUCTION

In experimental sciences we are often required to represent a set of measured data in the form of a smooth curve from which desirable parameters or attributes are to be extracted. A common problem is the presence of noise in the data. In the literature we find several methods that try to filter and/or smooth the data. Many of them provide us, after application, a different sampled dataset that, following some criteria, can be seen as smoother than the original one. We can also think of an interpolation approach, where the noisy data is replaced by corresponding points that belong to an interpolating function.

The natural question is how should we choose the "interpolating points" from the data so as to construct the desired smoothing function. Normally, these points are extracted from the data, in a regular fashion or manually selected.

We propose a method that optimally selects points to define a cubic spline that best represents the data in the least-squares sense. An interesting feature of the method is that the knot points are no longer required to belong to the original data set. As we will see below, the method is suitable, not only for smoothing, but also for discarding outliers. The method is designed to handle datasets composed by samples of rather complicated real functions. It is to be stressed that, by construction, the obtained function is naturally smooth up to second-order derivative.

The proposed method is applied to two important problems in geophysics. The first problem is to recover horizons as part of a macrovelocity model inversion from multi-coverage seismic data and to smooth seismic traveltime attributes (BILOTI; SANTOS, 2002). The second application refers to smoothing well-log data for anomaly detection and inversion purposes.

FORMULATION

Consider a noisy data W = {(xj, yj) Î IR2 |j = 1,...,M} be the number of interpolating points and G = {(Xi, Yi) Î IR2 | |Xi-1 < Xi, i= 1,..., N} be the set of these points that defines the sought-for cubic spline. To obtain the best set G, in the least-squares sense, we must solve the 2N-variable problem

To solve this problem, we have employed the optimization solver called GENCAN proposed by Birgin and Martínez (2002). GENCAN is an active-set method for smooth box-constrained minimization. The algorithm combines an unconstrained method, including a line search which aims to add many constraints to the working set at a single iteration, with a recently introduced technique (spectral projected gradient) for dropping constraints from the working set. As usual, the optimization process needs an initial approximation. For this purpose, we chose the initial set as composed by N regularly sampled pairs on the originally given set W, that is, (Xi, Yi)=(xj,yj), with j =ë1 +(i-1). .(M–1)/(N-1)û , for i=1,...,N, where ëxû denotes the greater integer less than or equal to x.

Note that we have not made any consideration on how to choose the number, N of interpolating points. The method is designed to automatically find, in the least-squares sense, the best cubic spline for the specified number of knots N. Of course, for a small number of points N, the obtained spline will not be able to represent more than the general trend of the curve. On the other extreme, for large values of N, the spline will tend to fit even the outliers. Since the method is fast, it is reasonable to estimate the cubic spline for several choices of N. This flexibility can be very useful to the user or interpreter, in the sense that a number of inexpensive trials can be implemented before a final decision on which level of smoothness is the best choice for the problem.

APPLICATIONS

We now illustrate the application of the proposed method to some common practical situations. We start by testing the ability of the method to smooth a sequence of the four datasets of increasing difficulty, shown in Figure 1. We next apply the procedure to two problems related to seismic imaging and inversion, shown if Figures 2 and 3, respectively.





General Situations

Figure 1(a) shows that the method efficiently handles and removes white noise. In the next example (Figure 1(b)), besides the noise, some outliers were added. Again, the optimized cubic spline represents the data very well. In Figure 1(c), we see that the obtained curve is able to well describe abrupt variations of the data. Finally, in Figure 1(d), we see that the method is robust enough to provide good results even in the presence of discontinuities. Note that, in particular, there are no Runge effects near the discontinuities.

HORIZON RECONSTRUCTION

We present the results of an algorithm for reconstruction of interfaces and inversion of attributes from 2D-multi-coverage seismic data. As reported in Biloti et al. (2001), the procedure has been successfully applied to invert a layered macro-velocity model from the data. Figure 2(a) depicts the inverted model, where the estimated interfaces (solid lines) were approximated by conventional cubic splines (constructed by selection of knots among the sampled set). In Figure 2(b), we can see the improvement of the inverted model, when the interfaces where constructed upon the application of the proposed technique.

WELL-LOG ANALYSIS

Well logs play the important role of linking rock parameters to seismic data. As an example, impedance functions derived from well logs are generally used for identification and characterization of reservoir anomalies. Well data (e.g., P-and S-velocities and density) are, in general, very noisy, so it may be desirable to consider the parameters as smooth functions of depth (or time). Figure 3 shows the application of the method to smooth a couple real data well logs. On the top of the figure, the new method was used with N = 20 knots. On the bottom of the figure, the number of knots was N = 30.

CONCLUSIONS

We presented an automatic method for smoothing and outlier suppression of data sets that consist of sampled real function points. After an a priori selection of the number of knots, the procedure automatically finds the location of the interpolating points, in such a way that the resulting smoothing function (a cubic spline) is optimal in the least-squares sense. As the method is fast, it allows the user to apply the procedure to different numbers of knots, so as to choose the degree of smoothness that best fits the data. A particular feature of the method is its ability to adjust to abrupt discontinuities on the data.

The few illustrations presented in the text show a wide applicability of the method. As further applications, the method could be useful for purposes such as tomographic inversion and automatic picking.

Acknowledgements

This work was kindly supported by CNPq & FAPESP (Brazil) and the sponsors of the Wave Inversion Technology (WIT) Consortium, Karlsruhe, Germany.

Submetido em 15, agosto, 2003/Aceito em 31, março, 2004

Submited August 1, 2003/Accepted March 31, 2004

NOTES ABOUT THE AUTHORS

Ricardo Biloti é graduado em Matemática Aplicada (UNICAMP/1995). Mestre em Matemática Aplicada (UNICAMP/1998). Doutor em Matemática Aplicada (UNICAMP/2001). Em 2001, passou dois meses em visita ao Instituto de Geofísica da Universidade de Karlsruhe, Alemanha. Desde 2002 é Professor Adjunto no Departamento de Matemática da Universidade Federal do Paraná, atuando no grupos de Computação Científica. Otimização. Suas áreas de atuação concentram-se em Análise Numérica e aplicações à Geofísica. Seus interesses profissionais incluem processamento sísmico multiparamétrico e estimação de modelos de velocidade. É membro da SEG, SBMAC e SIAM.

Lúcio T. Santos é graduado em Matemática Aplicada (UNICAMP/1982), Mestre em Matemática Aplicada (UNICAMP/1985), Doutor em Engenharia Elétrica (UNICAMP/1991) e Livre-Docente em Otimização (UNICAMP/1999). De 1985 a 1988 foi Professor Assistente na USP/São Carlos e desde 1998 é professor no Departamento de Matemática Aplicada do IMECC/UNICAMP, atuando nos grupos de Geofísica Computacional e Otimização. De Agosto/1994 a Setembro/1995 fez Pós-Doutorado na Rice University (EUA) e durante Janeiro/Fevereiro de 1998, 1999 e 2001 foi Professor Visitante no Instituto de Geofísica da Universidade de Karslruhe (Alemanha). Seus interesses profissionais incluem modelamento e imageamento sísmicos e otimização não linear. Suas pesquisas recentes envolvem novas aproximações para o coeficiente de reflexão P-P, atributos alternativos para a análise de AVO, expressões multiparamétricas para tempos de trânsito e métodos de diferenças finitas para a equação Iocal. É membro da SBGf e da SEG.

Martin Tygel é Formado em Física (UERJ/1969), Mestre em Matemática (PUC-Rio/1973) e PhD em Matemática (Stanford University/ 1979). É Professor da UNICAMP desde 1984, tendo lecionado na UFRN (Natal/1979-1980), PPPG-UFBa (Salvador/1981-1983) e Instituto de Geofísica da Universidade de Karlsruhe (Alemanha) em 1990. É detentor dos prêmios Schlumberger (EAGE/2002) e Zeferino Vaz (UNICAMP/1997 e 2003). É fundador do Laboratório de Geofísica Computacional da UNICAMP. O Laboratório é um dos membros do Consórcio Wave Inversion Technology (WIT), com sede em Karlsruhe. É consultor da Petrobras desde 1991 em processamento e imageamento sísmicos. É Editor Associado da Revista Geophysics (SEG).

  • BILOTI, R.; SANTOS, L. T.; TYGEL, M. Layered velocity model from kinematic. In: INTERNATIONAL CONGRESS OF THE BRAZILIAN GEOPHYSICAL SOCIETY, 7., 2001, [S.l.]. Proceedings [S,l.: s.n.], p. 1055-1058, 2001. v. 2.
  • ______; ______. Multiparametric traveltime inversion. Stud. Geophys. Geod., [S.l.], v. 46, p. 177-192, 2002.
  • BIRGIN, E. G.; MARTÍNEZ, J. M. Large-scale active-set box-constrained optimization method with spectral projected gradients. Computational Optimization and Applications, [S.l.], v. 23, p. 101-125, 2002.

Publication Dates

  • Publication in this collection
    24 Jan 2007
  • Date of issue
    June 2003

History

  • Accepted
    31 Mar 2004
  • Received
    15 Aug 2003
Sociedade Brasileira de Geofísica Av. Rio Branco, 156, sala 2510, 20043-900 Rio de Janeiro RJ - Brazil, Tel. / Fax: (55 21) 2533-0064 - São Paulo - SP - Brazil
E-mail: sbgf@sbgf.org.br