Design and Modeling of Optoelectronic Photocurrent Reconfigurable (OPR) Multifunctional Logic Devices (MFLD) as the Universal Circuitry Basis for Advanced Parallel High-Performance Processing

One of the problems in high speed computing is the limited capabilities of communication links in digital high performance electronic systems. Too slow and too few interconnects be‐ tween VLSI circuits cause a bottleneck in the communication between processor and memo‐ ry or, especially in multiprocessor systems, among the processors. Moreover, the problem is getting worse since the increasing integration density of devices like transistors leads to a higher requirement in the number of necessary channels for the off-chip communication. Hence, we are currently in a situation, which is characterized by too few off-chip links and too slow long on-chip lines, what is described as the interconnect crisis in VLSI technology [1]. More than ten years the use of optical interconnects is discussed as an alternative to solve the mentioned problems on interconnect in VLSI technology [2]. A lot of prototypes and demonstrator systems were built to prove the use of optics or optoelectronics for offchip and on-chip interconnects [3]. The possibilities of current VLSI technology would allow integrating a massively-parallel array processor consisting of a few hundred thousand sim‐ ple processor elements (PEs) on a chip. Unfortunately it would be a huge problem to ar‐ range several of such PE arrays one after the other in order to realize a highly–parallel superscalar and super-pipelined architecture as well as an efficient coupling to a memory chip. The reason for these difficulties is the not sufficient number of external interconnects to move high data volumes from and to the circuits. In optoelectronic VLSI one tries to solve limitation problem by realizing external interconnects not at the edge of a chip but with ar‐


Introduction
One of the problems in high speed computing is the limited capabilities of communication links in digital high performance electronic systems.Too slow and too few interconnects between VLSI circuits cause a bottleneck in the communication between processor and memory or, especially in multiprocessor systems, among the processors.Moreover, the problem is getting worse since the increasing integration density of devices like transistors leads to a higher requirement in the number of necessary channels for the off-chip communication.Hence, we are currently in a situation, which is characterized by too few off-chip links and too slow long on-chip lines, what is described as the interconnect crisis in VLSI technology [1].More than ten years the use of optical interconnects is discussed as an alternative to solve the mentioned problems on interconnect in VLSI technology [2].A lot of prototypes and demonstrator systems were built to prove the use of optics or optoelectronics for offchip and on-chip interconnects [3].The possibilities of current VLSI technology would allow integrating a massively-parallel array processor consisting of a few hundred thousand simple processor elements (PEs) on a chip.Unfortunately it would be a huge problem to arrange several of such PE arrays one after the other in order to realize a highly-parallel superscalar and super-pipelined architecture as well as an efficient coupling to a memory chip.The reason for these difficulties is the not sufficient number of external interconnects to move high data volumes from and to the circuits.In optoelectronic VLSI one tries to solve limitation problem by realizing external interconnects not at the edge of a chip but with ar-rays of optical detectors and light emitters which send and receive data directly out from the chip area.Honeywell has developed such devices with VCSEL diodes (vertical surface emitting laser diodes) and metal -semiconductor -metal photo-detectors in research project [4].This allows the realization of stacked 3-D chip architecture in principle.The main problems are not the manufacturing and operating of single devices but the combination of different passive optical elements with active optoelectronic and electronic circuits in one system.This requires sophisticated mounting and alignment techniques which allow low mechanical tolerances and the handling of thermal problems.At present the situation for smart detector circuits is much easier.They can be regarded as a subset of OE-VLSI circuits because they consist only of arrays of photo-detectors with corresponding evaluation circuit for analogue to digital converting.Optical detectors based on PN or PIN photodiodes can be monolithically integrated with digital electronics in silicon what simplifies the design enormously compared with OE-VLSI circuits that in addition contain sender devices realized in GaAs technologies.Furthermore smart detector circuits can be manufactured in nearly every semiconductor fabric.Smart detectors or smart optical sensors show a great application field and market potential.Therefore our approach favors a smart pixel like architecture combining parallel signal detection with parallel signal processing in one circuit.Each pixel has its own PE what guarantees the fastest processing.
The strategic direction of solution of various scientific problems, including the problem of creation of artificial intelligence (AI) systems, human brain simulators, robotics systems, monitoring and control systems, decision-making systems, as well as systems based on artificial neural networks, etc., becomes fast-acting and parallel processing of large2-D arrays of data (up to 1024x1024 and higher) using non-conventional computational systems, corresponding matrix logics (multi-valued, signed-digit, fuzzy logics, continuous, neural-fuzzy and others) and corresponding mathematical apparatus [5][6][7][8][9][10][11].For numerous perspective realizations of optical learning neural networks (NN) with two dimensional structure [5], of recurrent optical NN [6], of the continuous logic equivalency models (CLEM) NN [7][8][9][10], the elements of matrix logic are required, and not only of two-valued property, threshold, hybrid but also continuous, neural-fuzzy logics and adequate structure of vector-matrix computational procedures with basic operations of above-mentioned logics.Optic and optoelectronic technologies, methods and principles as well as corresponding element base provide attractive alternative for 2D data processing.These technologies and methods successfully decide problems of parallelism, input-output and interconnections.Advanced nontraditional parallel computing structures and systems, including neural networks, require both parallel processing and parallel information input/output.At the same time there are many new approaches that are based on new logics (neural-fuzzy, multi-valued, continuous etc.).The using of the standard sequential algorithms based on a few operations makes the approaches long-running.But only a few of them [12] can be used for processing of 2D data and perform wide range of needed arithmetic and logic operations).Generalization of scalar two-valued logic on matrix case has led to intensive development of binary images algebra (BIA) [13] and 2D Boolean elements for optic and optoelectronic processors [12][13][14][15][16][17].
Taking into consideration the above-described approach, consisting in universality, let us recollect some known facts regarding the number of functions.The number of Boolean functions of n variables in algebra of two-valued logic (TVL), which is also Boolean algebra, equals2 2 n .In this TVL there are N 2 = 2 n atoms, which are minterms.Functions of n variables k-valued logic (k>2) are reflectionsA n → A, where A={0, 1,... k-1}, and the number of func- , in which any function on any set of arguments takes the value of one of the arguments or its negation, is finite.In this case the number N ∧ (n) of functions of n argu- ments increases with increase of n very rapidly [4]:N ∧ (0) = 2 ;N ∧ (1) = 6;N ∧ (2) = 84;N ∧ (3) = 43918.
We would like to draw the attention to the fact, that both natural neurons and their more complex physical and mathematical models suggest discrete-analog and purely analog means for information processing with different level of accuracy, with the possibility of rearrangement of chosen coding system.This, in its turn, requires corresponding image neuron circuit engineering with programmable logic operations, with transition from analog to discrete processing, to storing etc.
Thus, the search of means aimed at construction of elements, especially universal (at least quasi-universal or multifunctional) with programmable tuning, able to perform not only operations of two-valued logic, but other matrix (multi-valued, continuous, neural-fuzzy, etc.) logic operations is very actual problem [15].One of promising directions of research in this sphere is the application of time-pulse-coded architectures (TPCA) that were considered in works [18][19][20].These architectures were generalized in [11], taking into account basic possible approaches as well as system and mathematical requirements.The time-pulse representation of matrix continuous-logic variables by two-level optic signals not only permits to increase functional possibilities (up to universality), stability to noise, stability and decrease requirements regarding alignment and optical system, but also simplify control circuits and adjustment circuits to required function, operation, and keep untouched the whole methodological basis of such universal elements construction, irrespective of valuedness of a logic and type of a logic.
But there is another approach based on the use of universal logic elements with the structure of multiple-input multiple-output (MIMO) and time-pulse coding.We call such elementsthe elements of picture type (PT).At increase of number of input operands and valuedness of logic (up to continuous) the number of executable functions also increases by the exponential law.This property allows simplifying operation algorithms of such universal optoelectronic logical elements and hence to raise information processing speed.Most general conceptual approaches to construction of universal picture neural elements and their mathematical rationales were presented in paper [11].But those were only system and structural solutions that is why they require further development and perfection.Mathematical and other theoretical fundamentals of design of matrix multi-functional logical devices with fast acting programmable tuning were considered in paper [19], where expediency of functional basis unification, that is promising for optoelectronic parallel-pipeline systems (OEPS) with command-flow 2D-page (picture) organization [20], necessity in arrays of optic or optoelectronic triggers (memory elements) of picture type for storage of information and controlling adjusting operands as well as perspective principles of presentation and coding of multi-valued matrix data (spatial, time-pulse and spectral) were shown.Besides, the analysis of various algebra logics [11,19,[21][22][23][24] for functional systems of switching functions, in spite of their diversity allows us to suggest a very useful idea, in our opinion, that lies in following.
It is possible to create more sophisticated problem-oriented processors, in which the specific time-pulse operands encoding and only elements of two-valued logic are used, which will realize functions of different logics, continuous etc. Taking into account the universality, parallel information processing of the universal elements and the use of only two-valued logic elements for implementation of all other operations the approach is a very promising.
That is why the aim of the given work is to consider the results of design and investigation of optoelectronic smart time-pulse coded photocurrent reconfigurable MFLD as basic components for 2D-array logic devices for advanced neural networks and optical computers.

Picture continuous logic elements (PCLE)
Figure 1 shows the structural diagram of picture neural element (PNE) for computation of all basic matrix-continuous-logic (MCL) operations in matrix quasiBoolean algebra C=((А,В),^, ˇ,-) [11] for which in any set of MCL arguments matrix continuous logic function (MCLF) F takes the value of a subregion of one of the arguments or its supplement.The PE of matrix two-valued logic (MTVL), performing MTVL operations over matrix temporal functions O i t (t) (in point of fact two-valued 2D-operands) realize MCLF over continuous logic variables (CLV) O i t .The time-pulse coding of a grayscale picture is shown in Figure 1.
As it is seen in Figure 2  ¯, complementarya ¯= 1 − a, equivalence, etc [10,11,23].The duration of MTVL formed at the output and as a result of PNE, signal , is CL function of input binary temporal variables durations.
Thus, as it is seen from Figure 3, almost all basic operations of continuous logic, neural-fuzzy logic, that are shown in work [21], can be realized with the help of the time-pulse coding of variablesX 1 ,…,X n and universal (or multifunctional)picture element (UPE) of two-valued logic (TVL).But for that pulse width modulator (PWM) of PT is needed.It is not needed to form contrast-conversion (complementary operand) image for analog picture optic inputs if PWMs PT have complementary outputs.Thus, becomes obvious that for time -pulse coding realization of PNE of matrix-continuous -logic (MCL) with programmable tuning is necessary UPE of TVL or picture MFLD, by means of which continuously -logic operations over time -pulse signals can be realized.In Figure 1 selection of picture logic functions is carried out by electric adjusting signals and all array cells will realize the same function at the same time.For many appendices it is expedient to choose a logic function at each point of the matrix processor, and therefore there is a desire to make management and tuning also in the form of optical matrix operands.It essentially expands functionality of such processors and MFLD on which basis they are realized.
In work [25] MFLD of two-valued logic (TVL) on current mirrors, photodiodes and LEDs with schemes of their drivers are described and simulated.They are relatively difficult as contain four current mirrors (CM), four schemes ХОR, four elements АND and one logic element OR.In the same work different optoelectronic circuitry were offered on base of 2-4 CM and one photo diode, realizing the Boolean operations AND, NOT, OR, NOR, et al with potential and current outputs.They are based on threshold elements, comparators of currents (photocurrents) on current mirrors and circuitry of limited subtraction (CLS).Such base elements also were used for realization of other elements of continuous logic, including operations equivalence (nonequivalence) and etc. [21,26,27].Therefore developing further this approach we use for design of the OPR MFLD.The diagrams in Figures 6b, 6c, 6d, similar to Figure 6a shows the corresponding input and output currents of the circuit.The difference lies in the different modes for different input pulse durations and the presence of additional power consumption graphics.In Figure 7а dependence of power consumption of OPR MFLD-1 from I 0 and supply voltage is shown, and in Figure 7b dependence of t preset and t fronts from I 0 = I max is shown.From them it is visi- ble, that the power consumption of OPR MFLD-1 P drain (without drivers and output part) is about 0.1-2.5mW.If to take into account that the currents of LEDs must (taking into account the coefficient of transformation and sensitiveness of photo-detectors PD) to be at least in 5÷10 times more, the P drain will increase in 2÷4 times.But, for example, atI 0 = 10μА, the power consumption will be P drain ≤ 4÷5mW.At currents 1÷3µA it decreases to 1mW.Delay time is no more than 50÷100 ns, and the period T of time pulse processed signals go into in a microsecond range 1÷16 µs.If to use not 1.5µm technologies CMOS transistors, but more ad-vanced, that is possibility to receive processing time T at level 1÷10ns, i.e. to raise productivity of one channel OPR MFLD-1 to 10 8 -10 9 CL-logic operations/sec.
We tested experimentally the circuit for all functions that it can implement.The experiments confirm the implementation of all theoretically possible functions in a wide range of voltages, currents and operating periods of treatment.But given the size limitations of article, here we do not present all results and charts.

Modeling of array of the OPR MFLD-1 with MathCAD
Modeling results of the OPR MFLD-1 with MathCAD which confirm normal functioning of OPR MFLD-1 for all 16 possible functions of binary logic and corresponding functions of continuous logic are shown in figure 8-11.Two inputs 2D operands XA and XB (Figure 8) with dimensional of 32x32 pixels are transformed to XAR and XBR by multiplication of one pixel to 2x2 pixels.Matrixes XAR, XBR have dimensional of 64x64 pixels.Matrix SAB is formed as sum of AXR, BXR and OP.Threshold processing is done over elements of SAB matrix and matrix QSAB is formed: The threshold value tr =3.Four subpixels are united to one pixel with formula and output matrix ESAB is formed.
For more detailed consideration fragments AP, BP, OPP, OSP, OQP, QSP with dimensional of 2x2 subpixels or 4x4 pixels from matrixes AXR, BXR, OP, SAB, UQSAB, ESAB are shown in Figure 10.The fragments are shown as matrixes and images.For conventional presentation of the images in MathCAD the matrixes are multiplied by 80. Output of equivalence operation is QSP with dimensional of 2x2, but for OPR MFLD correct operation matrixes QSAP and QABP with dimensional of 4x4 are used.The second circuit variant is shown in Figure 12.It differs from the previously discussed first variant that the input optical signals from each of the i,j-th base cell of two picture operands are fed to a photo-detector.One of the picture input using the appropriate shadow mask weakens the signals of one of the operands is a factor of 2. Therefore, the first unit of the circuit consists of current comparators, which convert the output voltages into a digital form that is uniquely appropriate input situation.If change of the vector set to {0, 0, 1, 0} than there is a signal at the output which differs only in the short false pulses.Change of durations of the input signals at the same vector set provides the desired signal at the output (see Figure 15).This confirms the correct operation of the scheme.
Figure1shows the structural diagram of picture neural element (PNE) for computation of all basic matrix-continuous-logic (MCL) operations in matrix quasiBoolean algebra C=((А,В),^, ˇ,-)[11] for which in any set of MCL arguments matrix continuous logic function (MCLF) F takes the value of a subregion of one of the arguments or its supplement.The PE of matrix two-valued logic (MTVL), performing MTVL operations over matrix temporal functions O i t (t) (in point of fact two-valued 2D-operands) realize MCLF over continuous logic variables (CLV) O i t .The time-pulse coding of a grayscale picture is shown in Figure1.As it is seen in Figure2at each point of picture output of PNE, MCL can be performed over continuous logic variables (CLV) O 1 ijT ,…O n=2 ijT , presented by t 1 ij ,...t n ij durations of time pulse signals, during each interval T one of the following operations of CL: min(a,b), max(a,b), mod(a-b), mod(a − b)¯, complementarya ¯= 1 − a, equivalence, etc[10,11,23].The du-

Figure 2 .
Figure 2. Time diagrams of CL operation fulfillment by means of time-pulse CL variables

Figure 7 .
Figure 7. а) Dependence of the power consumption from supply voltage and input current range; (b) dependence of time delay and fronts from supply voltage and input current values If cells of the MFLD-1 with P drain = 1÷5mW are integrated into array of 32x32 elements or more, the general productivity of such array OPR MFLD-1 will reach 10 12 CL-logic operations/sec.A modified variant of OPR MFLD-1 in which signals у1¸у4 are realized on current generators with possibility of their programming is also offered.Besides, if the array of cells MFLD-1 realizes the same function it is possible to choose signals with sample corresponding nodes (PD-CM) i .The problem of simplification of the optical system is decided in this case.Because it is necessary to give signals not from three optical apertures, but only from two apertures on the OPR MFLD-1 chip.

Figure 8 .
Figure 8. Simulation results of forming and processing processes using OPR MFLD-1 Four matrixes M1÷M4 are formed with formulas shown in Figure 9.These matrixes are used for selection of one subpixel of four pixels of XAR and XBR.Matrixes AXR and BXR are formed after XAR and XBR by elementwise non-equivalence (⊕) operation on matrixes MA and MB.Tuning 2D operand OP is formed by matrixes M1÷M4 and scalar tuning signals oy1÷oy4 or by signals y1÷y4.

Figure 9 .
Figure 9. Transformations formulas for matrixes, tuning operand OP formation and additions

Figure 10 .
Figure 10.Simulation results of four base cells (2х2 subpixel) of matrix OPR MFLD-1 (function NXOR -EQ) Examples of other functions realizations with the OPR MFLD-1 as fragments of images are shown in Figures 11

Figure 12 .
Figure 12. Circuit diagram of the base cell for the OPR MFLD-2 (the second version) With the help of nodes in the current voltage conversion and control signals Y0-Y3 at the output node is formed by the resulting signal as a current, which corresponds to the selected desired logic function.The set of possible logical set of vector signals Y0-Y3 has 16 possible combinations.Selecting one of them allows you to implement any 16 of possible two-valued logic of binary operations.If the input signals are continuous in the time-pulse coded form, selecting the desired operation as a two-valued logic, such as AND, the operation MIN is implemented from time-pulse encoded signals.For the first model experiments in the scheme of an input photo-sensor used two of the current source to set the time of the input time-pulse signals (TPS).Instead of photo detectors are used to control the function of the sources of Y0 ÷ Y3 current.The reference currents are shown as current sources for simplicity.The current sources can be implemented on the same transistors or may be given by means of optical signals with fixed intensity.For the formation of the amplified output cur-

Figure 13 .
Figure 13.The results of modeling of the base cell for second version of OPR MFLD-2 for implementation of function non-equivalence of continuous logic (CL) based on XOR TVLExperiments have shown that the power consumption of a cell does not exceed 200÷300µW, delay times and pulse fronts are less than 1 nanosecond, and the basic cell is realized on 44 (or 36) transistors and 11 current sources on 11÷15 transistors.The duration time of pulsecoded signal is in the range of processing cycles, and the pulse period is 100 nanoseconds.This shows that it is possible to increase the frame processing rate to 10 MHz but at the expense of accuracy and complexity of matching photodetectors with current mirrors.Simulation results with OrCAD16.3 of the same basic cell circuit of the OPR MFLD-2 in the mode of implementation of the functions of the nonequivalence CL or XOR TVL are shown in Fig.13.Diagrams that explain the work of OPR MFLD-2 in the implementation of functions of the nonequivalence CL or XOR TVL: Id = 5µA, 3V supply voltage, signal durations t a pulse = 50ns, t b pulse = 80ns.In the first diagram above -the output current signal, the second -two input signals and their weighted sum, the down three: the third, fourth and fifth -currents at the output of the threshold units (green solid) and their complements (blue dashed).It uses vector tuning signals Y= {Y0, Y1, Y2, Y3} = {0, 1, 1, 0}, and the current level is 5 µA.At the output the correct signal is formed ≈ 30 ns duration.The change of the vector set to {0, 1, 0, 0} allows for the output function I22 * NI23 (where NI23 -the complement of the signal I23), as shown in Figure14.For credibility, that the function is implemented correctly, we did a change in the duration of signals, such that the first signal t pulse (I22) = 80ns and t pulse (I23) = 50ns (the signals changed their duration).The results showed that there was a signal at the output, which has a duration ≈30 ns.

Figure 14 .
Figure 14.The diagrams of signals in the circuit with a vector set {0, 1, 0, 0} for the implementation of the function AND(a, b ¯), where a=I22, b=I23

Figure 15 .
Figure 15.The diagrams of signals in the implementation of the function AND (a, ¯b)defined by the vector set {0, 0, 1, 0}, where a=I22, b=I23, t p (I22) = 50ns, t p (I23) = 80ns.In Figure 16 (left) the implementation of the equivalence CL (based on NXOR TVL) is shown.The output signal (the first graph above) has the total duration of 70ns.The operation NOR TVL and on its basis the operation max ¯(a, b) CL, or the same operation min(a ¯, b ¯) CL is shown in Figure 16 (right).Duration of the output signal is 20ns.Signal diagrams for mode of formation of min CL-function (based on AND) are shown in Figure 17.Left on the diagrams shows the control signals of the vector Y= {Y0, Y1, Y2, Y3} = {0, 0, 0, 1}, and the right -signals: output, input and intermediate.As can be seen from the simulations, device successfully implements the desired function when changing the supply voltage from 1,5V

Figure 16 .Figure 17 .
Figure 16.Simulation results of the base cell for second version of OPR MFLD-2 for implementation of function: leftequivalence operation CL (NXOR TVL), right -operation max ¯(a, b) CL (NOR TVL)

Figure 18 .Figure 19 .Figure 20 .
Figure 18.The base cell for OPR MFLD-2 with one input and four control photodiodes Design and Modeling of Optoelectronic Photocurrent Reconfigurable (OPR) Multifunctional Logic Devices (MFLD) as the Universal Circuitry Basis for Advanced Parallel High-Performance Processing http://dx.doi.org/10.5772/54540Formulas for simulation processing with MathCAD are shown in Figure 21.At first, input two 2D operands A1 and B1 and its weighted sum SIAB are formed.The coefficient and threshold t 0 = 10 because the current in the OPR MFLD-2 circuit is 10µA.Contrast complementary images are matrixes AN1 and BN1.After threshold processing by current comparators the direct matrixes T1SIAB, T2SIAB, T3SIAB and matrixes TN1SIAB, TN2SIAB, TN3SIAB of complementary images are formed.Four picture tuning operand NY0 ÷NY3 are formed with tuning vector signals ny0÷ny3.Four logical members SY0÷SY3 are formed using simultaneous threshold and state decoding operations.The sum of those members is the output matrix function NF.All operands dimension is 64x64 elements.All images of above mentioned matrixes and some output functions are shown in Figure 22.

Figure 24 .
Figure 24.Simulation results of the OPR MFLD-2 with MathCAD for contour extraction 1 is called continuous logic (CL) algebra, and the number of CL functions, as reflections C u n → C u depending on the CL algebra can be infinite or finite (the set of reflections is always infinite).CL functions are called only those functions of the setN ∧ , which are realized by formulas.The number N ∧ of CL functions in the most developed CL algebra -quasi-Boolean Cleenee algebra Design and Modeling of Optoelectronic Photocurrent Reconfigurable (OPR) Multifunctional Logic Devices (MFLD) as the Universal Circuitry Basis for Advanced Parallel High-Performance Processing http://dx.doi.org/10.5772/54540